Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation
by
Cardinal, Rudolf N.
, Lewis, Jonathan R.
, Moore, Anna
, Burchell, Martin
in
Agreements
/ Algorithms
/ Analysis
/ Anonyms and pseudonyms
/ Bayes Theorem
/ Bayesian analysis
/ Bayesian probabilistic linkage
/ Bayesian statistical decision theory
/ De-identification
/ Electronic health records
/ Electronic patient records
/ Electronic medical records
/ Epidemiology
/ Gender
/ Health Informatics
/ Homelessness
/ Humans
/ Identity
/ Information Systems and Communication Service
/ Male
/ Management of Computing and Information Systems
/ Matching
/ Mathematical analysis
/ Medical care
/ Medical Record Linkage
/ Medical records
/ Medicine
/ Medicine & Public Health
/ Mental disorders
/ Methods
/ Minority & ethnic groups
/ Open source software
/ Phonetics
/ Privacy
/ Probability
/ Programming languages
/ Pseudonymisation
/ Quality management
/ Regulatory approval
/ Representations
/ Residential areas
/ Software
/ State Medicine
/ Statistical analysis
2023
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation
by
Cardinal, Rudolf N.
, Lewis, Jonathan R.
, Moore, Anna
, Burchell, Martin
in
Agreements
/ Algorithms
/ Analysis
/ Anonyms and pseudonyms
/ Bayes Theorem
/ Bayesian analysis
/ Bayesian probabilistic linkage
/ Bayesian statistical decision theory
/ De-identification
/ Electronic health records
/ Electronic patient records
/ Electronic medical records
/ Epidemiology
/ Gender
/ Health Informatics
/ Homelessness
/ Humans
/ Identity
/ Information Systems and Communication Service
/ Male
/ Management of Computing and Information Systems
/ Matching
/ Mathematical analysis
/ Medical care
/ Medical Record Linkage
/ Medical records
/ Medicine
/ Medicine & Public Health
/ Mental disorders
/ Methods
/ Minority & ethnic groups
/ Open source software
/ Phonetics
/ Privacy
/ Probability
/ Programming languages
/ Pseudonymisation
/ Quality management
/ Regulatory approval
/ Representations
/ Residential areas
/ Software
/ State Medicine
/ Statistical analysis
2023
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation
by
Cardinal, Rudolf N.
, Lewis, Jonathan R.
, Moore, Anna
, Burchell, Martin
in
Agreements
/ Algorithms
/ Analysis
/ Anonyms and pseudonyms
/ Bayes Theorem
/ Bayesian analysis
/ Bayesian probabilistic linkage
/ Bayesian statistical decision theory
/ De-identification
/ Electronic health records
/ Electronic patient records
/ Electronic medical records
/ Epidemiology
/ Gender
/ Health Informatics
/ Homelessness
/ Humans
/ Identity
/ Information Systems and Communication Service
/ Male
/ Management of Computing and Information Systems
/ Matching
/ Mathematical analysis
/ Medical care
/ Medical Record Linkage
/ Medical records
/ Medicine
/ Medicine & Public Health
/ Mental disorders
/ Methods
/ Minority & ethnic groups
/ Open source software
/ Phonetics
/ Privacy
/ Probability
/ Programming languages
/ Pseudonymisation
/ Quality management
/ Regulatory approval
/ Representations
/ Residential areas
/ Software
/ State Medicine
/ Statistical analysis
2023
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation
Journal Article
De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation
2023
Request Book From Autostore
and Choose the Collection Method
Overview
Background
Epidemiological research may require linkage of information from multiple organizations. This can bring two problems: (1) the information governance desirability of linkage without sharing direct identifiers, and (2) a requirement to link databases without a common person-unique identifier.
Methods
We develop a Bayesian matching technique to solve both. We provide an open-source software implementation capable of de-identified probabilistic matching despite discrepancies, via fuzzy representations and complete mismatches, plus de-identified deterministic matching if required. We validate the technique by testing linkage between multiple medical records systems in a UK National Health Service Trust, examining the effects of decision thresholds on linkage accuracy. We report demographic factors associated with correct linkage.
Results
The system supports dates of birth (DOBs), forenames, surnames, three-state gender, and UK postcodes. Fuzzy representations are supported for all except gender, and there is support for additional transformations, such as accent misrepresentation, variation for multi-part surnames, and name re-ordering. Calculated log odds predicted a proband’s presence in the sample database with an area under the receiver operating curve of 0.997–0.999 for non-self database comparisons. Log odds were converted to a decision via a consideration threshold
θ
and a leader advantage threshold
δ
. Defaults were chosen to penalize misidentification 20-fold versus linkage failure. By default, complete DOB mismatches were disallowed for computational efficiency. At these settings, for non-self database comparisons, the mean probability of a proband being correctly declared to be in the sample was 0.965 (range 0.931–0.994), and the misidentification rate was 0.00249 (range 0.00123–0.00429). Correct linkage was positively associated with male gender, Black or mixed ethnicity, and the presence of diagnostic codes for severe mental illnesses or other mental disorders, and negatively associated with birth year, unknown ethnicity, residential area deprivation, and presence of a pseudopostcode (e.g. indicating homelessness). Accuracy rates would be improved further if person-unique identifiers were also used, as supported by the software. Our two largest databases were linked in 44 min via an interpreted programming language.
Conclusions
Fully de-identified matching with high accuracy is feasible without a person-unique identifier and appropriate software is freely available.
Publisher
BioMed Central,BioMed Central Ltd,Springer Nature B.V,BMC
This website uses cookies to ensure you get the best experience on our website.