Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
1,895
result(s) for
"Electronic Data Processing - standards"
Sort by:
Assessment of automated clinical trial recruitment and enrolment using patient-facing technology
by
Lam, Regina
,
Bardach, Naomi S
,
Jasik, Carolyn B
in
Automation
,
Clinical trials
,
Clinical Trials as Topic - methods
2021
ObjectiveInteractive patient care systems (IPCS) at the bedside are becoming increasingly common, but evidence is limited as to their potential for innovative clinical trial implementation. The objective of this study was to test the hypothesis that the IPCS could feasibly be used to automate recruitment and enrolment for a clinical trial.MethodsIn medical-surgical units, we used the IPCS to randomise, recruit and consent eligible subjects. For participants not interacting with IPCS study materials within 48 hours, study staff-initiated recruitment in-person. Eligible study population included all caregivers and any patients >6 years old admitted to medical-surgical units and oncology units September 2015 to January 2016. Outcomes: randomisation assessed using between-group comparisons of patient characteristics; recruitment success assessed by rates of consent; paperless implementation using successful acquisition of electronic signature and email address. We used χ2 analysis to assess success of randomisation and recruitment.ResultsRandomisation was successful (n=1012 randomised, p>0.05 for all between-group comparisons). For the subset of eligible, randomised patients who were recruited, IPCS-only recruitment (consented: 2.4% of n=213) was less successful than in-person recruitment (61.4% of n=87 eligible recruited, p<0.001). For those consenting (n=61), 96.7% provided an electronic signature and 68.9% provided email addresses.ConclusionsOur results suggest that as a tool at the bedside, the IPCS offers key efficiencies for study implementation, including randomisation and collecting e-consent and contact information, but does not offer recruitment efficiencies. Further research could assess the value that interactive technologies bring to recruitment when paired with in-person efforts, potentially focusing on more intensive user-interface testing for recruitment materials.Trial registration numberNCT02491190.
Journal Article
Micapipe: A pipeline for multimodal neuroimaging and connectome analysis
2022
•Micapipe is a comprehensive pipeline to process multimodal MRI data.•Micapipe generates matrices describing cortico-cortical microstructural similarity, functional connectivity, structural connectivity, and spatial proximity.•The pipeline provides easy-to-verify outputs and visualizations for quality control.•Outputs are hierarchically organized with BIDS-conform naming.•Our evaluations show reproducible processing across several 3T and 7T datasets.
Multimodal magnetic resonance imaging (MRI) has accelerated human neuroscience by fostering the analysis of brain microstructure, geometry, function, and connectivity across multiple scales and in living brains. The richness and complexity of multimodal neuroimaging, however, demands processing methods to integrate information across modalities and to consolidate findings across different spatial scales. Here, we present micapipe, an open processing pipeline for multimodal MRI datasets. Based on BIDS-conform input data, micapipe can generate i) structural connectomes derived from diffusion tractography, ii) functional connectomes derived from resting-state signal correlations, iii) geodesic distance matrices that quantify cortico-cortical proximity, and iv) microstructural profile covariance matrices that assess inter-regional similarity in cortical myelin proxies. The above matrices can be automatically generated across established 18 cortical parcellations (100–1000 parcels), in addition to subcortical and cerebellar parcellations, allowing researchers to replicate findings easily across different spatial scales. Results are represented on three different surface spaces (native, conte69, fsaverage5), and outputs are BIDS-conform. Processed outputs can be quality controlled at the individual and group level. micapipe was tested on several datasets and is available at https://github.com/MICA-MNI/micapipe, documented at https://micapipe.readthedocs.io/, and containerized as a BIDS App http://bids-apps.neuroimaging.io/apps/. We hope that micapipe will foster robust and integrative studies of human brain microstructure, morphology, function, cand connectivity.
Journal Article
Quantifying and contextualizing the impact of bioRxiv preprints through automated social media audience segmentation
by
Carlson, Jedidiah
,
Harris, Kelley
in
Academies and Institutes - organization & administration
,
Academies and Institutes - standards
,
Academies and Institutes - statistics & numerical data
2020
Engagement with scientific manuscripts is frequently facilitated by Twitter and other social media platforms. As such, the demographics of a paper's social media audience provide a wealth of information about how scholarly research is transmitted, consumed, and interpreted by online communities. By paying attention to public perceptions of their publications, scientists can learn whether their research is stimulating positive scholarly and public thought. They can also become aware of potentially negative patterns of interest from groups that misinterpret their work in harmful ways, either willfully or unintentionally, and devise strategies for altering their messaging to mitigate these impacts. In this study, we collected 331,696 Twitter posts referencing 1,800 highly tweeted bioRxiv preprints and leveraged topic modeling to infer the characteristics of various communities engaging with each preprint on Twitter. We agnostically learned the characteristics of these audience sectors from keywords each user's followers provide in their Twitter biographies. We estimate that 96% of the preprints analyzed are dominated by academic audiences on Twitter, suggesting that social media attention does not always correspond to greater public exposure. We further demonstrate how our audience segmentation method can quantify the level of interest from nonspecialist audience sectors such as mental health advocates, dog lovers, video game developers, vegans, bitcoin investors, conspiracy theorists, journalists, religious groups, and political constituencies. Surprisingly, we also found that 10% of the preprints analyzed have sizable (>5%) audience sectors that are associated with right-wing white nationalist communities. Although none of these preprints appear to intentionally espouse any right-wing extremist messages, cases exist in which extremist appropriation comprises more than 50% of the tweets referencing a given preprint. These results present unique opportunities for improving and contextualizing the public discourse surrounding scientific research.
Journal Article
IPO: a tool for automated optimization of XCMS parameters
2015
Background
Untargeted metabolomics generates a huge amount of data. Software packages for automated data processing are crucial to successfully process these data. A variety of such software packages exist, but the outcome of data processing strongly depends on algorithm parameter settings. If they are not carefully chosen, suboptimal parameter settings can easily lead to biased results. Therefore, parameter settings also require optimization. Several parameter optimization approaches have already been proposed, but a software package for parameter optimization which is free of intricate experimental labeling steps, fast and widely applicable is still missing.
Results
We implemented the software package IPO (‘Isotopologue Parameter Optimization’) which is fast and free of labeling steps, and applicable to data from different kinds of samples and data from different methods of liquid chromatography - high resolution mass spectrometry and data from different instruments.
IPO optimizes XCMS peak picking parameters by using natural, stable
13
C isotopic peaks to calculate a peak picking score. Retention time correction is optimized by minimizing relative retention time differences within peak groups. Grouping parameters are optimized by maximizing the number of peak groups that show one peak from each injection of a pooled sample. The different parameter settings are achieved by design of experiments, and the resulting scores are evaluated using response surface models. IPO was tested on three different data sets, each consisting of a training set and test set. IPO resulted in an increase of reliable groups (146% - 361%), a decrease of non-reliable groups (3% - 8%) and a decrease of the retention time deviation to one third.
Conclusions
IPO was successfully applied to data derived from liquid chromatography coupled to high resolution mass spectrometry from three studies with different sample types and different chromatographic methods and devices. We were also able to show the potential of IPO to increase the reliability of metabolomics data.
The source code is implemented in R, tested on Linux and Windows and it is freely available for download at
https://github.com/glibiseller/IPO
. The training sets and test sets can be downloaded from
https://health.joanneum.at/IPO
.
Journal Article
Quality Control Methods in Accelerometer Data Processing: Defining Minimum Wear Time
by
Geraci, Marco
,
Cortina-Borja, Mario
,
Dezateux, Carol
in
Accelerometers
,
Accelerometry - methods
,
Accelerometry - standards
2013
When using accelerometers to measure physical activity, researchers need to determine whether subjects have worn their device for a sufficient period to be included in analyses. We propose a minimum wear criterion using population-based accelerometer data, and explore the influence of gender and the purposeful inclusion of children with weekend data on reliability.
Accelerometer data obtained during the age seven sweep of the UK Millennium Cohort Study were analysed. Children were asked to wear an ActiGraph GT1M accelerometer for seven days. Reliability coefficients(r) of mean daily counts/minute were calculated using the Spearman-Brown formula based on the intraclass correlation coefficient. An r of 1.0 indicates that all the variation is between- rather than within-children and that measurement is 100% reliable. An r of 0.8 is often regarded as acceptable reliability. Analyses were repeated on data from children who met different minimum daily wear times (one to 10 hours) and wear days (one to seven days). Analyses were conducted for all children, separately for boys and girls, and separately for children with and without weekend data.
At least one hour of wear time data was obtained from 7,704 singletons. Reliability increased as the minimum number of days and the daily wear time increased. A high reliability (r = 0.86) and sample size (n = 6,528) was achieved when children with ≥ two days lasting ≥10 hours/day were included in analyses. Reliability coefficients were similar for both genders. Purposeful sampling of children with weekend data resulted in comparable reliabilities to those calculated independent of weekend wear.
Quality control procedures should be undertaken before analysing accelerometer data in large-scale studies. Using data from children with ≥ two days lasting ≥10 hours/day should provide reliable estimates of physical activity. It's unnecessary to include only children with accelerometer data collected during weekends in analyses.
Journal Article
An integrated approach to processing WHO-2016 verbal autopsy data: the InterVA-5 model
by
Davies, Justine
,
Hussain-Alkhateeb, Laith
,
Petzold, Max
in
Adult
,
Adults
,
Afghanistan - epidemiology
2019
Background
Verbal autopsy is an increasingly important methodology for assigning causes to otherwise uncertified deaths, which amount to around 50% of global mortality and cause much uncertainty for health planning. The World Health Organization sets international standards for the structure of verbal autopsy interviews and for cause categories that can reasonably be derived from verbal autopsy data. In addition, computer models are needed to efficiently process large quantities of verbal autopsy interviews to assign causes of death in a standardised manner. Here, we present the InterVA-5 model, developed to align with the WHO-2016 verbal autopsy standard. This is a harmonising model that can process input data from WHO-2016, as well as earlier WHO-2012 and Tariff-2 formats, to generate standardised cause-specific mortality profiles for diverse contexts.
The software development involved building on the earlier InterVA-4 model, and the expanded knowledge base required for InterVA-5 was informed by analyses from a training dataset drawn from the Population Health Metrics Research Collaboration verbal autopsy reference dataset, as well as expert input.
Results
The new model was evaluated against a test dataset of 6130 cases from the Population Health Metrics Research Collaboration and 4009 cases from the Afghanistan National Mortality Survey dataset. Both of these sources contained around three quarters of the input items from the WHO-2016, WHO-2012 and Tariff-2 formats. Cause-specific mortality fractions across all applicable WHO cause categories were compared between causes assigned in participating tertiary hospitals and InterVA-5 in the test dataset, with concordance correlation coefficients of 0.92 for children and 0.86 for adults.
The InterVA-5 model’s capacity to handle different input formats was evaluated in the Afghanistan dataset, with concordance correlation coefficients of 0.97 and 0.96 between the WHO-2016 and the WHO-2012 format for children and adults respectively, and 0.92 and 0.87 between the WHO-2016 and the Tariff-2 format respectively.
Conclusions
Despite the inherent difficulties of determining “truth” in assigning cause of death, these findings suggest that the InterVA-5 model performs well and succeeds in harmonising across a range of input formats. As more primary data collected under WHO-2016 become available, it is likely that InterVA-5 will undergo minor re-versioning in the light of practical experience. The model is an important resource for measuring and evaluating cause-specific mortality globally.
Journal Article
DNA Barcoding: Error Rates Based on Comprehensive Sampling
2005
DNA barcoding has attracted attention with promises to aid in species identification and discovery; however, few well-sampled datasets are available to test its performance. We provide the first examination of barcoding performance in a comprehensively sampled, diverse group (cypraeid marine gastropods, or cowries). We utilize previous methods for testing performance and employ a novel phylogenetic approach to calculate intraspecific variation and interspecific divergence. Error rates are estimated for (1) identifying samples against a well-characterized phylogeny, and (2) assisting in species discovery for partially known groups. We find that the lowest overall error for species identification is 4%. In contrast, barcoding performs poorly in incompletely sampled groups. Here, species delineation relies on the use of thresholds, set to differentiate between intraspecific variation and interspecific divergence. Whereas proponents envision a \"barcoding gap\" between the two, we find substantial overlap, leading to minimal error rates of approximately 17% in cowries. Moreover, error rates double if only traditionally recognized species are analyzed. Thus, DNA barcoding holds promise for identification in taxonomically well-understood and thoroughly sampled clades. However, the use of thresholds does not bode well for delineating closely related species in taxonomically understudied groups. The promise of barcoding will be realized only if based on solid taxonomic foundations.
Journal Article
DNA Barcoding the Geometrid Fauna of Bavaria (Lepidoptera): Successes, Surprises, and Questions
by
Hebert, Paul D. N.
,
Hausmann, Axel
,
Haszprunar, Gerhard
in
Agrodiaetus
,
Analysis
,
Animal species
2011
The State of Bavaria is involved in a research program that will lead to the construction of a DNA barcode library for all animal species within its territorial boundaries. The present study provides a comprehensive DNA barcode library for the Geometridae, one of the most diverse of insect families.
This study reports DNA barcodes for 400 Bavarian geometrid species, 98 per cent of the known fauna, and approximately one per cent of all Bavarian animal species. Although 98.5% of these species possess diagnostic barcode sequences in Bavaria, records from neighbouring countries suggest that species-level resolution may be compromised in up to 3.5% of cases. All taxa which apparently share barcodes are discussed in detail. One case of modest divergence (1.4%) revealed a species overlooked by the current taxonomic system: Eupithecia goossensiata Mabille, 1869 stat.n. is raised from synonymy with Eupithecia absinthiata (Clerck, 1759) to species rank. Deep intraspecific sequence divergences (>2%) were detected in 20 traditionally recognized species.
The study emphasizes the effectiveness of DNA barcoding as a tool for monitoring biodiversity. Open access is provided to a data set that includes records for 1,395 geometrid specimens (331 species) from Bavaria, with 69 additional species from neighbouring regions. Taxa with deep intraspecific sequence divergences are undergoing more detailed analysis to ascertain if they represent cases of cryptic diversity.
Journal Article
Performance of four computer-coded verbal autopsy methods for cause of death assignment compared with physician coding on 24,000 deaths in low- and middle-income countries
by
Desai, Nikita
,
Leitao, Jordana
,
Miasnikof, Pierre
in
Automatic Data Processing - methods
,
Automatic Data Processing - standards
,
Autopsies
2014
Background
Physician-coded verbal autopsy (PCVA) is the most widely used method to determine causes of death (CODs) in countries where medical certification of death is uncommon. Computer-coded verbal autopsy (CCVA) methods have been proposed as a faster and cheaper alternative to PCVA, though they have not been widely compared to PCVA or to each other.
Methods
We compared the performance of open-source random forest, open-source tariff method, InterVA-4, and the King-Lu method to PCVA on five datasets comprising over 24,000 verbal autopsies from low- and middle-income countries. Metrics to assess performance were positive predictive value and partial chance-corrected concordance at the individual level, and cause-specific mortality fraction accuracy and cause-specific mortality fraction error at the population level.
Results
The positive predictive value for the most probable COD predicted by the four CCVA methods averaged about 43% to 44% across the datasets. The average positive predictive value improved for the top three most probable CODs, with greater improvements for open-source random forest (69%) and open-source tariff method (68%) than for InterVA-4 (62%). The average partial chance-corrected concordance for the most probable COD predicted by the open-source random forest, open-source tariff method and InterVA-4 were 41%, 40% and 41%, respectively, with better results for the top three most probable CODs. Performance generally improved with larger datasets. At the population level, the King-Lu method had the highest average cause-specific mortality fraction accuracy across all five datasets (91%), followed by InterVA-4 (72% across three datasets), open-source random forest (71%) and open-source tariff method (54%).
Conclusions
On an individual level, no single method was able to replicate the physician assignment of COD more than about half the time. At the population level, the King-Lu method was the best method to estimate cause-specific mortality fractions, though it does not assign individual CODs. Future testing should focus on combining different computer-coded verbal autopsy tools, paired with PCVA strengths. This includes using open-source tools applied to larger and varied datasets (especially those including a random sample of deaths drawn from the population), so as to establish the performance for age- and sex-specific CODs.
Journal Article
Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records
by
Broadbent, Matthew TM
,
Fernandes, Andrea C
,
Chang, Chin-Kuo
in
Air pollution control equipment
,
Algorithms
,
Alzheimer's disease
2013
Background
Electronic health records (EHRs) provide enormous potential for health research but also present data governance challenges. Ensuring de-identification is a pre-requisite for use of EHR data without prior consent. The South London and Maudsley NHS Trust (SLaM), one of the largest secondary mental healthcare providers in Europe, has developed, from its EHRs, a de-identified psychiatric case register, the Clinical Record Interactive Search (CRIS), for secondary research.
Methods
We describe development, implementation and evaluation of a bespoke de-identification algorithm used to create the register. It is designed to create dictionaries using patient identifiers (PIs) entered into dedicated source fields and then identify, match and mask them (with ZZZZZ) when they appear in medical texts. We deemed this approach would be effective, given high coverage of PI in the dedicated fields and the effectiveness of the masking combined with elements of a security model. We conducted two separate performance tests i) to test performance of the algorithm in masking individual
true PIs
entered in dedicated fields and then found in text (using 500 patient notes) and ii) to compare the performance of the CRIS pattern matching algorithm with a machine learning algorithm, called the MITRE Identification Scrubber Toolkit – MIST (using 70 patient notes – 50 notes to train, 20 notes to test on). We also report any incidences of
potential breaches
, defined by occurrences of 3 or more true or apparent PIs in the same patient’s notes (and in an additional set of longitudinal notes for 50 patients); and we consider the possibility of inferring information despite de-identification.
Results
True PIs were masked with 98.8% precision and 97.6% recall. As anticipated, potential PIs did appear, owing to misspellings entered within the EHRs. We found one potential breach. In a separate performance test, with a different set of notes, CRIS yielded 100% precision and 88.5% recall, while MIST yielded a 95.1% and 78.1%, respectively. We discuss how we overcome the realistic possibility – albeit of low probability – of potential breaches through implementation of the security model.
Conclusion
CRIS is a de-identified psychiatric database sourced from EHRs, which protects patient anonymity and maximises data available for research. CRIS demonstrates the advantage of combining an effective de-identification algorithm with a carefully designed security model. The paper advances much needed discussion of EHR de-identification – particularly in relation to criteria to assess de-identification, and considering the contexts of de-identified research databases when assessing the risk of breaches of confidential patient information.
Journal Article