Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
      More Filters
      Clear All
      More Filters
      Source
    • Language
100 result(s) for "Gourraud, Pierre Antoine"
Sort by:
The nature of genetic and environmental susceptibility to multiple sclerosis
To understand the nature of genetic and environmental susceptibility to multiple sclerosis (MS) and, by extension, susceptibility to other complex genetic diseases. Certain basic epidemiological parameters of MS (e.g., population-prevalence of MS, recurrence-risks for MS in siblings and twins, proportion of women among MS patients, and the time-dependent changes in the sex-ratio) are well-established. In addition, more than 233 genetic-loci have now been identified as being unequivocally MS-associated, including 32 loci within the major histocompatibility complex (MHC), and one locus on the X chromosome. Despite this recent explosion in genetic associations, however, the association of MS with the HLA-DRB1*15:01~HLA-DQB1*06:02~a1 (H+) haplotype has been known for decades. We define the \"genetically-susceptible\" subset (G) to include everyone with any non-zero life-time chance of developing MS. Individuals who have no chance of developing MS, regardless of their environmental experiences, belong to the mutually exclusive \"non-susceptible\" subset (G-). Using these well-established epidemiological parameters, we analyze, mathematically, the implications that these observations have regarding the genetic-susceptibility to MS. In addition, we use the sex-ratio change (observed over a 35-year interval in Canada), to derive the relationship between MS-probability and an increasing likelihood of a sufficient environmental exposure. We demonstrate that genetic-susceptibitly is confined to less than 7.3% of populations throughout Europe and North America. Consequently, more than 92.7% of individuals in these populations have no chance whatsoever of developing MS, regardless of their environmental experiences. Even among carriers of the HLA-DRB1*15:01~HLA-DQB1*06:02~a1 haplotype, far fewer than 32% can possibly be members the (G) subset. Also, despite the current preponderance of women among MS patients, women are less likely to be in the susceptible (G) subset and have a higher environmental threshold for developing MS compared to men. Nevertheless, the penetrance of MS in susceptible women is considerably greater than it is in men. Moreover, the response-curves for MS-probability in susceptible individuals increases with an increasing likelihood of a sufficient environmental exposure, especially among women. However, these environmental response-curves plateau at under 50% for women and at a significantly lower level for men. The pathogenesis of MS requires both a genetic predisposition and a suitable environmental exposure. Nevertheless, genetic-susceptibility is rare in the population (< 7.3%) and requires specific combinations of non-additive genetic risk-factors. For example, only a minority of carriers of the HLA-DRB1*15:01~HLA-DQB1*06:02~a1 haplotype are even in the (G) subset and, thus, genetic-susceptibility to MS in these carriers must result from the combined effect this haplotype together with the effects of certain other (as yet, unidentified) genetic factors. By itself, this haplotype poses no MS-risk. By contrast, a sufficient environmental exposure (however many events are involved, whenever these events need to act, and whatever these events might be) is common, currently occurring in, at least, 76% of susceptible individuals. In addition, the fact that environmental response-curves plateau well below 50% (especially in men), indicates that disease pathogenesis is partly stochastic. By extension, other diseases, for which monozygotic-twin recurrence-risks greatly exceed the disease-prevalence (e.g., rheumatoid arthritis, diabetes, and celiac disease), must have a similar genetic basis.
Flexibility of a large blindly synthetized avatar database for occupational research: Example from the CONSTANCES cohort for stroke and knee pain
Though the rise of big data in the field of occupational health offers new opportunities especially for cross-cutting research, they raise the issue of privacy and security of data, especially when linking sensitive data from the field of insurance, occupational health or compensation claims. We aimed to validate a large, blinded synthesized database developed from the CONSTANCES cohort by comparing associations between three independently selected outcomes, and various exposures. From the CONSTANCES cohort, a large synthetic dataset was constructed using the avatar method (Octopize) that is agnostic to the data primary or secondary data uses. Three main analyses of interest were chosen to compare associations between the raw and avatar dataset: risk of stroke (any stroke, and subtypes of stroke), risk of knee pain and limitations associated with knee pain. Logistic models were computed, and a qualitative comparison of paired odds ratio (OR) was made. Both raw and avatar datasets included 162,434 observations and 19 relevant variables. On the 172 paired raw/avatar OR that were computed, including stratified analyses on sex, more than 77% of the comparisons had a OR difference ≤0.5 and less than 7% had a discrepancy in the statistical significance of the associations, with a Cohen's Kappa coefficient of 0.80. This study shows the flexibility and the multiple usage of a synthetic database created with the avatar method in the particular field of occupational health, which can be shared in open access without risking re-identification and privacy issues and help bring new insights for complex phenomenon like return to work.
Stratifying Multiple Sclerosis Susceptibility Risk: The Role of HLA‐E01 and Infectious Mononucleosis in a Population Cohort
Background Epstein–Barr Virus (EBV) and its clinical manifestation, infectious mononucleosis (IM), are strongly linked to MS risk. A recent in vitro study suggests that HLA‐E*01:03, in contrast to HLA‐E*01:01, may protect against MS through more effective immune responses against EBV‐infected B cells. However, the role of HLA‐E*01 in MS remains unclear. Methods We investigated if HLA‐E*01:01 was significantly associated with higher MS risk in individuals with a history of IM diagnosis, using 487,144 individuals from the UK Biobank's cohort. We estimated the interaction between HLA‐E*01:01 and IM using Cox proportional hazard models, adjusting for demographics, smoking, childhood body size, older siblings, and MS‐related HLA alleles (e.g., HLA‐DRB1*15:01). Results HLA‐E*01:01 allele alone was not significantly associated with IM or MS (p > 0.05). However, a significant interaction between HLA‐E*01:01 and IM history was observed in relation to MS risk (p < 0.001). Specifically, MS risk was significantly higher in both HLA‐E*01:01 heterozygotes (HR = 1.74 [95% CI: 1.36, 1.97], p < 0.001) and homozygotes (HR = 3.01 [95% CI: 1.81, 3.88], p < 0.001) with IM history, compared to HLA‐E*01:03 homozygotes. Conversely, these associations were non‐significant in individuals without IM history (p > 0.05). The estimated proportion of the combined risk attributable to interaction effects was 40% in HLA‐E*01:01 heterozygotes and 65% in HLA‐E*01:01 homozygotes. Conclusions HLA‐E*01:01 carriers diagnosed with IM are at significantly increased risk of MS, independently from other MS‐related HLA alleles. This supports the hypothesis that HLA‐E*01:01 may contribute to MS susceptibility due to weaker immune control over EBV infection. Incorporating HLA‐E*01:01 into existing MHC‐based MS risk models could then enhance personalized risk assessments in individuals with IM history.
Addressing the Challenges and Barriers to the Integration of Machine Learning into Clinical Practice: An Innovative Method to Hybrid Human–Machine Intelligence
Machine learning (ML) models have proven their potential in acquiring and analyzing large amounts of data to help solve real-world, complex problems. Their use in healthcare is expected to help physicians make diagnoses, prognoses, treatment decisions, and disease outcome predictions. However, ML solutions are not currently deployed in most healthcare systems. One of the main reasons for this is the provenance, transparency, and clinical utility of the training data. Physicians reject ML solutions if they are not at least based on accurate data and do not clearly include the decision-making process used in clinical practice. In this paper, we present a hybrid human–machine intelligence method to create predictive models driven by clinical practice. We promote the use of quality-approved data and the inclusion of physician reasoning in the ML process. Instead of training the ML algorithms on the given data to create predictive models (conventional method), we propose to pre-categorize the data according to the expert physicians’ knowledge and experience. Comparing the results of the conventional method of ML learning versus the hybrid physician–algorithm method showed that the models based on the latter can perform better. Physicians’ engagement is the most promising condition for the safe and innovative use of ML in healthcare.
Multiple sclerosis: Exploring the limits and implications of genetic and environmental susceptibility
To explore and describe the basis and implications of genetic and environmental susceptibility to multiple sclerosis (MS) using the Canadian population-based data. Certain parameters of MS-epidemiology are directly observable (e.g., the recurrence-risk of MS in siblings and twins, the proportion of women among MS patients, the population-prevalence of MS, and the time-dependent changes in the sex-ratio). By contrast, other parameters can only be inferred from the observed parameters (e.g., the proportion of the population that is \"genetically susceptible\", the proportion of women among susceptible individuals, the probability that a susceptible individual will experience an environment \"sufficient\" to cause MS, and if they do, the probability that they will develop the disease). The \"genetically susceptible\" subset (G) of the population (Z) is defined to include everyone with any non-zero life-time chance of developing MS under some environmental conditions. The value for each observed and non-observed epidemiological parameter is assigned a \"plausible\" range. Using both a Cross-sectional Model and a Longitudinal Model, together with established parameter relationships, we explore, iteratively, trillions of potential parameter combinations and determine those combinations (i.e., solutions) that fall within the acceptable range for both the observed and non-observed parameters. Both Models and all analyses intersect and converge to demonstrate that probability of genetic-susceptibitly, P(G), is limited to only a fraction of the population {i.e., P(G) ≤ 0.52)} and an even smaller fraction of women {i.e., P(G│F) < 0.32)}. Consequently, most individuals (particularly women) have no chance whatsoever of developing MS, regardless of their environmental exposure. However, for any susceptible individual to develop MS, requires that they also experience a \"sufficient\" environment. We use the Canadian data to derive, separately, the exponential response-curves for men and women that relate the increasing likelihood of developing MS to an increasing probability that a susceptible individual experiences an environment \"sufficient\" to cause MS. As the probability of a \"sufficient\" exposure increases, we define, separately, the limiting probability of developing MS in men (c) and women (d). These Canadian data strongly suggest that: (c < d ≤ 1). If so, this observation establishes both that there must be a \"truly\" random factor involved in MS pathogenesis and that it is this difference, rather than any difference in genetic or environmental factors, which primarily accounts for the penetrance difference between women and men. The development of MS (in an individual) requires both that they have an appropriate genotype (which is uncommon in the population) and that they have an environmental exposure \"sufficient\" to cause MS given their genotype. Nevertheless, the two principal findings of this study are that: P(G) ≤ 0.52)} and: (c < d ≤ 1). Threfore, even when the necessary genetic and environmental factors, \"sufficient\" for MS pathogenesis, co-occur for an individual, they still may or may not develop MS. Consequently, disease pathogenesis, even in this circumstance, seems to involve an important element of chance. Moreover, the conclusion that the macroscopic process of disease development for MS includes a \"truly\" random element, if replicated (either for MS or for other complex diseases), provides empiric evidence that our universe is non-deterministic.
Scoping review of clinical decision support systems for multiple sclerosis management: Leveraging information technology and massive health data
Background and purpose Multiple sclerosis (MS) is a complex autoimmune disease of the central nervous system, with numerous therapeutic options, but a lack of biomarkers to support a mechanistic approach to precision medicine. A computational approach to precision medicine could proceed from clinical decision support systems (CDSSs). They are digital tools aiming to empower physicians through the clinical applications of information technology and massive data. However, the process of their clinical development is still maturing; we aimed to review it in the field of MS. Methods For this scoping review, we screened systematically the PubMed database. We identified 24 articles reporting 14 CDSS projects and compared their technical and software development aspects. Results The projects position themselves in various contexts of usage with various algorithmic approaches: expert systems, CDSSs based on similar patients' data visualization, and model‐based CDSSs implementing mathematical predictive models. So far, no project has completed its clinical development up to certification for clinical use with global release. Some CDSSs have been replaced at subsequent project iterations. The most advanced projects did not necessarily report every step of clinical development in a dedicated article (proof of concept, offline validation, refined prototype, live clinical evaluation, comparative prospective evaluation). They seek different software distribution options to integrate into health care: internal usage, “peer‐to‐peer,” and marketing distribution. Conclusions This review illustrates the potential of clinical applications of information technology and massive data to support MS management and helps clarify the roadmap for future projects as a multidisciplinary and multistep process.
HLA Diversity in the 1000 Genomes Dataset
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC), only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD) decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies.
Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review
In recent years, health data collected during the clinical care process have been often repurposed for secondary use through clinical data warehouses (CDWs), which interconnect disparate data from different sources. A large amount of information of high clinical value is stored in unstructured text format. Natural language processing (NLP), which implements algorithms that can operate on massive unstructured textual data, has the potential to structure the data and make clinical information more accessible. The aim of this review was to provide an overview of studies applying NLP to textual data from CDWs. It focuses on identifying the (1) NLP tasks applied to data from CDWs and (2) NLP methods used to tackle these tasks. This review was performed according to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. We searched for relevant articles in 3 bibliographic databases: PubMed, Google Scholar, and ACL Anthology. We reviewed the titles and abstracts and included articles according to the following inclusion criteria: (1) focus on NLP applied to textual data from CDWs, (2) articles published between 1995 and 2021, and (3) written in English. We identified 1353 articles, of which 194 (14.34%) met the inclusion criteria. Among all identified NLP tasks in the included papers, information extraction from clinical text (112/194, 57.7%) and the identification of patients (51/194, 26.3%) were the most frequent tasks. To address the various tasks, symbolic methods were the most common NLP methods (124/232, 53.4%), showing that some tasks can be partially achieved with classical NLP techniques, such as regular expressions or pattern matching that exploit specialized lexica, such as drug lists and terminologies. Machine learning (70/232, 30.2%) and deep learning (38/232, 16.4%) have been increasingly used in recent years, including the most recent approaches based on transformers. NLP methods were mostly applied to English language data (153/194, 78.9%). CDWs are central to the secondary use of clinical texts for research purposes. Although the use of NLP on data from CDWs is growing, there remain challenges in this field, especially with regard to languages other than English. Clinical NLP is an effective strategy for accessing, extracting, and transforming data from CDWs. Information retrieved with NLP can assist in clinical research and have an impact on clinical practice.
Genetic susceptibility to multiple sclerosis in African Americans
To explore the nature of genetic-susceptibility to multiple sclerosis (MS) in African-Americans. Recently, the number of genetic-associations with MS has exploded although the MS-associations of specific haplotypes within the major histocompatibility complex (MHC) have been known for decades. For example, the haplotypes HLA-DRB1*15:01~HLA-DQB1*06:02, and HLA-DRB1*03:01~ HLA-DQB1*02:01 have odds ratios (ORs) for an MS-association orders of magnitude stronger than many of these newly-discovered associations. Nevertheless, all these haplotypes are part of much larger conserved extended haplotypes (CEHs), which span both the Class I and Class II MHC regions. African-Americans are at greater risk of developing MS compared to a native Africans but at lesser risk compared to Europeans. It is the purpose of this manuscript to explore the relationship between MS-susceptibility and the CEH make-up of our African-American cohort. The African-American (AA) cohort consisted of 1,305 patients with MS and 1,155 controls, who self-identified as being African-American. For comparison, we used the 18,492 controls and 11,144 MS-cases from the predominantly European Wellcome Trust Case Control Consortium (WTCCC) and the 28,557 phased native Africans from the multinational \"Be the Match\" registry. The WTCCC and the African-Americans were phased at each of five HLA loci (HLA-A, HLA-C, HLA-B, HLA-DRB1 and HLA-DQB1) and the at 11 SNPs (10 of which were in non-coding regions) surrounding the Class II region of the DRB1 gene using previously-published probabilistic phasing algorithms. Of the 32 most frequent CEHs, 18 (56%) occurred either more frequently or exclusively in Africans) whereas 9 (28%) occurred more frequently or exclusively in Europeans. The remaining 5 CEHs occurred in neither control group although, likely, these were African in origin. Eight of these CEHs carried the DRB1*15:03~DQB1*06:02~a36 haplotype and three carried the DRB1*15:01~DQB1*06:02~a1 haplotype. In African Americans, a single-copy of the European CEH (03:01_07:02_07:02_15:01_06:02_a1) was associated with considerable MS-risk (OR = 3.30; p = 0.0001)-similar to that observed in the WTCCC (OR = 3.25; p<10.sup.-168). By contrast, the MS-risk for the European CEH (02:01_07:02_07:02_15:01_06:02_a1) was less (OR = 1.49; ns)-again, similar to the WTCCC (OR = 2.2; p<10.sup.-38). Moreover, four African haplotypes were \"protective\" relative to a neutral reference, to three European CEHs, and also to the five other African CEHs. The common CEHs in African Americans are divisible into those that are either African or European in origin, which are derived without modification from their source population. European CEHs, linked to MS-risk, in general, had similar impacts in African-Americans as they did in Europeans. By contrast, African CEHs had mixed MS-risks. For a few, the MS-risk exceeded that in a neutral-reference group whereas, for many others, these CEHs were \"protective\"-perhaps providing a partial rationale for the lower MS-risk in African-Americans compared to European-Americans.
High-Accuracy HLA Type Inference from Whole-Genome Sequencing Data Using Population Reference Graphs
Genetic variation at the Human Leucocyte Antigen (HLA) genes is associated with many autoimmune and infectious disease phenotypes, is an important element of the immunological distinction between self and non-self, and shapes immune epitope repertoires. Determining the allelic state of the HLA genes (HLA typing) as a by-product of standard whole-genome sequencing data would therefore be highly desirable and enable the immunogenetic characterization of samples in currently ongoing population sequencing projects. Extensive hyperpolymorphism and sequence similarity between the HLA genes, however, pose problems for accurate read mapping and make HLA type inference from whole-genome sequencing data a challenging problem. We describe how to address these challenges in a Population Reference Graph (PRG) framework. First, we construct a PRG for 46 (mostly HLA) genes and pseudogenes, their genomic context and their characterized sequence variants, integrating a database of over 10,000 known allele sequences. Second, we present a sequence-to-PRG paired-end read mapping algorithm that enables accurate read mapping for the HLA genes. Third, we infer the most likely pair of underlying alleles at G group resolution from the IMGT/HLA database at each locus, employing a simple likelihood framework. We show that HLA*PRG, our algorithm, outperforms existing methods by a wide margin. We evaluate HLA*PRG on six classical class I and class II HLA genes (HLA-A, -B, -C, -DQA1, -DQB1, -DRB1) and on a set of 14 samples (3 samples with 2 x 100bp, 11 samples with 2 x 250bp Illumina HiSeq data). Of 158 alleles tested, we correctly infer 157 alleles (99.4%). We also identify and re-type two erroneous alleles in the original validation data. We conclude that HLA*PRG for the first time achieves accuracies comparable to gold-standard reference methods from standard whole-genome sequencing data, though high computational demands (currently ~30-250 CPU hours per sample) remain a significant challenge to practical application.