Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
      More Filters
      Clear All
      More Filters
      Source
    • Language
248 result(s) for "Thornton, Timothy A"
Sort by:
Adjusting for principal components can induce collider bias in genome-wide association studies
Principal component analysis (PCA) is widely used to control for population structure in genome-wide association studies (GWAS). Top principal components (PCs) typically reflect population structure, but challenges arise in deciding how many PCs are needed and ensuring that PCs do not capture other artifacts such as regions with atypical linkage disequilibrium (LD). In response to the latter, many groups suggest performing LD pruning or excluding known high LD regions prior to PCA. However, these suggestions are not universally implemented and the implications for GWAS are not fully understood, especially in the context of admixed populations. In this paper, we investigate the impact of pre-processing and the number of PCs included in GWAS models in African American samples from the Women’s Health Initiative SNP Health Association Resource and two Trans-Omics for Precision Medicine Whole Genome Sequencing Project contributing studies (Jackson Heart Study and Genetic Epidemiology of Chronic Obstructive Pulmonary Disease Study). In all three samples, we find the first PC is highly correlated with genome-wide ancestry whereas later PCs often capture local genomic features. The pattern of which, and how many, genetic variants are highly correlated with individual PCs differs from what has been observed in prior studies focused on European populations and leads to distinct downstream consequences: adjusting for such PCs yields biased effect size estimates and elevated rates of spurious associations due to the phenomenon of collider bias. Excluding high LD regions identified in previous studies does not resolve these issues. LD pruning proves more effective, but the optimal choice of thresholds varies across datasets. Altogether, our work highlights unique issues that arise when using PCA to control for ancestral heterogeneity in admixed populations and demonstrates the importance of careful pre-processing and diagnostics to ensure that PCs capturing multiple local genomic features are not included in GWAS models.
Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI
Polygenic risk scores (PRS) have shown successes in clinics, but most PRS methods focus only on participants with distinct primary continental ancestry without accommodating recently-admixed individuals with mosaic continental ancestry backgrounds for different segments of their genomes. Here, we develop GAUDI, a novel penalized-regression-based method specifically designed for admixed individuals. GAUDI explicitly models ancestry-differential effects while borrowing information across segments with shared ancestry in admixed genomes. We demonstrate marked advantages of GAUDI over other methods through comprehensive simulation and real data analyses for traits with associated variants exhibiting ancestral-differential effects. Leveraging data from the Women’s Health Initiative study, we show that GAUDI improves PRS prediction of white blood cell count and C-reactive protein in African Americans by > 64% compared to alternative methods, and even outperforms PRS-CSx with large European GWAS for some scenarios. We believe GAUDI will be a valuable tool to mitigate disparities in PRS performance in admixed individuals. Most polygenic risk score (PRS) methods focus only on individuals with distinct primary continental ancestry, without accommodating recently-admixed individuals. Here, the authors develop a novel penalized regression-based PRS method specifically designed for admixed individuals.
On the cross-population generalizability of gene expression prediction models
The genetic control of gene expression is a core component of human physiology. For the past several years, transcriptome-wide association studies have leveraged large datasets of linked genotype and RNA sequencing information to create a powerful gene-based test of association that has been used in dozens of studies. While numerous discoveries have been made, the populations in the training data are overwhelmingly of European descent, and little is known about the generalizability of these models to other populations. Here, we test for cross-population generalizability of gene expression prediction models using a dataset of African American individuals with RNA-Seq data in whole blood. We find that the default models trained in large datasets such as GTEx and DGN fare poorly in African Americans, with a notable reduction in prediction accuracy when compared to European Americans. We replicate these limitations in cross-population generalizability using the five populations in the GEUVADIS dataset. Via realistic simulations of both populations and gene expression, we show that accurate cross-population generalizability of transcriptome prediction only arises when eQTL architecture is substantially shared across populations. In contrast, models with non-identical eQTLs showed patterns similar to real-world data. Therefore, generating RNA-Seq data in diverse populations is a critical step towards multi-ethnic utility of gene expression prediction.
Genome-wide association study identifies peanut allergy-specific loci and evidence of epigenetic mediation in US children
Food allergy (FA) affects 2%–10% of US children and is a growing clinical and public health problem. Here we conduct the first genome-wide association study of well-defined FA, including specific subtypes (peanut, milk and egg) in 2,759 US participants (1,315 children and 1,444 parents) from the Chicago Food Allergy Study, and identify peanut allergy (PA)-specific loci in the HLA-DR and -DQ gene region at 6p21.32, tagged by rs7192 ( P =5.5 × 10 −8 ) and rs9275596 ( P =6.8 × 10 −10 ), in 2,197 participants of European ancestry. We replicate these associations in an independent sample of European ancestry. These associations are further supported by meta-analyses across the discovery and replication samples. Both single-nucleotide polymorphisms (SNPs) are associated with differential DNA methylation levels at multiple CpG sites ( P <5 × 10 −8 ), and differential DNA methylation of the HLA-DQB1 and HLA-DRB1 genes partially mediate the identified SNP–PA associations. This study suggests that the HLA-DR and - DQ gene region probably poses significant genetic risk for PA. Food allergy is a growing clinical and public health burden. Here, the authors carry out a genome-wide association study in samples with well-defined allergies to a variety of foods, and identify the 6p21.32 region that significantly increases risk of developing peanut allergy.
Interferon gamma-induced protein 10 (IP-10) and cardiovascular disease in African Americans
Biomarkers of chronic inflammation (such as C-reactive protein) have long been associated with cardiovascular disease and mortality; however, biomarkers involved in antiviral cytokine induction and adaptive immune system activation remain largely unexamined. We hypothesized the cytokine interferon gamma inducible protein 10 (IP-10) would be associated with clinical and subclinical cardiovascular disease and all-cause mortality in African Americans. We assessed these associations in the Jackson Heart Study (JHS) cohort and the REasons for Geographic and Racial Differences in Stroke (REGARDS) study. There was a modest association of IP-10 with higher odds of left ventricular hypertrophy (OR = 1.20 (95% confidence interval (CI) 1.03, 1.41) per standard deviation (SD) higher natural log-transformed IP-10 in JHS). We did not observe associations with ankle brachial index, intima-media thickness, or arterial calcification. Each SD higher increment of ln-transformed IP-10 concentration was associated with incident heart failure (hazard ratio (HR) 1.26; 95% CI 1.11, 1.42, p = 4x10-4) in JHS, and with overall mortality in both JHS (HR 1.12 per SD, 95% CI 1.03, 1.21, p = 7.5x10-3) and REGARDS (HR 1.31 per SD, 95% CI 1.10, 1.55, p = 2.0 x 10-3), adjusting for cardiovascular risk factors and C-reactive protein. However, we found no association between IP-10 and stroke or coronary heart disease. These results suggest a role of IP-10 in heart failure and mortality risk independent of C-reactive protein. Further research is needed to investigate how the body's response to chronic viral infection may mediate heart failure and overall mortality risk in African Americans.
Protein prediction for trait mapping in diverse populations
Genetically regulated gene expression has helped elucidate the biological mechanisms underlying complex traits. Improved high-throughput technology allows similar interrogation of the genetically regulated proteome for understanding complex trait mechanisms. Here, we used the Trans-omics for Precision Medicine (TOPMed) Multi-omics pilot study, which comprises data from Multi-Ethnic Study of Atherosclerosis (MESA), to optimize genetic predictors of the plasma proteome for genetically regulated proteome-wide association studies (PWAS) in diverse populations. We built predictive models for protein abundances using data collected in TOPMed MESA, for which we have measured 1,305 proteins by a SOMAscan assay. We compared predictive models built via elastic net regression to models integrating posterior inclusion probabilities estimated by fine-mapping SNPs prior to elastic net. In order to investigate the transferability of predictive models across ancestries, we built protein prediction models in all four of the TOPMed MESA populations, African American (n = 183), Chinese (n = 71), European (n = 416), and Hispanic/Latino (n = 301), as well as in all populations combined. As expected, fine-mapping produced more significant protein prediction models, especially in African ancestries populations, potentially increasing opportunity for discovery. When we tested our TOPMed MESA models in the independent European INTERVAL study, fine-mapping improved cross-ancestries prediction for some proteins. Using GWAS summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study, which comprises ∼50,000 Hispanic/Latinos, African Americans, Asians, Native Hawaiians, and Native Americans, we applied S-PrediXcan to perform PWAS for 28 complex traits. The most protein-trait associations were discovered, colocalized, and replicated in large independent GWAS using proteome prediction model training populations with similar ancestries to PAGE. At current training population sample sizes, performance between baseline and fine-mapped protein prediction models in PWAS was similar, highlighting the utility of elastic net. Our predictive models in diverse populations are publicly available for use in proteome mapping methods at https://doi.org/10.5281/zenodo.4837327 .
Inflammatory Bowel Disease (IBD) pharmacotherapy and the risk of serious infection: a systematic review and network meta-analysis
Background The magnitude of risk of serious infections due to available medical therapies of inflammatory bowel disease (IBD) remains controversial. We conducted a systematic review and network meta-analysis of the existing IBD literature to estimate the risk of serious infection in adult IBD patients associated with available medical therapies. Methods Studies were identified by a literature search of PubMed, Cochrane Library, Medline, Web of Science, Scopus, EMBASE, and ProQuest Dissertations and Theses. Randomized controlled trials comparing IBD medical therapies with no restrictions on language, country of origin, or publication date were included. A network meta-analysis was used to pool direct between treatment comparisons with indirect trial evidence while preserving randomization. Results Thirty-nine articles fulfilled the inclusion criteria; one study was excluded from the analysis due to disconnectedness. We found no evidence of increased odds of serious infection in comparisons of the different treatment strategies against each other, including combination therapy with a biologic and immunomodulator compared to biologic monotherapy. Similar results were seen in the comparisons between the newer biologics (e.g. vedolizumab) and the anti-tumor necrosis factor agents. Conclusions No treatment strategy was found to confer a higher risk of serious infection than another, although wide confidence intervals indicate that a clinically significant difference cannot be excluded. These findings provide a better understanding of the risk of serious infection from IBD pharmacotherapy in the adult population. Prospero registration The protocol for this systematic review was registered on PROSPERO ( CRD42014013497 ).
Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations
The genetic structure of human populations is often characterized by aggregating measures of ancestry across the autosomal chromosomes. While it may be reasonable to assume that population structure patterns are similar genome-wide in relatively homogeneous populations, this assumption may not be appropriate for admixed populations, such as Hispanics and African-Americans, with recent ancestry from two or more continents. Recent studies have suggested that systematic ancestry differences can arise at genomic locations in admixed populations as a result of selection and nonrandom mating. Here, we propose a method, which we refer to as the chromosomal ancestry differences (CAnD) test, for detecting heterogeneity in population structure across the genome. CAnD can incorporate either local or chromosome-wide ancestry inferred from SNP genotype data to identify chromosomes harboring genomic regions with ancestry contributions that are significantly different than expected. In simulation studies with real genotype data from phase III of the HapMap Project, we demonstrate the validity and power of CAnD. We apply CAnD to the HapMap Mexican-American (MXL) and African-American (ASW) population samples; in this analysis the software RFMix is used to infer local ancestry at genomic regions, assuming admixing from Europeans, West Africans, and Native Americans. The CAnD test provides strong evidence of heterogeneity in population structure across the genome in the MXL sample (p=1e−5), which is largely driven by elevated Native American ancestry and deficit of European ancestry on the X chromosomes. Among the ASW, all chromosomes are largely African derived and no heterogeneity in population structure is detected in this sample.
Admixture mapping in the Hispanic Community Health Study/Study of Latinos reveals regions of genetic associations with blood pressure traits
Admixture mapping can be used to detect genetic association regions in admixed populations, such as Hispanics/Latinos, by estimating associations between local ancestry allele counts and the trait of interest. We performed admixture mapping of the blood pressure traits systolic and diastolic blood pressure (SBP, DBP), mean arterial pressure (MAP), and pulse pressure (PP), in a dataset of 12,116 participants from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). Hispanics/Latinos have three predominant ancestral populations (European, African, and Amerindian), for each of which we separately tested local ancestry intervals across the genome. We identified four regions that were significantly associated with a blood pressure trait at the genome-wide admixture mapping level. A 6p21.31 Amerindian ancestry association region has multiple known associations, but none explained the admixture mapping signal. We identified variants that completely explained this signal. One of these variants had p-values of 0.02 (MAP) and 0.04 (SBP) in replication testing in Pima Indians. A 11q13.4 Amerindian ancestry association region spans a variant that was previously reported (p-value = 0.001) in a targeted association study of Blood Pressure (BP) traits and variants in the vitamin D pathway. There was no replication evidence supporting an association in the identified 17q25.3 Amerindian ancestry association region. For a region on 6p12.3, associated with African ancestry, we did not identify any candidate variants driving the association. It may be driven by rare variants. Whole genome sequence data may be necessary to fine map these association signals, which may contribute to disparities in BP traits between diverse populations.
Admixture mapping reveals the association between Native American ancestry at 3q13.11 and reduced risk of Alzheimer’s disease in Caribbean Hispanics
Background Genetic studies have primarily been conducted in European ancestry populations, identifying dozens of loci associated with late-onset Alzheimer’s disease (AD). However, much of AD’s heritability remains unexplained; as the prevalence of AD varies across populations, the genetic architecture of the disease may also vary by population with the presence of novel variants or loci. Methods We conducted genome-wide analyses of AD in a sample of 2565 Caribbean Hispanics to better understand the genetic contribution to AD in this population. Statistical analysis included both admixture mapping and association testing. Evidence for differential gene expression within regions of interest was collected from independent transcriptomic studies comparing AD cases and controls in samples with primarily European ancestry. Results Our genome-wide association study of AD identified no loci reaching genome-wide significance. However, a genome-wide admixture mapping analysis that tests for association between a haplotype’s ancestral origin and AD status detected a genome-wide significant association with chromosome 3q13.11 (103.7–107.7Mb, P = 8.76E−07), driven by a protective effect conferred by the Native American ancestry (OR = 0.58, 95%CI = 0.47−0.73). Within this region, two variants were significantly associated with AD after accounting for the number of independent tests (rs12494162, P = 2.33E−06; rs1731642, P = 6.36E−05). The significant admixture mapping signal is composed of 15 haplotype blocks spanning 5 protein-coding genes ( ALCAM , BBX , CBLB , CCDC54 , CD47 ) and four brain-derived topologically associated domains, and includes markers significantly associated with the expression of ALCAM , BBX , CBLB , and CD47 in the brain. ALCAM and BBX were also significantly differentially expressed in the brain between AD cases and controls with European ancestry. Conclusion These results provide multiethnic evidence for a relationship between AD and multiple genes at 3q13.11 and illustrate the utility of leveraging genetic ancestry diversity via admixture mapping for new insights into AD.