Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
13,033
result(s) for
"Linkage disequilibrium"
Sort by:
The power of genetic diversity in genome-wide association studies of lipids
2021
Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use
1
. Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels
2
, heart disease remains the leading cause of death worldwide
3
. Genome-wideassociation studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS
4
–
23
have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns
24
. Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine
25
, we anticipate that increased diversity of participants will lead to more accurate and equitable
26
application of polygenic scores in clinical practice.
A genome-wide association meta-analysis study of blood lipid levels in roughly 1.6 million individuals demonstrates the gain of power attained when diverse ancestries are included to improve fine-mapping and polygenic score generation, with gains in locus discovery related to sample size.
Journal Article
A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response
2021
Fine-mapping to plausible causal variation may be more effective in multi-ancestry cohorts, particularly in the MHC, which has population-specific structure. To enable such studies, we constructed a large (
n
= 21,546) HLA reference panel spanning five global populations based on whole-genome sequences. Despite population-specific long-range haplotypes, we demonstrated accurate imputation at G-group resolution (94.2%, 93.7%, 97.8% and 93.7% in admixed African (AA), East Asian (EAS), European (EUR) and Latino (LAT) populations). Applying HLA imputation to genome-wide association study data for HIV-1 viral load in three populations (EUR, AA and LAT), we obviated effects of previously reported associations from population-specific HIV studies and discovered a novel association at position 156 in HLA-B. We pinpointed the MHC association to three amino acid positions (97, 67 and 156) marking three consecutive pockets (C, B and D) within the HLA-B peptide-binding groove, explaining 12.9% of trait variance.
A high-resolution reference panel based on whole-genome sequencing data enables accurate imputation of
HLA
alleles across diverse populations and fine-mapping of HLA association signals for HIV-1 host response.
Journal Article
Multi-ancestry fine mapping implicates OAS1 splicing in risk of severe COVID-19
by
Baillie, J. Kenneth
,
Zeberg, Hugo
,
Pairo-Castineira, Erola
in
2',5'-Oligoadenylate Synthetase - genetics
,
631/208/205
,
631/250/248
2022
The
OAS1/2/3
cluster has been identified as a risk locus for severe COVID-19 among individuals of European ancestry, with a protective haplotype of approximately 75 kilobases (kb) derived from Neanderthals in the chromosomal region 12q24.13. This haplotype contains a splice variant of
OAS1
, which occurs in people of African ancestry independently of gene flow from Neanderthals. Using trans-ancestry fine-mapping approaches in 20,779 hospitalized cases, we demonstrate that this splice variant is likely to be the SNP responsible for the association at this locus, thus strongly implicating
OAS1
as an effector gene influencing COVID-19 severity.
Multi-ancestry fine-mapping of the
OAS1/2/3
region shows that a splice site variant in
OAS1
is likely responsible for the association of this locus with the risk of severe COVID-19.
Journal Article
A saturated map of common genetic variants associated with human height
2022
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes
1
. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel
2
) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
A large genome-wide association study of more than 5 million individuals reveals that 12,111 single-nucleotide polymorphisms account for nearly all the heritability of height attributable to common genetic variants.
Journal Article
Polygenic prediction via Bayesian regression and continuous shrinkage priors
2019
Polygenic risk scores (PRS) have shown promise in predicting human complex traits and diseases. Here, we present PRS-CS, a polygenic prediction method that infers posterior effect sizes of single nucleotide polymorphisms (SNPs) using genome-wide association summary statistics and an external linkage disequilibrium (LD) reference panel. PRS-CS utilizes a high-dimensional Bayesian regression framework, and is distinct from previous work by placing a continuous shrinkage (CS) prior on SNP effect sizes, which is robust to varying genetic architectures, provides substantial computational advantages, and enables multivariate modeling of local LD patterns. Simulation studies using data from the UK Biobank show that PRS-CS outperforms existing methods across a wide range of genetic architectures, especially when the training sample size is large. We apply PRS-CS to predict six common complex diseases and six quantitative traits in the Partners HealthCare Biobank, and further demonstrate the improvement of PRS-CS in prediction accuracy over alternative methods.
Polygenic risk scores (PRS) have the potential to predict complex diseases and traits from genetic data. Here, Ge et al. develop PRS-CS which uses a Bayesian regression framework, continuous shrinkage (CS) priors and an external LD reference panel for polygenic prediction of binary and quantitative traits from GWAS summary statistics.
Journal Article
High-definition likelihood inference of genetic correlations across human complex traits
2020
Genetic correlation is a central parameter for understanding shared genetic architecture between complex traits. By using summary statistics from genome-wide association studies (GWAS), linkage disequilibrium score regression (LDSC) was developed for unbiased estimation of genetic correlations. Although easy to use, LDSC only partially utilizes LD information. By fully accounting for LD across the genome, we develop a high-definition likelihood (HDL) method to improve precision in genetic correlation estimation. Compared to LDSC, HDL reduces the variance of genetic correlation estimates by about 60%, equivalent to a 2.5-fold increase in sample size. We apply HDL and LDSC to estimate 435 genetic correlations among 30 behavioral and disease-related phenotypes measured in the UK Biobank (UKBB). In addition to 154 significant genetic correlations observed for both methods, HDL identified another 57 significant genetic correlations, compared to only another 2 significant genetic correlations identified by LDSC. HDL brings more power to genomic analyses and better reveals the underlying connections across human complex traits.
The HDL method improves the precision in genetic correlation estimation over LD score regression when applied to GWAS summary statistics of complex traits from the UK Biobank.
Journal Article
Probabilistic fine-mapping of transcriptome-wide association studies
2019
Transcriptome-wide association studies using predicted expression have identified thousands of genes whose locally regulated expression is associated with complex traits and diseases. In this work, we show that linkage disequilibrium induces significant gene–trait associations at non-causal genes as a function of the expression quantitative trait loci weights used in expression prediction. We introduce a probabilistic framework that models correlation among transcriptome-wide association study signals to assign a probability for every gene in the risk region to explain the observed association signal. Importantly, our approach remains accurate when expression data for causal genes are not available in the causal tissue by leveraging expression prediction from other tissues. Our approach yields credible sets of genes containing the causal gene at a nominal confidence level (for example, 90%) that can be used to prioritize genes for functional assays. We illustrate our approach by using an integrative analysis of lipid traits, where our approach prioritizes genes with strong evidence for causality.
FOCUS (fine-mapping of causal gene sets) models correlation among TWAS signals to assign a probability for every gene in the risk region to explain the observed association signal while controlling for pleiotropic SNP effects and unmeasured causal expression.
Journal Article
Harnessing landrace diversity empowers wheat breeding
2024
The authors thank G. Moore and M. Bevan for providing valuable feedback at multiple stages of the project; colleagues for assistance in Watkins field trial and phenotyping work from five experimental stations across China: Z. Zhu, Q. Wang, Y. Song, Y. Zhu and X. Zhang; the John Innes Centre (JIC) NBI Computing Infrastructure for Science group; the JIC Field Trials and Horticultural Services teams for support in field and glasshouse experiments; T. Florio for figure visualization; and the Rothamsted Research farm team and Analytical Chemistry unit for support in field experiments and analytical mineral analyses. This work was supported by the Program for Guangdong \\u201CZhuJiang\\u201D Introducing Innovative and Entrepreneurial Teams (2019ZT08N628), the National Natural Science Foundation of China (32022006), the Agricultural Science and Technology Innovation Program (CAAS-ASTIP-2021-AGIS-ZDRW202101), the Shenzhen Science and Technology Program (AGIS-ZDKY202002) to S. Cheng, and the Guangdong Basic and Applied Basic Research Foundation (2020A1515110677) to L.M. The UK work was possible owing to the long-term investment of the UK Biotechnology and Biological Sciences Research Council (BBSRC) in wheat research through Institute Strategic Programme (ISP) grants and longer larger grants: BBSRC LOLA \\u2018Enhancing diversity in UK wheat through a public sector prebreeding programme\\u2019 (BB/I002545/1); BBSRC ISP \\u2018JIC WISP ISP\\u2014Wheat Institute Strategic Programme\\u2019 (BB/J004596/1); BBSRC ISP \\u2018BBSRC Strategic Programme in Designing Future Wheat (DFW)\\u2019 (BB/P016855/1); BBSRC ISP \\u2018BBSRC Institute Strategic Programme: Delivering Sustainable Wheat (DSW)\\u2019 (BB/X011003/1) and for wheat germplasm conservation and global distribution through the Germplasm Resources BBSRC National Capability award (BBS/E/J/000PR8000). S.G. and C.L. also received support from the UK Department for Environment, Food and Rural Affairs (Defra) as part of WGIN phases 3 and 4 (CH0106 and CH0109). This work was also supported by the European Research Council (ERC-2019-COG-866328), the Sustainable Crop Production Research for International Development (SCPRID) programme (BB/J012017/1), the Mexican Consejo Nacional de Ciencia y Tecnolog\\u00EDa (CONACYT; 2018-000009-01EXTF-00306), the Science, Technology & Innovation Funding Authority (STDF), Egypt-UK Newton-Mosharafa Institutional Links award, project ID 30718 and EG\\u2013US cycle 19\\u2013project ID 42687.
Journal Article
False discovery rate control in genome-wide association studies with population structure
2021
We present a comprehensive statistical framework to analyze data from genome-wide association studies of polygenic traits, producing interpretable findings while controlling the false discovery rate. In contrast with standard approaches, our method can leverage sophisticated multivariate algorithms but makes no parametric assumptions about the unknown relation between genotypes and phenotype. Instead, we recognize that genotypes can be considered as a random sample from an appropriate model, encapsulating our knowledge of genetic inheritance and human populations. This allows the generation of imperfect copies (knockoffs) of these variables that serve as ideal negative controls, correcting for linkage disequilibrium and accounting for unknown population structure, which may be due to diverse ancestries or familial relatedness. The validity and effectiveness of our method are demonstrated by extensive simulations and by applications to the UK Biobank data. These analyses confirm our method is powerful relative to state-of-the-art alternatives, while comparisons with other studies validate most of our discoveries. Finally, fast software is made available for researchers to analyze Biobank-scale datasets.
Journal Article
Parental influence on human germline de novo mutations in 1,548 trios from Iceland
by
Masson, Gisli
,
Gudjonsson, Sigurjon Axel
,
Halldorsson, Bjarni V.
in
45/23
,
631/136/2434
,
631/181/2474
2017
Whole-genome sequencing data of 14,688 Icelanders, including 1,548 parent–offspring trios, show how the age and sex of parents affect the rate and spectrum of
de novo
mutations.
Parental age influences new mutations
Daniel Gudbjartsson and colleagues examine how the age and sex of parents influence the rate and spectrum of new (
de novo
) mutations (DNM). They sequenced the genomes of 14,688 individuals from Iceland, including 1,548 parent–offspring trios, 225 of which included at least one offspring in the third generation. They identify 108,778 high-quality DNMs, an average of 70.3 DNMs per trio, providing the largest available dataset of human DNMs so far. They find changes in the types and an increase in the number of DNMs with increased age of either parent, but with a higher rate of increase with paternal compared to maternal age.
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics
1
,
2
and to evolutionary studies
3
. To understand how the age and sex of transmitting parents affect
de novo
mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778
de novo
mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of
de novo
mutations from mothers increases by 0.37 per year of age (95% CI 0.32–0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45–1.57). The number of clustered mutations increases faster with the mother’s age than with the father’s, and the genomic span of maternal
de novo
mutation clusters is greater than that of paternal ones. The types of
de novo
mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19–0.33%) decrease in cytosine–phosphate–guanine to thymine–phosphate–guanine (CpG>TpG)
de novo
mutations and a 0.33% (95% CI 0.28–0.38%) increase in C>G
de novo
mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
Journal Article