Catalogue Search | MBRL

A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response

by Wilson, James G.; Forer, Lukas; Palmer, Nicholette D. in 45/23; 45/43; 45/61

2021

Fine-mapping to plausible causal variation may be more effective in multi-ancestry cohorts, particularly in the MHC, which has population-specific structure. To enable such studies, we constructed a large (n = 21,546) HLA reference panel spanning five global populations based on whole-genome sequences. Despite population-specific long-range haplotypes, we demonstrated accurate imputation at G-group resolution (94.2%, 93.7%, 97.8% and 93.7% in admixed African (AA), East Asian (EAS), European (EUR) and Latino (LAT) populations). Applying HLA imputation to genome-wide association study data for HIV-1 viral load in three populations (EUR, AA and LAT), we obviated effects of previously reported associations from population-specific HIV studies and discovered a novel association at position 156 in HLA-B. We pinpointed the MHC association to three amino acid positions (97, 67 and 156) marking three consecutive pockets (C, B and D) within the HLA-B peptide-binding groove, explaining 12.9% of trait variance. A high-resolution reference panel based on whole-genome sequencing data enables accurate imputation of HLA alleles across diverse populations and fine-mapping of HLA association signals for HIV-1 host response.

Journal Article

FinnGen provides genetic insights from a well-phenotyped isolated population

by Aalto-Setälä, Katriina; Saarentaus, Elmo; Jacob, Howard in 45/43; 631/208/205/2138; 631/208/457/649/2219

2023

Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored [1,2]. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10⁻¹¹) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants. Genome-wide association studies of individuals from an isolated population (data from the Finnish biobank study FinnGen) and consequent meta-analyses facilitate the identification of previously unknown coding variant associations for both rare and common diseases.

Journal Article

Estimating the genome-wide contribution of selection to temporal allele frequency change

by Coop, Graham; Buffalo, Vince in Acclimatization - genetics; Adaptation; Adaptation, Biological - genetics

2020

Rapid phenotypic adaptation is often observed in natural populations and selection experiments. However, detecting the genome-wide impact of this selection is difficult since adaptation often proceeds from standing variation and selection on polygenic traits, both of which may leave faint genomic signals indistinguishable from a noisy background of genetic drift. One promising signal comes from the genome-wide covariance between allele frequency changes observable from temporal genomic data (e.g., evolve-and-resequence studies). These temporal covariances reflect how heritable fitness variation in the population leads changes in allele frequencies at one time point to be predictive of the changes at later time points, as alleles are indirectly selected due to remaining associations with selected alleles. Since genetic drift does not lead to temporal covariance, we can use these covariances to estimate what fraction of the variation in allele frequency change through time is driven by linked selection. Here, we reanalyze three selection experiments to quantify the effects of linked selection over short timescales using covariance among time points and across replicates. We estimate that at least 17 to 37% of allele frequency change is driven by selection in these experiments. Against this background of positive genome-wide temporal covariances, we also identify signals of negative temporal covariance corresponding to reversals in the direction of selection for a reasonable proportion of loci over the time course of a selection experiment. Overall, we find that in the three studies we analyzed, linked selection has a large impact on short-term allele frequency dynamics that is readily distinguishable from genetic drift.

Journal Article

Discovery of rare variants associated with blood pressure regulation through meta-analysis of 1.3 million individuals

by Hwang, Shih-Jen; Mutsert, Renée de; Helgadottir, Anna in 38/39; 45/43; 631/208

2020

Genetic studies of blood pressure (BP) to date have mainly analyzed common variants (minor allele frequency > 0.05). In a meta-analysis of up to ~1.3 million participants, we discovered 106 new BP-associated genomic regions and 87 rare (minor allele frequency ≤ 0.01) variant BP associations (P < 5 × 10⁻⁸), of which 32 were in new BP-associated loci and 55 were independent BP-associated single-nucleotide variants within known BP-associated regions. Average effects of rare variants (44% coding) were ~8 times larger than common variant effects and indicate potential candidate causal genes at new and known loci (for example, GATA5 and PLCB3). BP-associated variants (including rare and common) were enriched in regions of active chromatin in fetal tissues, potentially linking fetal development with BP regulation in later life. Multivariable Mendelian randomization suggested possible inverse effects of elevated systolic and diastolic BP on large artery stroke. Our study demonstrates the utility of rare-variant analyses for identifying candidate genes and the results highlight potential therapeutic targets. Meta-analyses in up to 1.3 million individuals identify 87 rare-variant associations with blood pressure traits. On average, rare variants exhibit effects ~8 times larger than the mean effects of common variants and implicate candidate causal genes at associated regions.

Journal Article

A saturated map of common genetic variants associated with human height

by Yin, Xianyong; Kumari, Meena; Engmann, Jorgen E. in 45/43; 631/208/205/2138; 631/208/480

2022

Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes [1]. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel [2]) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries. A large genome-wide association study of more than 5 million individuals reveals that 12,111 single-nucleotide polymorphisms account for nearly all the heritability of height attributable to common genetic variants.

Journal Article

SAIGE-GENE+ improves the efficiency and accuracy of set-based rare variant association tests

by Zhao, Zhangchen; Bi, Wenjian; Neale, Benjamin M. in 45/43; 631/114/794; 631/208/205/2138

2022

Several biobanks, including UK Biobank (UKBB), are generating large-scale sequencing data. An existing method, SAIGE-GENE, performs well when testing variants with minor allele frequency (MAF) ≤ 1%, but inflation is observed in variance component set-based tests when restricting to variants with MAF ≤ 0.1% or 0.01%. Here, we propose SAIGE-GENE+ with greatly improved type I error control and computational efficiency to facilitate rare variant tests in large-scale data. We further show that incorporating multiple MAF cutoffs and functional annotations can improve power and thus uncover new gene–phenotype associations. In the analysis of UKBB whole exome sequencing data for 30 quantitative and 141 binary traits, SAIGE-GENE+ identified 551 gene–phenotype associations. SAIGE-GENE+ performs set-based rare variant association tests with improved type 1 error control and computational efficiency by collapsing ultra-rare variants and conducting multiple tests corresponding to different minor allele frequency cutoffs and annotations.

Journal Article

Protein-coding variants implicate novel genes related to lipid homeostasis contributing to body-fat distribution

by Broer, Linda; Stirrups, Kathleen E.; Fox, Caroline S. in 45/43; 631/208/205; 692/308/2056

2019

Body-fat distribution is a risk factor for adverse cardiovascular health consequences. We analyzed the association of body-fat distribution, assessed by waist-to-hip ratio adjusted for body mass index, with 228,985 predicted coding and splice site variants available on exome arrays in up to 344,369 individuals from five major ancestries (discovery) and 132,177 European-ancestry individuals (validation). We identified 15 common (minor allele frequency, MAF ≥5%) and nine low-frequency or rare (MAF <5%) coding novel variants. Pathway/gene set enrichment analyses identified lipid particle, adiponectin, abnormal white adipose tissue physiology and bone development and morphology as important contributors to fat distribution, while cross-trait associations highlight cardiometabolic traits. In functional follow-up analyses, specifically in Drosophila RNAi-knockdowns, we observed a significant increase in the total body triglyceride levels for two genes (DNAH10 and PLXND1). We implicate novel genes in fat distribution, stressing the importance of interrogating low-frequency and protein-coding variants. A transancestral exome-wide association study for body-fat distribution identifies protein-coding variants that are significantly associated with waist-to-hip ratio adjusted for body mass index.

Journal Article

Genotyping, sequencing and analysis of 140,000 adults from Mexico City

by García-Ortiz, Humberto; Jones, Marcus; Zöllner, Sebastian in 45/23; 45/61; 631/181/2474

2023

The Mexico City Prospective Study is a prospective cohort of more than 150,000 adults recruited two decades ago from the urban districts of Coyoacán and Iztapalapa in Mexico City [1]. Here we generated genotype and exome-sequencing data for all individuals and whole-genome sequencing data for 9,950 selected individuals. We describe high levels of relatedness and substantial heterogeneity in ancestry composition across individuals. Most sequenced individuals had admixed Indigenous American, European and African ancestry, with extensive admixture from Indigenous populations in central, southern and southeastern Mexico. Indigenous Mexican segments of the genome had lower levels of coding variation but an excess of homozygous loss-of-function variants compared with segments of African and European origin. We estimated ancestry-specific allele frequencies at 142 million genomic variants, with an effective sample size of 91,856 for Indigenous Mexican ancestry at exome variants, all available through a public browser. Using whole-genome sequencing, we developed an imputation reference panel that outperforms existing panels at common variants in individuals with high proportions of central, southern and southeastern Indigenous Mexican ancestry. Our work illustrates the value of genetic studies in diverse populations and provides foundational imputation and allele frequency resources for future genetic studies in Mexico and in the United States, where the Hispanic/Latino population is predominantly of Mexican descent. Genotype and exome sequencing of 150,000 participants and whole-genome sequencing of 9,950 selected individuals recruited into the Mexico City Prospective Study constitute a valuable, publicly available resource of non-European sequencing data.

Journal Article

Signatures of negative selection in the genetic architecture of human complex traits

by Gibson, Greg; Visscher, Peter M.; Xue, Angli in 45/43; 631/208/205/2138; 631/208/457

2018

We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits. BayesS estimates SNP-based heritability, polygenicity, and the relationship between effect size and minor allele frequency using genome-wide SNP data. Applying BayesS to UK Biobank data identifies signatures of natural selection for 23 complex traits.

Journal Article

Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits

by Vrieze, Scott I.; Abecasis, Gonçalo R.; Visscher, Peter M. in 45/43; 631/114/794; 631/208/205/2138

2018

Multiple methods have been developed to estimate narrow-sense heritability, h², using single nucleotide polymorphisms (SNPs) in unrelated individuals. However, a comprehensive evaluation of these methods has not yet been performed, leading to confusion and discrepancy in the literature. We present the most thorough and realistic comparison of these methods to date. We used thousands of real whole-genome sequences to simulate phenotypes under varying genetic architectures and confounding variables, and we used array, imputed, or whole genome sequence SNPs to obtain ‘SNP-heritability’ estimates. We show that SNP-heritability can be highly sensitive to assumptions about the frequencies, effect sizes, and levels of linkage disequilibrium of underlying causal variants, but that methods that bin SNPs according to minor allele frequency and linkage disequilibrium are less sensitive to these assumptions across a wide range of genetic architectures and possible confounding factors. These findings provide guidance for best practices and proper interpretation of published estimates. This analysis compares methods for estimating the heritability and genetic architecture of complex traits using whole-genome data. The results provide guidance for best practices and proper interpretation of published heritability estimates.

Journal Article
