Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
723
result(s) for
"Chromosome Mapping - statistics "
Sort by:
Estimation and Tests of Haplotype-Environment Interaction when Linkage Phase Is Ambiguous
by
Silverman, E.K.
,
Laird, N.M.
,
Lyon, H.
in
Algorithms
,
Anti-Inflammatory Agents - therapeutic use
,
Asthma - drug therapy
2003
In the study of complex traits, the utility of linkage analysis and single marker association tests can be limited for researchers attempting to elucidate the complex interplay between a gene and environmental covariates. For these purposes, tests of gene-environment interactions are needed. In addition, recent studies have indicated that haplotypes, which are specific combinations of nucleotides on the same chromosome, may be more suitable as the unit of analysis for statistical tests than single genetic markers. The difficulty with this approach is that, in standard laboratory genotyping, haplotypes are often not directly observable. Instead, unphased marker phenotypes are collected. In this article, we present a method for estimating and testing haplotype-environment interactions when linkage phase is potentially ambiguous. The method builds on the work of Schaid et al. [2002] and is applicable to any trait that can be placed in the generalized linear model framework. Simulations were run to illustrate the salient features of the method. In addition, the method was used to test for haplotypesmoking exposure interaction with data from the Childhood Asthma Management Program.
Journal Article
Whole-exome imputation within UK Biobank powers rare coding variant association and fine-mapping analyses
2021
Exome association studies to date have generally been underpowered to systematically evaluate the phenotypic impact of very rare coding variants. We leveraged extensive haplotype sharing between 49,960 exome-sequenced UK Biobank participants and the remainder of the cohort (total
n
≈ 500,000) to impute exome-wide variants with accuracy
R
2
> 0.5 down to minor allele frequency (MAF) ~0.00005. Association and fine-mapping analyses of 54 quantitative traits identified 1,189 significant associations (
P
< 5 × 10
−8
) involving 675 distinct rare protein-altering variants (MAF < 0.01) that passed stringent filters for likely causality. Across all traits, 49% of associations (578/1,189) occurred in genes with two or more hits; follow-up analyses of these genes identified allelic series containing up to 45 distinct ‘likely-causal’ variants. Our results demonstrate the utility of within-cohort imputation in population-scale genome-wide association studies, provide a catalog of likely-causal, large-effect coding variant associations and foreshadow the insights that will be revealed as genetic biobank studies continue to grow.
Imputation of rare coding variants in the UK Biobank enables association and fine-mapping analyses of rare (minor allele frequency (MAF) = 0.00005) genotypes, identifying 600 new variant–trait associations, including long allelic series in individual genes.
Journal Article
Mapping and phasing of structural variation in patient genomes using nanopore sequencing
by
Middelkamp, Sjors
,
de Bruijn, Ewart
,
Nieboer, Marleen M.
in
631/208/212/2301
,
631/208/514
,
631/208/726/649/2157
2017
Despite improvements in genomics technology, the detection of structural variants (SVs) from short-read sequencing still poses challenges, particularly for complex variation. Here we analyse the genomes of two patients with congenital abnormalities using the MinION nanopore sequencer and a novel computational pipeline—NanoSV. We demonstrate that nanopore long reads are superior to short reads with regard to detection of de novo chromothripsis rearrangements. The long reads also enable efficient phasing of genetic variations, which we leveraged to determine the parental origin of all de novo chromothripsis breakpoints and to resolve the structure of these complex rearrangements. Additionally, genome-wide surveillance of inherited SVs reveals novel variants, missed in short-read data sets, a large proportion of which are retrotransposon insertions. We provide a first exploration of patient genome sequencing with a nanopore sequencer and demonstrate the value of long-read sequencing in mapping and phasing of SVs for both clinical and research applications.
The detection of structural variants can be difficult with short-read sequencing technology, especially when variants are highly complex. Here, the authors use a MinION nanopore sequencer to analyse two patient genomes and develop NanoSV to map known and novel structural variants in long read data.
Journal Article
Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes
2018
We aggregated coding variant data for 81,412 type 2 diabetes cases and 370,832 controls of diverse ancestry, identifying 40 coding variant association signals (
P
< 2.2 × 10
−7
); of these, 16 map outside known risk-associated loci. We make two important observations. First, only five of these signals are driven by low-frequency variants: even for these, effect sizes are modest (odds ratio ≤1.29). Second, when we used large-scale genome-wide association data to fine-map the associated variants in their regional context, accounting for the global enrichment of complex trait associations in coding sequence, compelling evidence for coding variant causality was obtained for only 16 signals. At 13 others, the associated coding variants clearly represent ‘false leads’ with potential to generate erroneous mechanistic inference. Coding variant associations offer a direct route to biological insight for complex diseases and identification of validated therapeutic targets; however, appropriate mechanistic inference requires careful specification of their causal contribution to disease predisposition.
Trans-ethnic analyses of exome array data identify new risk loci for type 2 diabetes. Fine-mapping analyses using genome-wide association data show that the index coding variants represent the likely causal variants at only a subset of these loci.
Journal Article
Analysis of 589,306 genomes identifies individuals resilient to severe Mendelian childhood diseases
2016
Human disease genetics is extended to the identification of individuals who remain healthy despite carrying highly penetrant disease-causing mutations.
Genetic studies of human disease have traditionally focused on the detection of disease-causing mutations in afflicted individuals. Here we describe a complementary approach that seeks to identify healthy individuals resilient to highly penetrant forms of genetic childhood disorders. A comprehensive screen of 874 genes in 589,306 genomes led to the identification of 13 adults harboring mutations for 8 severe Mendelian conditions, with no reported clinical manifestation of the indicated disease. Our findings demonstrate the promise of broadening genetic studies to systematically search for well individuals who are buffering the effects of rare, highly penetrant, deleterious mutations. They also indicate that incomplete penetrance for Mendelian diseases is likely more common than previously believed. The identification of resilient individuals may provide a first step toward uncovering protective genetic variants that could help elucidate the mechanisms of Mendelian diseases and new therapeutic strategies.
Journal Article
Detecting epistasis with the marginal epistasis test in genetic mapping studies of quantitative traits
by
Crawford, Lorin
,
Zeng, Ping
,
Zhou, Xiang
in
Algorithms
,
Biology and Life Sciences
,
Chromosome Mapping - statistics & numerical data
2017
Epistasis, commonly defined as the interaction between multiple genes, is an important genetic component underlying phenotypic variation. Many statistical methods have been developed to model and identify epistatic interactions between genetic variants. However, because of the large combinatorial search space of interactions, most epistasis mapping methods face enormous computational challenges and often suffer from low statistical power due to multiple test correction. Here, we present a novel, alternative strategy for mapping epistasis: instead of directly identifying individual pairwise or higher-order interactions, we focus on mapping variants that have non-zero marginal epistatic effects-the combined pairwise interaction effects between a given variant and all other variants. By testing marginal epistatic effects, we can identify candidate variants that are involved in epistasis without the need to identify the exact partners with which the variants interact, thus potentially alleviating much of the statistical and computational burden associated with standard epistatic mapping procedures. Our method is based on a variance component model, and relies on a recently developed variance component estimation method for efficient parameter inference and p-value computation. We refer to our method as the \"MArginal ePIstasis Test\", or MAPIT. With simulations, we show how MAPIT can be used to estimate and test marginal epistatic effects, produce calibrated test statistics under the null, and facilitate the detection of pairwise epistatic interactions. We further illustrate the benefits of MAPIT in a QTL mapping study by analyzing the gene expression data of over 400 individuals from the GEUVADIS consortium.
Journal Article
The Statistics of Bulk Segregant Analysis Using Next Generation Sequencing
by
Magwene, Paul M.
,
Kelly, John K.
,
Willis, John H.
in
Biology
,
Chromosome Mapping - statistics & numerical data
,
Computational Biology
2011
We describe a statistical framework for QTL mapping using bulk segregant analysis (BSA) based on high throughput, short-read sequencing. Our proposed approach is based on a smoothed version of the standard G statistic, and takes into account variation in allele frequency estimates due to sampling of segregants to form bulks as well as variation introduced during the sequencing of bulks. Using simulation, we explore the impact of key experimental variables such as bulk size and sequencing coverage on the ability to detect QTLs. Counterintuitively, we find that relatively large bulks maximize the power to detect QTLs even though this implies weaker selection and less extreme allele frequency differences. Our simulation studies suggest that with large bulks and sufficient sequencing depth, the methods we propose can be used to detect even weak effect QTLs and we demonstrate the utility of this framework by application to a BSA experiment in the budding yeast Saccharomyces cerevisiae.
Journal Article
Kernel-Based Association Test
by
Fann, Cathy S. J
,
Hsieh, Hsin-Yi
,
Yang, Hsin-Chou
in
Alcoholism
,
Alcoholism - genetics
,
Alzheimer's disease
2008
Association mapping (i.e., linkage disequilibrium mapping) is a powerful tool for positional cloning of disease genes. We propose a kernel-based association test (KBAT), which is a composite function of “P-values of single-locus association tests” and “kernel weights related to intermarker distances and/or linkage disequilibria.” The KBAT is a general form of some current test statistics. This method can be applied to the study of candidate genes and can scan each chromosome using a moving average procedure. We evaluated the performance of the KBAT through simulation studies that considered evolutionary parameters, disease models, sample sizes, kernel functions, test statistics, window attributes, empirical P-value estimations, and genetic/physical maps. The results showed that the KBAT had a well-controlled false positive rate and high power compared to existing methods. In addition, the KBAT was also applied to analyze a genomewide data set from the Collaborative Study on the Genetics of Alcoholism. Important genes associated with alcoholism dependence were identified. In summary, the merits of the KBAT are multifold: the KBAT is robust against the inclusion of nuisance markers, is invariant to the map scale, and accommodates different types of genomic data, study designs, and study purposes. The proposed methods are packaged in the user-friendly software, KBAT, available at http://www.stat.sinica.edu.tw/hsinchou/genetics/association/KBAT.htm.
Journal Article
Quantifying the mapping precision of genome-wide association studies using whole-genome sequencing data
2017
Background
Understanding the mapping precision of genome-wide association studies (GWAS), that is the physical distances between the top associated single-nucleotide polymorphisms (SNPs) and the causal variants, is essential to design fine-mapping experiments for complex traits and diseases.
Results
Using simulations based on whole-genome sequencing (WGS) data from 3642 unrelated individuals of European descent, we show that the association signals at rare causal variants (minor allele frequency ≤ 0.01) are very unlikely to be mapped to common variants in GWAS using either WGS data or imputed data and vice versa. We predict that at least 80% of the common variants identified from published GWAS using imputed data are within 33.5 Kbp of the causal variants, a resolution that is comparable with that using WGS data. Mapping precision at these loci will improve with increasing sample sizes of GWAS in the future. For rare variants, the mapping precision of GWAS using WGS data is extremely high, suggesting WGS is an efficient strategy to detect and fine-map rare variants simultaneously. We further assess the mapping precision by linkage disequilibrium between GWAS hits and causal variants and develop an online tool (gwasMP) to query our results with different thresholds of physical distance and/or linkage disequilibrium (
http://cnsgenomics.com/shiny/gwasMP
).
Conclusions
Our findings provide a benchmark to inform future design and development of fine-mapping experiments and technologies to pinpoint the causal variants at GWAS loci.
Journal Article
Optimizing expression quantitative trait locus mapping workflows for single-cell studies
by
Azodi, Christina B.
,
Cuomo, Anna S. E.
,
Alvari, Giordano
in
Alleles
,
Animal Genetics and Genomics
,
Best practice
2021
Background
Single-cell RNA sequencing (scRNA-seq) has enabled the unbiased, high-throughput quantification of gene expression specific to cell types and states. With the cost of scRNA-seq decreasing and techniques for sample multiplexing improving, population-scale scRNA-seq, and thus single-cell expression quantitative trait locus (sc-eQTL) mapping, is increasingly feasible. Mapping of sc-eQTL provides additional resolution to study the regulatory role of common genetic variants on gene expression across a plethora of cell types and states and promises to improve our understanding of genetic regulation across tissues in both health and disease.
Results
While previously established methods for bulk eQTL mapping can, in principle, be applied to sc-eQTL mapping, there are a number of open questions about how best to process scRNA-seq data and adapt bulk methods to optimize sc-eQTL mapping. Here, we evaluate the role of different normalization and aggregation strategies, covariate adjustment techniques, and multiple testing correction methods to establish best practice guidelines. We use both real and simulated datasets across single-cell technologies to systematically assess the impact of these different statistical approaches.
Conclusion
We provide recommendations for future single-cell eQTL studies that can yield up to twice as many eQTL discoveries as default approaches ported from bulk studies.
Journal Article