Catalogue Search | MBRL

by Scally, Aylwyn , Mathieson, Iain in Biology and Life Sciences , Biotechnology industries , Chromosomes

2020

Sophisticated methods have been developed to infer and visualise these relationships. [...]it seems that both scientists and the wider public are learning more and more about ancestry, and there is an optimistic sense that genetic data provide an exhaustive repository of ancestral information. [...]most statements about ancestry are really statements about genetic similarity, which has a complex relationship with ancestry, and can only be related to it by making assumptions about human demography whose validity is uncertain and difficult to test. The natural definition of this kind of ancestry is genetic ancestry, which differs from genealogical ancestry in that it refers not to your pedigree but to the subset of paths through it by which the material in your genome has been inherited. Because parents transmit only half their DNA to offspring each generation, an individual’s genetic ancestry involves only a small proportion of all their genealogical ancestors [1,2]. Different positions in the genome may have different paths of inheritance, because parental chromosomes are shuffled together during meiotic recombination. [...]the difference between genealogical and genetic ancestry can be summed up by the observation that full siblings have identical genealogical ancestry but differ in their genetic ancestry, due to differences in the transmission of chromosomal segments from their parents.

Journal Article

Share this book

Add to My Shelf

The evolution of skin pigmentation-associated variation in West Eurasia

by Ju, Dan , Mathieson, Iain in Alleles , Asia , Asian Continental Ancestry Group - genetics

2021

Skin pigmentation is a classic example of a polygenic trait that has experienced directional selection in humans. Genome-wide association studies have identified well over a hundred pigmentation-associated loci, and genomic scans in present-day and ancient populations have identified selective sweeps for a small number of light pigmentation-associated alleles in Europeans. It is unclear whether selection has operated on all of the genetic variation associated with skin pigmentation as opposed to just a small number of large-effect variants. Here, we address this question using ancient DNA from 1,158 individuals from West Eurasia covering a period of 40,000 y combined with genome-wide association summary statistics from the UK Biobank. We find a robust signal of directional selection in ancient West Eurasians on 170 skin pigmentation-associated variants ascertained in the UK Biobank. However, we also show that this signal is driven by a limited number of large-effect variants. Consistent with this observation, we find that a polygenic selection test in present-day populations fails to detect selection with the full set of variants. Our data allow us to disentangle the effects of admixture and selection. Most notably, a large-effect variant at SLC24A5 was introduced to Western Europe by migrations of Neolithic farming populations but continued to be under selection post-admixture. This study shows that the response to selection for light skin pigmentation in West Eurasia was driven by a relatively small proportion of the variants that are associated with present-day phenotypic variation.

Journal Article

Share this book

Add to My Shelf

Differential confounding of rare and common variants in spatially structured populations

by McVean, Gil , Mathieson, Iain in 631/208/205/2138 , 631/208/457/649 , Agriculture

2012

Gil McVean and Iain Mathieson report an analysis of the differential effects of population stratification on rare and common variants within association studies. They find that rare variants may show stronger stratification in some situations and that this is not corrected for by current structure methods, suggesting the need for the development of new statistical methods. Well-powered genome-wide association studies, now made possible through advances in technology and large-scale collaborative projects, promise to characterize the contribution of rare variants to complex traits and disease. However, while population structure is a known confounder of association studies, it remains unknown whether methods developed to control stratification are equally effective for rare variants. Here, we demonstrate that rare variants can show a stratification that is systematically different from, and typically stronger than, common variants, and this is not necessarily corrected by existing methods. We show that the same process leads to inflation for load-based tests and can obscure signals at truly associated variants. Furthermore, we show that populations can display spatial structure in rare variants, even when Wright's fixation index F ST is low, but that allele frequency–dependent metrics of allele sharing can reveal localized stratification. These results underscore the importance of collecting and integrating spatial information in the genetic analysis of complex traits.

Journal Article

Share this book

Add to My Shelf

Limited Evidence for Selection at the FADS Locus in Native American Populations

by Mathieson, Iain in Haplotypes , Indigenous peoples , Minority & ethnic groups

2020

The FADS locus contains the genes FADS1 and FADS2 that encode enzymes involved in the synthesis of long-chain polyunsaturated fatty acids. This locus appears to have been a repeated target of selection in human evolution, likely because dietary input of long-chain polyunsaturated fatty acids varied over time depending on environment and subsistence strategy. Several recent studies have identified selection at the FADS locus in Native American populations, interpreted as evidence for adaptation during or subsequent to the passage through Beringia. Here, we show that these signals are confounded by independent selection—postdating the split from Native Americans—in the European and, possibly, the East Asian populations used in the population branch statistic test. This is supported by direct evidence from ancient DNA that one of the putatively selected haplotypes was already common in Northern Eurasia at the time of the separation of Native American ancestors. An explanation for the present-day distribution of the haplotype that is more consistent with the data is that Native Americans retain the ancestral state of Paleolithic Eurasians. Another haplotype at the locus may reflect a secondary selection signal, although its functional impact is unknown.

Journal Article

Share this book

Add to My Shelf

Demographic history mediates the effect of stratification on polygenic scores

by Mathieson, Iain , Zaidi, Arslan A in Bias , demographic history , Demography

2020

Population stratification continues to bias the results of genome-wide association studies (GWAS). When these results are used to construct polygenic scores, even subtle biases can cumulatively lead to large errors. To study the effect of residual stratification, we simulated GWAS under realistic models of demographic history. We show that when population structure is recent, it cannot be corrected using principal components of common variants because they are uninformative about recent history. Consequently, polygenic scores are biased in that they recapitulate environmental structure. Principal components calculated from rare variants or identity-by-descent segments can correct this stratification for some types of environmental effects. While family-based studies are immune to stratification, the hybrid approach of ascertaining variants in GWAS but reestimating effect sizes in siblings reduces but does not eliminate stratification. We show that the effect of population stratification depends not only on allele frequencies and environmental structure but also on demographic history.

Journal Article

Share this book

Add to My Shelf

Differences in the rare variant spectrum among human populations

by Mathieson, Iain , Reich, David in Algorithms , Analysis , Asia

2017

Mutations occur at vastly different rates across the genome, and populations, leading to differences in the spectrum of segregating polymorphisms. Here, we investigate variation in the rare variant spectrum in a sample of human genomes representing all major world populations. We find at least two distinct signatures of variation. One, consistent with a previously reported signature is characterized by an increased rate of TCC>TTC mutations in people from Western Eurasia and South Asia, likely related to differences in the rate, or efficiency of repair, of damage due to deamination of methylated guanine. We describe the geographic extent of this signature and show that it is detectable in the genomes of ancient, but not archaic humans. The second signature is private to certain Native American populations, and is concentrated at CpG sites. We show that this signature is not driven by differences in the CpG mutation rate, but is a result of the fact that highly mutable CpG sites are more likely to undergo multiple independent mutations across human populations, and the spectrum of such mutations is highly sensitive to recent demography. Both of these effects dramatically affect the spectrum of rare variants across human populations, and should be taken into account when using mutational clocks to make inference about demography.

Journal Article

Share this book

Add to My Shelf

Tracking human population structure through time from whole genome sequences

by O’Connell, Jared , Mathieson, Iain , Wang, Ke in Analysis , Biodiversity , Biology and Life Sciences

2020

The genetic diversity of humans, like many species, has been shaped by a complex pattern of population separations followed by isolation and subsequent admixture. This pattern, reaching at least as far back as the appearance of our species in the paleontological record, has left its traces in our genomes. Reconstructing a population's history from these traces is a challenging problem. Here we present a novel approach based on the Multiple Sequentially Markovian Coalescent (MSMC) to analyze the separation history between populations. Our approach, called MSMC-IM, uses an improved implementation of the MSMC (MSMC2) to estimate coalescence rates within and across pairs of populations, and then fits a continuous Isolation-Migration model to these rates to obtain a time-dependent estimate of gene flow. We show, using simulations, that our method can identify complex demographic scenarios involving post-split admixture or archaic introgression. We apply MSMC-IM to whole genome sequences from 15 worldwide populations, tracking the process of human genetic diversification. We detect traces of extremely deep ancestry between some African populations, with around 1% of ancestry dating to divergences older than a million years ago.

Journal Article

Share this book

Add to My Shelf

Demography and the Age of Rare Variants

by McVean, Gil , Mathieson, Iain in Africa , Algorithms , Asia

2014

Large whole-genome sequencing projects have provided access to much rare variation in human populations, which is highly informative about population structure and recent demography. Here, we show how the age of rare variants can be estimated from patterns of haplotype sharing and how these ages can be related to historical relationships between populations. We investigate the distribution of the age of variants occurring exactly twice (ƒ(2) variants) in a worldwide sample sequenced by the 1000 Genomes Project, revealing enormous variation across populations. The median age of haplotypes carrying ƒ(2) variants is 50 to 160 generations across populations within Europe or Asia, and 170 to 320 generations within Africa. Haplotypes shared between continents are much older with median ages for haplotypes shared between Europe and Asia ranging from 320 to 670 generations. The distribution of the ages of ƒ(2) haplotypes is informative about their demography, revealing recent bottlenecks, ancient splits, and more modern connections between populations. We see the effect of selection in the observation that functional variants are significantly younger than nonfunctional variants of the same frequency. This approach is relatively insensitive to mutation rate and complements other nonparametric methods for demographic inference.

Journal Article

Share this book

Add to My Shelf

Estimating Selection Coefficients in Spatially Structured Populations from Time Series Data of Allele Frequencies

by McVean, Gil , Mathieson, Iain in Animal Migration , Animals , Evolution

2013

Inferring the nature and magnitude of selection is an important problem in many biological contexts. Typically when estimating a selection coefficient for an allele, it is assumed that samples are drawn from a panmictic population and that selection acts uniformly across the population. However, these assumptions are rarely satisfied. Natural populations are almost always structured, and selective pressures are likely to act differentially. Inference about selection ought therefore to take account of structure. We do this by considering evolution in a simple lattice model of spatial population structure. We develop a hidden Markov model based maximum-likelihood approach for estimating the selection coefficient in a single population from time series data of allele frequencies. We then develop an approximate extension of this to the structured case to provide a joint estimate of migration rate and spatially varying selection coefficients. We illustrate our method using classical data sets of moth pigmentation morph frequencies, but it has wide applications in settings ranging from ecology to human evolution.

Journal Article

Share this book

Add to My Shelf

Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications

by Phan, Hang , Twigg, Stephen R F , McVean, Gil in 631/114/2785 , 631/208/514/1948 , 692/308/2056

2014

Gerton Lunter and colleagues report Platypus software, which combines a haplotype-based multi-sample variant caller with local sequence assembly in a Bayesian statistical framework. They demonstrate applications to exome and whole-genome data sets, to the identification de novo mutations in parent-offspring trios and to the genotyping of HLA loci. High-throughput DNA sequencing technology has transformed genetic research and is starting to make an impact on clinical practice. However, analyzing high-throughput sequencing data remains challenging, particularly in clinical settings where accuracy and turnaround times are critical. We present a new approach to this problem, implemented in a software package called Platypus. Platypus achieves high sensitivity and specificity for SNPs, indels and complex polymorphisms by using local de novo assembly to generate candidate variants, followed by local realignment and probabilistic haplotype estimation. It is an order of magnitude faster than existing tools and generates calls from raw aligned read data without preprocessing. We demonstrate the performance of Platypus in clinically relevant experimental designs by comparing with SAMtools and GATK on whole-genome and exome-capture data, by identifying de novo variation in 15 parent-offspring trios with high sensitivity and specificity, and by estimating human leukocyte antigen genotypes directly from variant calls.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter