Catalogue Search | MBRL

Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution

by van Iterson, Maarten , van Zwet, Erik W. , Heijmans, Bastiaan T. in Age Factors , Animal Genetics and Genomics , Bayes Theorem

2017

We show that epigenome- and transcriptome-wide association studies (EWAS and TWAS) are prone to significant inflation and bias of test statistics, an unrecognized phenomenon introducing spurious findings if left unaddressed. Neither GWAS-based methodology nor state-of-the-art confounder adjustment methods completely remove bias and inflation. We propose a Bayesian method to control bias and inflation in EWAS and TWAS based on estimation of the empirical null distribution. Using simulations and real data, we demonstrate that our method maximizes power while properly controlling the false positive rate. We illustrate the utility of our method in large-scale EWAS and TWAS meta-analyses of age and smoking.

Journal Article

Share this book

Add to My Shelf

A linear mixed-model approach to study multivariate gene–environment interactions

by Moore, Rachel , Franke, Lude , Casale, Francesco Paolo in 631/208/191 , 631/208/205 , 639/705/531

2019

Different exposures, including diet, physical activity, or external conditions can contribute to genotype–environment interactions (G×E). Although high-dimensional environmental data are increasingly available and multiple exposures have been implicated with G×E at the same loci, multi-environment tests for G×E are not established. Here, we propose the structured linear mixed model (StructLMM), a computationally efficient method to identify and characterize loci that interact with one or more environments. After validating our model using simulations, we applied StructLMM to body mass index in the UK Biobank, where our model yields previously known and novel G×E signals. Finally, in an application to a large blood eQTL dataset, we demonstrate that StructLMM can be used to study interactions with hundreds of environmental variables. StructLMM is a new method to identify genotype–environment interactions (G×E) that involve multiple exposures or environments. When applied to UK Biobank and eQTL data, StructLMM discovers new G×E signals.

Journal Article

Share this book

Add to My Shelf

Genetic and environmental influences interact with age and sex in shaping the human methylome

by Ehli, Erik A. , van Dongen, Jenny , Willemsen, Gonneke in 631/208/212/177 , 631/337/176/1988 , Adolescent

2016

The methylome is subject to genetic and environmental effects. Their impact may depend on sex and age, resulting in sex- and age-related physiological variation and disease susceptibility. Here we estimate the total heritability of DNA methylation levels in whole blood and estimate the variance explained by common single nucleotide polymorphisms at 411,169 sites in 2,603 individuals from twin families, to establish a catalogue of between-individual variation in DNA methylation. Heritability estimates vary across the genome (mean=19%) and interaction analyses reveal thousands of sites with sex-specific heritability as well as sites where the environmental variance increases with age. Integration with previously published data illustrates the impact of genome and environment across the lifespan at methylation sites associated with metabolic traits, smoking and ageing. These findings demonstrate that our catalogue holds valuable information on locations in the genome where methylation variation between people may reflect disease-relevant environmental exposures or genetic variation. Differential impact of genetic and environmental influences on DNA methylation may result in sex- and age-related physiological variation and disease susceptibility. By analysing DNA methylome of 2,603 individuals from twin families, here, the authors establish a catalogue of between-individual variation in DNA methylation.

Journal Article

Share this book

Add to My Shelf

A characterization of cis- and trans-heritability of RNA-Seq-based gene expression

by Deelen Joris , Veldink, Jan H , Hottenga, Jouke J in Cytokines , DNA microarrays , Gene expression

2020

Insights into individual differences in gene expression and its heritability (h2) can help in understanding pathways from DNA to phenotype. We estimated the heritability of gene expression of 52,844 genes measured in whole blood in the largest twin RNA-Seq sample to date (1497 individuals including 459 monozygotic twin pairs and 150 dizygotic twin pairs) from classical twin modeling and identity-by-state-based approaches. We estimated for each gene h2total, composed of cis-heritability (h2cis, the variance explained by single nucleotide polymorphisms in the cis-window of the gene), and trans-heritability (h2res, the residual variance explained by all other genome-wide variants). Mean h2total was 0.26, which was significantly higher than heritability estimates earlier found in a microarray-based study using largely overlapping (>60%) RNA samples (mean h2 = 0.14, p = 6.15 × 10−258). Mean h2cis was 0.06 and strongly correlated with beta of the top cis expression quantitative loci (eQTL, ρ = 0.76, p < 10−308) and with estimates from earlier RNA-Seq-based studies. Mean h2res was 0.20 and correlated with the beta of the corresponding trans-eQTL (ρ = 0.04, p < 1.89 × 10−3) and was significantly higher for genes involved in cytokine-cytokine interactions (p = 4.22 × 10−15), many other immune system pathways, and genes identified in genome-wide association studies for various traits including behavioral disorders and cancer. This study provides a thorough characterization of cis- and trans-h2 estimates of gene expression, which is of value for interpretation of GWAS and gene expression studies.

Journal Article

Share this book

Add to My Shelf

DNA methylation in peripheral tissues and left-handedness

by Cuellar-Partida, Gabriel , Ehli, Erik A. , Relton, Caroline L. in 631/208/1515 , 631/208/177 , Adult

2022

Handedness has low heritability and epigenetic mechanisms have been proposed as an etiological mechanism. To examine this hypothesis, we performed an epigenome-wide association study of left-handedness. In a meta-analysis of 3914 adults of whole-blood DNA methylation, we observed that CpG sites located in proximity of handedness-associated genetic variants were more strongly associated with left-handedness than other CpG sites ( P = 0.04), but did not identify any differentially methylated positions. In longitudinal analyses of DNA methylation in peripheral blood and buccal cells from children ( N = 1737), we observed moderately stable associations across age (correlation range [0.355–0.578]), but inconsistent across tissues (correlation range [− 0.384 to 0.318]). We conclude that DNA methylation in peripheral tissues captures little of the variance in handedness. Future investigations should consider other more targeted sources of tissue, such as the brain.

Journal Article

Share this book

Add to My Shelf

Effects of smoking on genome-wide DNA methylation profiles: A study of discordant and concordant monozygotic twin pairs

by van Dongen, Jenny , Willemsen, Gonneke , Boomsma, Dorret I in Acetylcholine receptors (nicotinic) , Biobanks , Blood cells

2023

The genetic information of people who smoke present distinctive characteristics. In particular, previous research has revealed differences in patterns of DNA methylation, a type of chemical modification that helps cells switch certain genes on or off. However, most of these studies could not establish for sure whether these changes were caused by smoking, predisposed individuals to smoke, or were driven by underlying genetic variation in the DNA sequence itself. To investigate this question, van Dongen et al. examined DNA methylation data from the blood cells of over 700 pairs of identical twins. These individuals share the exact same genetic information, making it possible to better evaluate the impact of lifestyle on DNA modifications. The analyses identified differences in methylation at 13 DNA locations in pairs of twins where one was a current smoker and their sibling had never smoked. Two of the genes code for proteins involved in the response to nicotine, the primary addictive chemical in cigarette smoke. The differences were smaller if one of the twins had stopped smoking, suggesting that quitting can help to reverse some of these changes. These findings confirm that DNA methylation in blood cells is influenced by cigarette smoke, which could help to better understand smoking-associated diseases. They also demonstrate how useful identical twins studies can be to identify methylation changes that are markers of lifestyle.

Journal Article

Share this book

Add to My Shelf

Autosomal genetic variation is associated with DNA methylation in regions variably escaping X-chromosome inactivation

by Wijmenga, Cisca , Veldink, Jan H. , Franke, Lude in 45/43 , 631/208/176/1433 , 631/208/176/1988

2018

X-chromosome inactivation (XCI), i.e., the inactivation of one of the female X chromosomes, restores equal expression of X-chromosomal genes between females and males. However, ~10% of genes show variable degrees of escape from XCI between females, although little is known about the causes of variable XCI. Using a discovery data-set of 1867 females and 1398 males and a replication sample of 3351 females, we show that genetic variation at three autosomal loci is associated with female-specific changes in X-chromosome methylation. Through cis -eQTL expression analysis, we map these loci to the genes SMCHD1 / METTL4 , TRIM6 / HBG2 , and ZSCAN9 . Low-expression alleles of the loci are predominantly associated with mild hypomethylation of CpG islands near genes known to variably escape XCI, implicating the autosomal genes in variable XCI. Together, these results suggest a genetic basis for variable escape from XCI and highlight the potential of a population genomics approach to identify genes involved in XCI. DNA methylation is critically involved in X chromosome inactivation (XCI) and dosage compensation, yet some X-chromosomal genes escape XCI. Here, Lujik et al. identify three autosomal genetic loci that associate with differential DNA methylation near genes that variably escape XCI in females.

Journal Article

Share this book

Add to My Shelf

Genome-wide identification of directed gene networks using large-scale population genomics data

by Wijmenga, Cisca , Veldink, Jan H. , Franke, Lude in 38/91 , 45/43 , 631/114/2114

2018

Identification of causal drivers behind regulatory gene networks is crucial in understanding gene function. Here, we develop a method for the large-scale inference of gene–gene interactions in observational population genomics data that are both directed (using local genetic instruments as causal anchors, akin to Mendelian Randomization) and specific (by controlling for linkage disequilibrium and pleiotropy). Analysis of genotype and whole-blood RNA-sequencing data from 3072 individuals identified 49 genes as drivers of downstream transcriptional changes (Wald P < 7 × 10 −10 ), among which transcription factors were overrepresented (Fisher’s P = 3.3 × 10 −7 ). Our analysis suggests new gene functions and targets, including for SENP7 (zinc-finger genes involved in retroviral repression) and BCL2A1 (target genes possibly involved in auditory dysfunction). Our work highlights the utility of population genomics data in deriving directed gene expression networks. A resource of trans -effects for all 6600 genes with a genetic instrument can be explored individually using a web-based browser. Identification of causal drivers behind expression is essential for understanding gene function. Here authors develop a method for the large-scale inference of gene–gene interactions in observational population genomics data and characterize a network of trans -effects for 6600 genes.

Journal Article

Share this book

Add to My Shelf

Correction for both common and rare cell types in blood is important to identify genes that correlate with age

by Boomsma, Dorret I. , Slagboom, P. Eline , Westra, Harm-Jan in Adolescent , Adult , Aged

2021

Background Aging is a multifactorial process that affects multiple tissues and is characterized by changes in homeostasis over time, leading to increased morbidity. Whole blood gene expression signatures have been associated with aging and have been used to gain information on its biological mechanisms, which are still not fully understood. However, blood is composed of many cell types whose proportions in blood vary with age. As a result, previously observed associations between gene expression levels and aging might be driven by cell type composition rather than intracellular aging mechanisms. To overcome this, previous aging studies already accounted for major cell types, but the possibility that the reported associations are false positives driven by less prevalent cell subtypes remains. Results Here, we compared the regression model from our previous work to an extended model that corrects for 33 additional white blood cell subtypes. Both models were applied to whole blood gene expression data from 3165 individuals belonging to the general population (age range of 18–81 years). We evaluated that the new model is a better fit for the data and it identified fewer genes associated with aging (625, compared to the 2808 of the initial model; P ≤ 2.5⨯10 −6 ). Moreover, 511 genes (~ 18% of the 2808 genes identified by the initial model) were found using both models, indicating that the other previously reported genes could be proxies for less abundant cell types. In particular, functional enrichment of the genes identified by the new model highlighted pathways and GO terms specifically associated with platelet activity. Conclusions We conclude that gene expression analyses in blood strongly benefit from correction for both common and rare blood cell types, and recommend using blood-cell count estimates as standard covariates when studying whole blood gene expression.

Journal Article

Share this book

Add to My Shelf

Phenotype prediction using biologically interpretable neural networks on multi-cohort multi-omics data

by Boomsma, Dorret I , Veldink, Jan H , Hottenga, Jouke J in Biological analysis , Blood levels , CD34 antigen

2024

Integrating multi-omics data into predictive models has the potential to enhance accuracy, which is essential for precision medicine. In this study, we developed interpretable predictive models for multi-omics data by employing neural networks informed by prior biological knowledge, referred to as visible networks. These neural networks offer insights into the decision-making process and can unveil novel perspectives on the underlying biological mechanisms associated with traits and complex diseases. We tested the performance, interpretability and generalizability for inferring smoking status, subject age and LDL levels using genome-wide RNA expression and CpG methylation data from the blood of the BIOS consortium (four population cohorts, Ntotal = 2940). In a cohort-wise cross-validation setting, the consistency of the diagnostic performance and interpretation was assessed. Performance was consistently high for predicting smoking status with an overall mean AUC of 0.95 (95% CI: 0.90–1.00) and interpretation revealed the involvement of well-replicated genes such as AHRR, GPR15 and LRRN3. LDL-level predictions were only generalized in a single cohort with an R2 of 0.07 (95% CI: 0.05–0.08). Age was inferred with a mean error of 5.16 (95% CI: 3.97–6.35) years with the genes COL11A2, AFAP1, OTUD7A, PTPRN2, ADARB2 and CD34 consistently predictive. For both regression tasks, we found that using multi-omics networks improved performance, stability and generalizability compared to interpretable single omic networks. We believe that visible neural networks have great potential for multi-omics analysis; they combine multi-omic data elegantly, are interpretable, and generalize well to data from different cohorts.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter