Catalogue Search | MBRL

GC content shapes mRNA storage and decay in human cells

by Yi, Zhou , Morillon, Antonin , Brest, Patrick in Base Composition , Base Composition - genetics , Chromosomes and Gene Expression

2019

mRNA translation and decay appear often intimately linked although the rules of this interplay are poorly understood. In this study, we combined our recent P-body transcriptome with transcriptomes obtained following silencing of broadly acting mRNA decay and repression factors, and with available CLIP and related data. This revealed the central role of GC content in mRNA fate, in terms of P-body localization, mRNA translation and mRNA stability: P-bodies contain mostly AU-rich mRNAs, which have a particular codon usage associated with a low protein yield; AU-rich and GC-rich transcripts tend to follow distinct decay pathways; and the targets of sequence-specific RBPs and miRNAs are also biased in terms of GC content. Altogether, these results suggest an integrated view of post-transcriptional control in human cells where most translation regulation is dedicated to inefficiently translated AU-rich mRNAs, whereas control at the level of 5’ decay applies to optimally translated GC-rich mRNAs.

Journal Article

Share this book

Add to My Shelf

Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences

by Guo, Yan-Yan , Zhao, Hu-Sheng , Li, Hong-Kun in Angiosperms , Annotations , Assembling

2021

The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum . We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest one of Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an ∼75 kb large inversion in the large single copy (LSC) region shared by the two species. The most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads applied to the sequencing of plastomes with AT-biased base composition.

Journal Article

Share this book

Add to My Shelf

Impact of GC content on de novo gene birth

by Grislain, Antoine , Herman, Simon , Roginski, Paul in 631/181/2474 , 631/181/735 , 631/208/182

2026

Noncoding regions in eukaryotes are extensively expressed and represent a significant source of novel microproteins, some of which become fixed as de novo genes. However, the structural properties of these unevolved products and the features driving their fixation remain poorly understood. Particularly, the influence of nucleotide composition (GC content) on their structural properties and evolutionary trajectories is still unclear. Here, we predict the foldability and sequence properties of millions of microproteins potentially encoded in the noncoding open reading frames (ORFs) of 3,379 eukaryotic genomes with GC contents ranging from 18% to 79%. Depending on GC content, these microproteins exhibit distinct structural properties, suggesting different cellular impacts if non-genic regions are pervasively expressed. Using phylostratigraphy, de novo gene search, and ancestral sequence reconstruction, we trace the evolution of several hundred de novo proteins across 22 organisms with varying GC contents. We show that de novo genes preferentially emerge from GC-rich ORFs with folding potential, revealing that the interplay between GC content and foldability - rooted in the structure of the genetic code - shapes the emergence of novel genes. Noncoding DNA can generate microproteins, some of which evolve into new genes. This study finds that de novo genes preferentially originate from GC-rich, foldable sequences, showing how base composition influences the birth of new proteins.

Journal Article

Share this book

Add to My Shelf

Compensatory Evolution Following Deleterious Episodes of GC-biased Gene Conversion in Rodents

by Saclier, Nathanaëlle , Galtier, Nicolas , Riffis, Marie in Accumulation , Amino acids , Animal genetics

2025

GC-biased gene conversion (gBGC) is a widespread evolutionary force associated with meiotic recombination that favors the accumulation of deleterious AT to GC substitutions in proteins, moving them away from their fitness optimum. In many mammals, recombination hotspots have a rapid turnover, leading to episodic gBGC, with the accumulation of deleterious mutations stopping when the recombination hotspot dies. Selection is therefore expected to act to repair the damage caused by gBGC episodes through compensatory evolution. However, this process has never been studied or quantified so far. Here, we analyzed the nucleotide substitution pattern in coding sequences of a highly diversified group of Murinae rodents. Using phylogenetic analyses of about 70,000 coding exons, we identified numerous exon-specific, lineage-specific gBGC episodes, characterized by a clustering of synonymous AT to GC substitutions and by an increasing rate of nonsynonymous AT to GC substitutions, many of which are potentially deleterious. Analyzing the molecular evolution of the affected exons in downstream lineages, we found evidence for pervasive compensatory evolution after deleterious gBGC episodes. Compensation appears to occur rapidly after the end of the episode and to be driven by the standing genetic variation rather than new mutations. Our results demonstrate the impact of gBGC on the evolution of amino-acid sequences and underline the key role of epistasis in protein adaptation. This study contributes to a growing body of literature emphasizing that adaptive mutations, which arise in response to environmental changes, are just 1 subset of beneficial mutations, alongside mutations resulting from oscillations around the fitness optimum.

Journal Article

Share this book

Add to My Shelf

Gene body methylation is conserved between plant orthologs and is of evolutionary consequence

by Gaut, Brandon S. , Takuno, Shohei in Arabidopsis - genetics , Arabidopsis thaliana , Base Composition - genetics

2013

DNA methylation is a common feature of eukaryotic genomes and is especially common in noncoding regions of plants. Protein coding regions of plants are often methylated also, but the extent, function, and evolutionary consequences of gene body methylation remain unclear. Here we investigate gene body methylation using an explicit comparative evolutionary approach. We generated bisulfite sequencing data from two tissues of Brachypodium distachyon and compared genic methylation patterns to those of rice (Oryza sativa ssp. japonica). Gene body methylation was strongly conserved between orthologs of the two species and affected a biased subset of long, slowly evolving genes. Because gene body methylation is conserved over evolutionary time, it shapes important features of plant genome evolution, such as the bimodality of G+C content among grass genes. Our results superficially contradict previous observations of high cytosine methylation polymorphism within Arabidopsis thaliana genes, but reanalyses of these data are consistent with conservation of methylation within gene regions. Overall, our results indicate that the methylation level is a long-term property of individual genes and therefore of evolutionary consequence.

Journal Article

Share this book

Add to My Shelf

The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC

by Robinson-Rechavi, Marc , Gharib, Walid H in Positive selection , Robustness , Substitutes

2013

Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.

Journal Article

Share this book

Add to My Shelf

Gene Discovery and Tissue-Specific Transcriptome Analysis in Chickpea with Massively Parallel Pyrosequencing and Web Resource Development

by Yadav, Gitanjali , Garg, Rohini , Priya, Pushp in Base Composition , Base Composition - genetics , Base Sequence

2011

Chickpea (Cicer arietinum) is an important food legume crop but lags in the availability of genomic resources. In this study, we have generated about 2 million high-quality sequences of average length of 372 bp using pyrosequencing technology. The optimization of de novo assembly clearly indicated that hybrid assembly of long-read and short-read primary assemblies gave better results. The hybrid assembly generated a set of 34,760 transcripts with an average length of 1,020 bp representing about 4.8% (35.5 Mb) of the total chickpea genome. We identified more than 4,000 simple sequence repeats, which can be developed as functional molecular markers in chickpea. Putative function and Gene Ontology terms were assigned to at least 73.2% and 71.0% of chickpea transcripts, respectively. We have also identified several chickpea transcripts that showed tissue-specific expression and validated the results using real-time polymerase chain reaction analysis. Based on sequence comparison with other species within the plant kingdom, we identified two sets of lineage-specific genes, including those conserved in the Fabaceae family (legume specific) and those lacking significant similarity with any non chickpea species (chickpea specific). Finally, we have developed a Web resource, Chickpea Transcriptome Database, which provides public access to the data and results reported in this study The strategy for optimization of de novo assembly presented here may further facilitate the transcriptome sequencing and characterization in other organisms. Most importantly, the data and results reported in this study will help to accelerate research in various areas of genomics and implementing breeding programs in chickpea.

Journal Article

Share this book

Add to My Shelf

Metabolic characteristics of the species Variovorax paradoxus

by Steinbüchel, Alexander , Wübbeler, Jan Hendrik , Satola, Barbara in 1-aminocyclopropane-1-carboxylic acid , 2,4-D , 2-methylisoborneol

2013

This review outlines information about the Gram-negative, aerobic bacterium Variovorax paradoxus. The genomes of these species have G+C contents of 66.5–69.4 mol%, and the cells form yellow colonies. Some strains of V. paradoxus are facultative lithoautotrophic, others are chemoorganotrophic. Many of them are associated with important catabolic processes including the degradation of toxic and/or complex chemical compounds. The degradation pathways or other skills related to the following compounds, respectively, are described in this review: sulfolane, 3-sulfolene, 2-mercaptosuccinic acid, 3,3′-thiodipropionic acid, aromatic sulfonates, alkanesulfonates, amino acids and other sulfur sources, polychlorinated biphenyls, dimethyl terephthalate, linuron, 2,4-dinitrotoluene, homovanillate, veratraldehyde, 2,4-dichlorophenoxyacetic acid, anthracene, poly(3-hydroxybutyrate), chitin, cellulose, humic acids, metal–EDTA complexes, yttrium, rare earth elements, As(III), trichloroethylene, capsaicin, 3-nitrotyrosine, acyl-homoserine lactones, 1-aminocyclopropane-1-carboxylate, methyl tert -butyl ether, geosmin, and 2-methylisoborneol. Strains of V. paradoxus are also engaged in mutually beneficial interactions with other plant and bacterial species in various ecosystems. This species comprises probably promising strains for bioremediation and other biotechnical applications. Lately, the complete genomes of strains S110 and EPS have been sequenced for further investigations.

Journal Article

Share this book

Add to My Shelf

Impact of Recombination on the Base Composition of Bacteria and Archaea

by Louis-Marie Bobay , Ochman, Howard in Bacteria , Base composition , Bias

2017

The mutational process in bacteria is biased toward A and T, and most species are GC-rich relative to the mutational input to their genome. It has been proposed that the shift in base composition is an adaptive process—that natural selection operates to increase GC-contents—and there is experimental evidence that bacterial strains with GC-rich versions of genes have higher growth rates than those strains with AT-rich versions expressing identical proteins. Alternatively, a nonadaptive process, GC-biased gene conversion (gBGC), could also increase the GC-content of DNA due to the mechanistic bias of gene conversion events during recombination. To determine what role recombination plays in the base composition of bacterial genomes, we compared the spectrum of nucleotide polymorphisms introduced by recombination in all microbial species represented by large numbers of sequenced strains. We found that recombinant alleles are consistently biased toward A and T, and that the magnitude of AT-bias introduced by recombination is similar to that of mutations. These results indicate that recombination alone, without the intervention of selection, is unlikely to counteract the AT-enrichment of bacterial genomes.

Journal Article

Share this book

Add to My Shelf

selective force favoring increased G+C content in bacterial genes

by Raghavan, Rahul , Ochman, Howard , Kelkar, Yogeshwar D in Adaptation, Biological - genetics , Bacteria , Bacterial genomes

2012

Bacteria display considerable variation in their overall base compositions, which range from 13% to over 75% G+C. This variation in genomic base compositions has long been considered to be a strictly neutral character, due solely to differences in the mutational process; however, recent sequence comparisons indicate that mutational input alone cannot produce the observed base compositions, implying a role for natural selection. Because bacterial genomes have high gene content, forces that operate on the base composition of individual genes could help shape the overall genomic base composition. To explore this possibility, we tested whether genes that encode the same protein but vary only in their base compositions at synonymous sites have effects on bacterial fitness. Escherichia coli strains harboring G+C-rich versions of genes display higher growth rates, indicating that despite a pervasive mutational bias toward A+T, a selective force, independent of adaptive codon use, is driving genes toward higher G+C contents.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter