Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
16,674
result(s) for
"Base Composition"
Sort by:
GC content shapes mRNA storage and decay in human cells
by
Yi, Zhou
,
Morillon, Antonin
,
Brest, Patrick
in
Base Composition
,
Base Composition - genetics
,
Chromosomes and Gene Expression
2019
mRNA translation and decay appear often intimately linked although the rules of this interplay are poorly understood. In this study, we combined our recent P-body transcriptome with transcriptomes obtained following silencing of broadly acting mRNA decay and repression factors, and with available CLIP and related data. This revealed the central role of GC content in mRNA fate, in terms of P-body localization, mRNA translation and mRNA stability: P-bodies contain mostly AU-rich mRNAs, which have a particular codon usage associated with a low protein yield; AU-rich and GC-rich transcripts tend to follow distinct decay pathways; and the targets of sequence-specific RBPs and miRNAs are also biased in terms of GC content. Altogether, these results suggest an integrated view of post-transcriptional control in human cells where most translation regulation is dedicated to inefficiently translated AU-rich mRNAs, whereas control at the level of 5’ decay applies to optimally translated GC-rich mRNAs.
Journal Article
Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences
2021
The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum . We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest one of Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an ∼75 kb large inversion in the large single copy (LSC) region shared by the two species. The most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads applied to the sequencing of plastomes with AT-biased base composition.
Journal Article
Impact of GC content on de novo gene birth
2026
Noncoding regions in eukaryotes are extensively expressed and represent a significant source of novel microproteins, some of which become fixed as de novo genes. However, the structural properties of these unevolved products and the features driving their fixation remain poorly understood. Particularly, the influence of nucleotide composition (GC content) on their structural properties and evolutionary trajectories is still unclear. Here, we predict the foldability and sequence properties of millions of microproteins potentially encoded in the noncoding open reading frames (ORFs) of 3,379 eukaryotic genomes with GC contents ranging from 18% to 79%. Depending on GC content, these microproteins exhibit distinct structural properties, suggesting different cellular impacts if non-genic regions are pervasively expressed. Using phylostratigraphy, de novo gene search, and ancestral sequence reconstruction, we trace the evolution of several hundred de novo proteins across 22 organisms with varying GC contents. We show that de novo genes preferentially emerge from GC-rich ORFs with folding potential, revealing that the interplay between GC content and foldability - rooted in the structure of the genetic code - shapes the emergence of novel genes.
Noncoding DNA can generate microproteins, some of which evolve into new genes. This study finds that de novo genes preferentially originate from GC-rich, foldable sequences, showing how base composition influences the birth of new proteins.
Journal Article
Compensatory Evolution Following Deleterious Episodes of GC-biased Gene Conversion in Rodents
by
Saclier, Nathanaëlle
,
Galtier, Nicolas
,
Riffis, Marie
in
Accumulation
,
Amino acids
,
Animal genetics
2025
GC-biased gene conversion (gBGC) is a widespread evolutionary force associated with meiotic recombination that favors the accumulation of deleterious AT to GC substitutions in proteins, moving them away from their fitness optimum. In many mammals, recombination hotspots have a rapid turnover, leading to episodic gBGC, with the accumulation of deleterious mutations stopping when the recombination hotspot dies. Selection is therefore expected to act to repair the damage caused by gBGC episodes through compensatory evolution. However, this process has never been studied or quantified so far. Here, we analyzed the nucleotide substitution pattern in coding sequences of a highly diversified group of Murinae rodents. Using phylogenetic analyses of about 70,000 coding exons, we identified numerous exon-specific, lineage-specific gBGC episodes, characterized by a clustering of synonymous AT to GC substitutions and by an increasing rate of nonsynonymous AT to GC substitutions, many of which are potentially deleterious. Analyzing the molecular evolution of the affected exons in downstream lineages, we found evidence for pervasive compensatory evolution after deleterious gBGC episodes. Compensation appears to occur rapidly after the end of the episode and to be driven by the standing genetic variation rather than new mutations. Our results demonstrate the impact of gBGC on the evolution of amino-acid sequences and underline the key role of epistasis in protein adaptation. This study contributes to a growing body of literature emphasizing that adaptive mutations, which arise in response to environmental changes, are just 1 subset of beneficial mutations, alongside mutations resulting from oscillations around the fitness optimum.
Journal Article
Gene body methylation is conserved between plant orthologs and is of evolutionary consequence
by
Gaut, Brandon S.
,
Takuno, Shohei
in
Arabidopsis - genetics
,
Arabidopsis thaliana
,
Base Composition - genetics
2013
DNA methylation is a common feature of eukaryotic genomes and is especially common in noncoding regions of plants. Protein coding regions of plants are often methylated also, but the extent, function, and evolutionary consequences of gene body methylation remain unclear. Here we investigate gene body methylation using an explicit comparative evolutionary approach. We generated bisulfite sequencing data from two tissues of Brachypodium distachyon and compared genic methylation patterns to those of rice (Oryza sativa ssp. japonica). Gene body methylation was strongly conserved between orthologs of the two species and affected a biased subset of long, slowly evolving genes. Because gene body methylation is conserved over evolutionary time, it shapes important features of plant genome evolution, such as the bimodality of G+C content among grass genes. Our results superficially contradict previous observations of high cytosine methylation polymorphism within Arabidopsis thaliana genes, but reanalyses of these data are consistent with conservation of methylation within gene regions. Overall, our results indicate that the methylation level is a long-term property of individual genes and therefore of evolutionary consequence.
Journal Article
The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
2013
Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.
Journal Article
Gene Discovery and Tissue-Specific Transcriptome Analysis in Chickpea with Massively Parallel Pyrosequencing and Web Resource Development
by
Yadav, Gitanjali
,
Garg, Rohini
,
Priya, Pushp
in
Base Composition
,
Base Composition - genetics
,
Base Sequence
2011
Chickpea (Cicer arietinum) is an important food legume crop but lags in the availability of genomic resources. In this study, we have generated about 2 million high-quality sequences of average length of 372 bp using pyrosequencing technology. The optimization of de novo assembly clearly indicated that hybrid assembly of long-read and short-read primary assemblies gave better results. The hybrid assembly generated a set of 34,760 transcripts with an average length of 1,020 bp representing about 4.8% (35.5 Mb) of the total chickpea genome. We identified more than 4,000 simple sequence repeats, which can be developed as functional molecular markers in chickpea. Putative function and Gene Ontology terms were assigned to at least 73.2% and 71.0% of chickpea transcripts, respectively. We have also identified several chickpea transcripts that showed tissue-specific expression and validated the results using real-time polymerase chain reaction analysis. Based on sequence comparison with other species within the plant kingdom, we identified two sets of lineage-specific genes, including those conserved in the Fabaceae family (legume specific) and those lacking significant similarity with any non chickpea species (chickpea specific). Finally, we have developed a Web resource, Chickpea Transcriptome Database, which provides public access to the data and results reported in this study The strategy for optimization of de novo assembly presented here may further facilitate the transcriptome sequencing and characterization in other organisms. Most importantly, the data and results reported in this study will help to accelerate research in various areas of genomics and implementing breeding programs in chickpea.
Journal Article
Metabolic characteristics of the species Variovorax paradoxus
by
Steinbüchel, Alexander
,
Wübbeler, Jan Hendrik
,
Satola, Barbara
in
1-aminocyclopropane-1-carboxylic acid
,
2,4-D
,
2-methylisoborneol
2013
This review outlines information about the Gram-negative, aerobic bacterium
Variovorax paradoxus.
The genomes of these species have G+C contents of 66.5–69.4 mol%, and the cells form yellow colonies. Some strains of
V. paradoxus
are facultative lithoautotrophic, others are chemoorganotrophic. Many of them are associated with important catabolic processes including the degradation of toxic and/or complex chemical compounds. The degradation pathways or other skills related to the following compounds, respectively, are described in this review: sulfolane, 3-sulfolene, 2-mercaptosuccinic acid, 3,3′-thiodipropionic acid, aromatic sulfonates, alkanesulfonates, amino acids and other sulfur sources, polychlorinated biphenyls, dimethyl terephthalate, linuron, 2,4-dinitrotoluene, homovanillate, veratraldehyde, 2,4-dichlorophenoxyacetic acid, anthracene, poly(3-hydroxybutyrate), chitin, cellulose, humic acids, metal–EDTA complexes, yttrium, rare earth elements, As(III), trichloroethylene, capsaicin, 3-nitrotyrosine, acyl-homoserine lactones, 1-aminocyclopropane-1-carboxylate, methyl
tert
-butyl ether, geosmin, and 2-methylisoborneol. Strains of
V. paradoxus
are also engaged in mutually beneficial interactions with other plant and bacterial species in various ecosystems. This species comprises probably promising strains for bioremediation and other biotechnical applications. Lately, the complete genomes of strains S110 and EPS have been sequenced for further investigations.
Journal Article
Impact of Recombination on the Base Composition of Bacteria and Archaea
2017
The mutational process in bacteria is biased toward A and T, and most species are GC-rich relative to the mutational input to their genome. It has been proposed that the shift in base composition is an adaptive process—that natural selection operates to increase GC-contents—and there is experimental evidence that bacterial strains with GC-rich versions of genes have higher growth rates than those strains with AT-rich versions expressing identical proteins. Alternatively, a nonadaptive process, GC-biased gene conversion (gBGC), could also increase the GC-content of DNA due to the mechanistic bias of gene conversion events during recombination. To determine what role recombination plays in the base composition of bacterial genomes, we compared the spectrum of nucleotide polymorphisms introduced by recombination in all microbial species represented by large numbers of sequenced strains. We found that recombinant alleles are consistently biased toward A and T, and that the magnitude of AT-bias introduced by recombination is similar to that of mutations. These results indicate that recombination alone, without the intervention of selection, is unlikely to counteract the AT-enrichment of bacterial genomes.
Journal Article
selective force favoring increased G+C content in bacterial genes
by
Raghavan, Rahul
,
Ochman, Howard
,
Kelkar, Yogeshwar D
in
Adaptation, Biological - genetics
,
Bacteria
,
Bacterial genomes
2012
Bacteria display considerable variation in their overall base compositions, which range from 13% to over 75% G+C. This variation in genomic base compositions has long been considered to be a strictly neutral character, due solely to differences in the mutational process; however, recent sequence comparisons indicate that mutational input alone cannot produce the observed base compositions, implying a role for natural selection. Because bacterial genomes have high gene content, forces that operate on the base composition of individual genes could help shape the overall genomic base composition. To explore this possibility, we tested whether genes that encode the same protein but vary only in their base compositions at synonymous sites have effects on bacterial fitness. Escherichia coli strains harboring G+C-rich versions of genes display higher growth rates, indicating that despite a pervasive mutational bias toward A+T, a selective force, independent of adaptive codon use, is driving genes toward higher G+C contents.
Journal Article