Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
449
result(s) for
"codon optimization"
Sort by:
Codon usage bias
by
Parvathy, Sujatha Thankeswaran
,
Udayasuriyan, Varatharajalu
,
Bhadana, Vijaipal
in
Animal Anatomy
,
Animal Biochemistry
,
Animals
2022
Codon usage bias is the preferential or non-random use of synonymous codons, a ubiquitous phenomenon observed in bacteria, plants and animals. Different species have consistent and characteristic codon biases. Codon bias varies not only with species, family or group within kingdom, but also between the genes within an organism. Codon usage bias has evolved through mutation, natural selection, and genetic drift in various organisms. Genome composition, GC content, expression level and length of genes, position and context of codons in the genes, recombination rates, mRNA folding, and tRNA abundance and interactions are some factors influencing codon bias. The factors shaping codon bias may also be involved in evolution of the universal genetic code. Codon-usage bias is critical factor determining gene expression and cellular function by influencing diverse processes such as RNA processing, protein translation and protein folding. Codon usage bias reflects the origin, mutation patterns and evolution of the species or genes. Investigations of codon bias patterns in genomes can reveal phylogenetic relationships between organisms, horizontal gene transfers, molecular evolution of genes and identify selective forces that drive their evolution. Most important application of codon bias analysis is in the design of transgenes, to increase gene expression levels through codon optimization, for development of transgenic crops. The review gives an overview of deviations of genetic code, factors influencing codon usage or bias, codon usage bias of nuclear and organellar genes, computational methods to determine codon usage and the significance as well as applications of codon usage analysis in biological research, with emphasis on plants.
Journal Article
A new and updated resource for codon usage tables
by
Athey, John
,
Rostovtsev, Alexandre
,
Osipova, Ekaterina
in
Algorithms
,
Animals
,
BASIC BIOLOGICAL SCIENCES
2017
Background
Due to the degeneracy of the genetic code, most amino acids can be encoded by multiple synonymous codons. Synonymous codons naturally occur with different frequencies in different organisms. The choice of codons may affect protein expression, structure, and function. Recombinant gene technologies commonly take advantage of the former effect by implementing a technique termed codon optimization, in which codons are replaced with synonymous ones in order to increase protein expression. This technique relies on the accurate knowledge of codon usage frequencies. Accurately quantifying codon usage bias for different organisms is useful not only for codon optimization, but also for evolutionary and translation studies: phylogenetic relations of organisms, and host-pathogen co-evolution relationships, may be explored through their codon usage similarities. Furthermore, codon usage has been shown to affect protein structure and function through interfering with translation kinetics, and cotranslational protein folding.
Results
Despite the obvious need for accurate codon usage tables, currently available resources are either limited in scope, encompassing only organisms from specific domains of life, or greatly outdated. Taking advantage of the exponential growth of GenBank and the creation of NCBI’s RefSeq database, we have developed a new database, the High-performance Integrated Virtual Environment-Codon Usage Tables (HIVE-CUTs), to present and analyse codon usage tables for every organism with publicly available sequencing data. Compared to existing databases, this new database is more comprehensive, addresses concerns that limited the accuracy of earlier databases, and provides several new functionalities, such as the ability to view and compare codon usage between individual organisms and across taxonomical clades, through graphical representation or through commonly used indices. In addition, it is being routinely updated to keep up with the continuous flow of new data in GenBank and RefSeq.
Conclusion
Given the impact of codon usage bias on recombinant gene technologies, this database will facilitate effective development and review of recombinant drug products and will be instrumental in a wide area of biological research. The database is available at
hive.biochemistry.gwu.edu/review/codon
.
Journal Article
ICOR: improving codon optimization with recurrent neural networks
by
Mauro, Elizabeth
,
LeShane, Kevin
,
Jain, Rishab
in
Algorithms
,
Amino acid sequence
,
Amino acids
2023
Background
In protein sequences—as there are 61 sense codons but only 20 standard amino acids—most amino acids are encoded by more than one codon. Although such synonymous codons do not alter the encoded amino acid sequence, their selection can dramatically affect the expression of the resulting protein. Codon optimization of synthetic DNA sequences is important for heterologous expression. However, existing solutions are primarily based on choosing high-frequency codons only, neglecting the important effects of rare codons. In this paper, we propose a novel recurrent-neural-network based codon optimization tool, ICOR, that aims to learn codon usage bias on a genomic dataset of
Escherichia coli
. We compile a dataset of over 7,000 non-redundant, high-expression, robust genes which are used for deep learning. The model uses a bidirectional long short-term memory-based architecture, allowing for the sequential context of codon usage in genes to be learned. Our tool can predict synonymous codons for synthetic genes toward optimal expression in
Escherichia coli
.
Results
We demonstrate that sequential context achieved via RNN may yield codon selection that is more similar to the host genome. Based on computational metrics that predict protein expression, ICOR theoretically optimizes protein expression more than frequency-based approaches. ICOR is evaluated on 1,481
Escherichia coli
genes as well as a benchmark set of 40 select DNA sequences whose heterologous expression has been previously characterized. ICOR’s performance is measured across five metrics: the Codon Adaptation Index, GC-content, negative repeat elements, negative cis-regulatory elements, and codon frequency distribution.
Conclusions
The results, based on in silico metrics, indicate that ICOR codon optimization is theoretically more effective in enhancing recombinant expression of proteins over other established codon optimization techniques. Our tool is provided as an open-source software package that includes the benchmark set of sequences used in this study.
Journal Article
Detailed Dissection and Critical Evaluation of the Pfizer/BioNTech and Moderna mRNA Vaccines
2021
The design of Pfizer/BioNTech and Moderna mRNA vaccines involves many different types of optimizations. Proper optimization of vaccine mRNA can reduce dosage required for each injection leading to more efficient immunization programs. The mRNA components of the vaccine need to have a 5′-UTR to load ribosomes efficiently onto the mRNA for translation initiation, optimized codon usage for efficient translation elongation, and optimal stop codon for efficient translation termination. Both 5′-UTR and the downstream 3′-UTR should be optimized for mRNA stability. The replacement of uridine by N1-methylpseudourinine (Ψ) complicates some of these optimization processes because Ψ is more versatile in wobbling than U. Different optimizations can conflict with each other, and compromises would need to be made. I highlight the similarities and differences between Pfizer/BioNTech and Moderna mRNA vaccines and discuss the advantage and disadvantage of each to facilitate future vaccine improvement. In particular, I point out a few optimizations in the design of the two mRNA vaccines that have not been performed properly.
Journal Article
Codon Optimization Improves the Prediction of Xylose Metabolism from Gene Content in Budding Yeasts
by
Opulente, Dana A
,
LaBella, Abigail Leavitt
,
Nalabothu, Rishitha L
in
BASIC BIOLOGICAL SCIENCES
,
Biological products
,
Catabolism
2023
Abstract
Xylose is the second most abundant monomeric sugar in plant biomass. Consequently, xylose catabolism is an ecologically important trait for saprotrophic organisms, as well as a fundamentally important trait for industries that hope to convert plant mass to renewable fuels and other bioproducts using microbial metabolism. Although common across fungi, xylose catabolism is rare within Saccharomycotina, the subphylum that contains most industrially relevant fermentative yeast species. The genomes of several yeasts unable to consume xylose have been previously reported to contain the full set of genes in the XYL pathway, suggesting the absence of a gene–trait correlation for xylose metabolism. Here, we measured growth on xylose and systematically identified XYL pathway orthologs across the genomes of 332 budding yeast species. Although the XYL pathway coevolved with xylose metabolism, we found that pathway presence only predicted xylose catabolism about half of the time, demonstrating that a complete XYL pathway is necessary, but not sufficient, for xylose catabolism. We also found that XYL1 copy number was positively correlated, after phylogenetic correction, with xylose utilization. We then quantified codon usage bias of XYL genes and found that XYL3 codon optimization was significantly higher, after phylogenetic correction, in species able to consume xylose. Finally, we showed that codon optimization of XYL2 was positively correlated, after phylogenetic correction, with growth rates in xylose medium. We conclude that gene content alone is a weak predictor of xylose metabolism and that using codon optimization enhances the prediction of xylose metabolism from yeast genome sequence data.
Journal Article
Using protein-per-mRNA differences among human tissues in codon optimization
by
Schaefer, Martin H.
,
Hernandez-Alias, Xavier
,
Radusky, Leandro G.
in
algorithms
,
Animal Genetics and Genomics
,
Bioinformatics
2023
Background
Codon usage and nucleotide composition of coding sequences have profound effects on protein expression. However, while it is recognized that different tissues have distinct tRNA profiles and codon usages in their transcriptomes, the effect of tissue-specific codon optimality on protein synthesis remains elusive.
Results
We leverage existing state-of-the-art transcriptomics and proteomics datasets from the GTEx project and the Human Protein Atlas to compute the protein-to-mRNA ratios of 36 human tissues. Using this as a proxy of translational efficiency, we build a machine learning model that identifies codons enriched or depleted in specific tissues. We detect two clusters of tissues with an opposite pattern of codon preferences. We then use these identified patterns for the development of CUSTOM, a codon optimizer algorithm which suggests a synonymous codon design in order to optimize protein production in a tissue-specific manner. In human cell-line models, we provide evidence that codon optimization should take into account particularities of the translational machinery of the tissues in which the target proteins are expressed and that our approach can design genes with tissue-optimized expression profiles.
Conclusions
We provide proof-of-concept evidence that codon preferences exist in tissue-specific protein synthesis and demonstrate its application to synthetic gene design. We show that CUSTOM can be of benefit in biological and biotechnological applications, such as in the design of tissue-targeted therapies and vaccines.
Journal Article
Assessing optimal: inequalities in codon optimization algorithms
2021
Background
Custom genes have become a common resource in recombinant biology over the last 20 years due to the plummeting cost of DNA synthesis. These genes are often “optimized” to non-native sequences for overexpression in a non-native host by substituting synonymous codons within the coding DNA sequence (CDS). A handful of studies have compared native and optimized CDSs, reporting different levels of soluble product due to the accumulation of misfolded aggregates, variable activity of enzymes, and (at least one report of) a change in substrate specificity. No study, to the best of our knowledge, has performed a practical comparison of CDSs generated from different codon optimization algorithms or reported the corresponding protein yields.
Results
In our efforts to understand what factors constitute an optimized CDS, we identified that there is little consensus among codon-optimization algorithms, a roughly equivalent chance that an algorithm-optimized CDS will increase or diminish recombinant yields as compared to the native DNA, a near ubiquitous use of a codon database that was last updated in 2007, and a high variability of output CDSs by some algorithms. We present a case study, using KRas4B, to demonstrate that a median codon frequency may be a better predictor of soluble yields than the more commonly utilized CAI metric.
Conclusions
We present a method for visualizing, analyzing, and comparing algorithm-optimized DNA sequences for recombinant protein expression. We encourage researchers to consider if DNA optimization is right for their experiments, and work towards improving the reproducibility of published recombinant work by publishing non-native CDSs.
Journal Article
Heterologous Expression of Toxic White Spot Syndrome Virus (WSSV) Protein in Eengineered Escherichia coli Strains
2023
Aquacultural shrimps suffer economic lost due to the white spot syndrome virus (WSSV) that is the most notorious virus for its fatality and contagion, leading to a 100% death rate on infected shrimps within 7 days. However, the infection of mechanism remains a mystery and crucial problem. To elucidate the pathogenesis of WSSV, a high abundance of protein is required to identify and characterize its functions. Therefore, the optimal WSSV355 overexpression was explored in engineered Escherichia coli strains, in particular C43(DE3) as a toxic tolerance strain remedied 40% of cell growth from BL21(DE3). Meanwhile, a trace amount of WSSV355 was observed in both strains. To optimize the codon of WSSV355 using codon adaption index (CAI), an overexpression was observed with 1.32 mg/mL in C43(DE3), while the biomass was decreased by 35%. Subsequently, the co-expression with pRARE boosted the target protein up to 1.93 mg/mL. Finally, by scaling up production of WSSV355 in the fermenter with sufficient oxygen supplied, the biomass and total and soluble protein were enhanced 67.6%, 44.9%, and 7.8% compared with that in flask condition. Herein, the current approach provides efficacious solutions to produce toxic proteins via codon usage, strain selection, and processing optimization by alleviating the burden and boosting protein production in E. coli.
Journal Article
Codon-Optimized RPGR Improves Stability and Efficacy of AAV8 Gene Therapy in Two Mouse Models of X-Linked Retinitis Pigmentosa
by
Ramsden, Simon C.
,
Bellingrath, Julia-Sophia
,
Hickey, Doron G.
in
Animal models
,
Animals
,
Carrier Proteins - genetics
2017
X-linked retinitis pigmentosa (XLRP) is generally a severe form of retinitis pigmentosa, a neurodegenerative, blinding disorder of the retina. 70% of XLRP cases are due to mutations in the retina-specific isoform of the gene encoding retinitis pigmentosa GTPase regulator (RPGRORF15). Despite successful RPGRORF15 gene replacement with adeno-associated viral (AAV) vectors being established in a number of animal models of XLRP, progression to human trials has not yet been possible. The inherent sequence instability in the purine-rich region of RPGRORF15 (which contains highly repetitive nucleotide sequences) leads to unpredictable recombination errors during viral vector cloning. While deleted RPGR may show some efficacy in animal models, which have milder disease, the therapeutic effect of a mutated RPGR variant in patients with XLRP cannot be predicted. Here, we describe an optimized gene replacement therapy for human XLRP disease using an AAV8 vector that reliably and consistently produces the full-length correct RPGR protein. The glutamylation pattern in the RPGR protein derived from the codon-optimized sequence is indistinguishable from the wild-type variant, implying that codon optimization does not significantly alter post-translational modification. The codon-optimized sequence has superior stability and expression levels in vitro. Significantly, when delivered by AAV8 vector and driven by the rhodopsin kinase promoter, the codon-optimized RPGR rescues the disease phenotype in two relevant animal models (Rpgr−/y and C57BL/6JRd9/Boc) and shows good safety in C57BL6/J wild-type mice. This work provides the basis for clinical trial development to treat patients with XLRP caused by RPGR mutations.
X-linked retinitis pigmentosa caused by mutations in RPGR is a frequent cause of retinal degeneration and blindness without available treatment. Fischer et al. demonstrate safety and efficacy of gene supplementation in relevant animal models using a codon-optimized transgene, thereby resolving the problem of sequence instability of wild-type RPGR.
Journal Article
Comparative analysis of codon usage patterns in chloroplast genomes of five Miscanthus species and related species
2021
Miscanthus is not only a perennial fiber biomass crop, but also valuable breeding resource for its low-nutrient requirements, photosynthetic efficiency and strong adaptability to environment. In the present study, the codon usage patterns of five different
Miscanthus
plants and other two related species were systematically analyzed. The results indicated that the cp genomes of the seven representative species were preference to A/T bases and A/T-ending codons. In addition, 21 common high-frequency codons and 4–11 optimal codons were detected in the seven chloroplast genomes. The results of ENc-plot, PR2-plot and neutrality analysis revealed the codon usage patterns of the seven chloroplast genomes are influenced by multiple factors, in which nature selection is the main influencing factor. Comparative analysis of the codon usage frequencies between the seven representative species and four model organisms suggested that
Arabidopsis thaliana
,
Populus trichocarpa
and
Saccharomyces cerevisiae
could be considered as preferential appropriate exogenous expression receptors. These results might not only provide important reference information for evolutionary analysis, but also shed light on the way to improve the expression efficiency of exogenous gene in transgenic research based on codon optimization.
Journal Article