Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
LanguageLanguage
-
SubjectSubject
-
Item TypeItem Type
-
DisciplineDiscipline
-
YearFrom:-To:
-
More FiltersMore FiltersIs Peer Reviewed
Done
Filters
Reset
57
result(s) for
"Yutin, Natalya"
Sort by:
Analysis of metagenome-assembled viral genomes from the human gut reveals diverse putative CrAss-like phages with unique genomic features
2021
CrAssphage is the most abundant human-associated virus and the founding member of a large group of bacteriophages, discovered in animal-associated and environmental metagenomes, that infect bacteria of the phylum Bacteroidetes. We analyze 4907 Circular Metagenome Assembled Genomes (cMAGs) of putative viruses from human gut microbiomes and identify nearly 600 genomes of crAss-like phages that account for nearly 87% of the DNA reads mapped to these cMAGs. Phylogenetic analysis of conserved genes demonstrates the monophyly of crAss-like phages, a putative virus order, and of 5 branches, potential families within that order, two of which have not been identified previously. The phage genomes in one of these families are almost twofold larger than the crAssphage genome (145-192 kilobases), with high density of self-splicing introns and inteins. Many crAss-like phages encode suppressor tRNAs that enable read-through of UGA or UAG stop-codons, mostly, in late phage genes. A distinct feature of the crAss-like phages is the recurrent switch of the phage DNA polymerase type between A and B families. Thus, comparative genomic analysis of the expanded assemblage of crAss-like phages reveals aspects of genome architecture and expression as well as phage biology that were not apparent from the previous work on phage genomics.
Here, the authors analyze 4907 Circular Metagenome Assembled Genomes from human microbiomes and identify and characterize nearly 600 diverse genomes of crAss-like phages, finding two putative families with unusual genomic features, including high density of self-splicing introns and inteins.
Journal Article
Multiple evolutionary origins of giant viruses version 1; peer review: 4 approved
2018
The nucleocytoplasmic large DNA viruses (NCLDVs) are a monophyletic group of diverse eukaryotic viruses that reproduce primarily in the cytoplasm of the infected cells and include the largest viruses currently known: the giant mimiviruses, pandoraviruses, and pithoviruses. With virions measuring up to 1.5 μm and genomes of up to 2.5 Mb, the giant viruses break the now-outdated definition of a virus and extend deep into the genome size range typical of bacteria and archaea. Additionally, giant viruses encode multiple proteins that are universal among cellular life forms, particularly components of the translation system, the signature cellular molecular machinery. These findings triggered hypotheses on the origin of giant viruses from cells, likely of an extinct fourth domain of cellular life, via reductive evolution. However, phylogenomic analyses reveal a different picture, namely multiple origins of giant viruses from smaller NCLDVs via acquisition of multiple genes from the eukaryotic hosts and bacteria, along with gene duplication. Thus, with regard to their origin, the giant viruses do not appear to qualitatively differ from the rest of the virosphere. However, the evolutionary forces that led to the emergence of virus gigantism remain enigmatic.
Journal Article
Ancient Gene Capture and Recent Gene Loss Shape the Evolution of Orthopoxvirus-Host Interaction Genes
by
Wolf, Yuri I.
,
Koonin, Eugene V.
,
Senkevich, Tatiana G.
in
Animals
,
Chemokines
,
Evolution, Molecular
2021
Orthopoxviruses (ORPV) include smallpox (variola) virus, one of the most devastating human pathogens, and vaccinia virus, comprising the vaccine used for smallpox eradication. Among roughly 200 ORPV genes, about half are essential for genome replication and expression as well as virion morphogenesis, whereas the remaining half consists of accessory genes counteracting the host immune response. The survival of viruses depends on their ability to resist host defenses and, of all animal virus families, the poxviruses have the most antidefense genes. Orthopoxviruses (ORPV), a genus within the subfamily Chordopoxvirinae , infect diverse mammals and include one of the most devastating human pathogens, the now eradicated smallpox virus. ORPV encode ∼200 genes, of which roughly half are directly involved in virus genome replication and expression as well as virion morphogenesis. The remaining ∼100 “accessory” genes are responsible for virus-host interactions, particularly counter-defense of innate immunity. Complete sequences are currently available for several hundred ORPV genomes isolated from a variety of mammalian hosts, providing a rich resource for comparative genomics and reconstruction of ORPV evolution. To identify the provenance and evolutionary trends of the ORPV accessory genes, we constructed clusters including the orthologs of these genes from all chordopoxviruses. Most of the accessory genes were captured in three major waves early in chordopoxvirus evolution, prior to the divergence of ORPV and the sister genus Centapoxvirus from their common ancestor. The capture of these genes from the host was followed by extensive gene duplication, yielding several paralogous gene families. In addition, nine genes were gained during the evolution of ORPV themselves. In contrast, nearly every accessory gene was lost, some on multiple, independent occasions in numerous lineages of ORPV, so that no ORPV retains them all. A variety of functional interactions could be inferred from examination of pairs of ORPV accessory genes that were either often or rarely lost concurrently. IMPORTANCE Orthopoxviruses (ORPV) include smallpox (variola) virus, one of the most devastating human pathogens, and vaccinia virus, comprising the vaccine used for smallpox eradication. Among roughly 200 ORPV genes, about half are essential for genome replication and expression as well as virion morphogenesis, whereas the remaining half consists of accessory genes counteracting the host immune response. We reannotated the accessory genes of ORPV, predicting the functions of uncharacterized genes, and reconstructed the history of their gain and loss during the evolution of ORPV. Most of the accessory genes were acquired in three major waves antedating the origin of ORPV from chordopoxviruses. The evolution of ORPV themselves was dominated by gene loss, with numerous genes lost at the base of each major group of ORPV. Examination of pairs of ORPV accessory genes that were either often or rarely lost concurrently during ORPV evolution allows prediction of different types of functional interactions.
Journal Article
Hidden evolutionary complexity of Nucleo-Cytoplasmic Large DNA viruses of eukaryotes
2012
Background
The Nucleo-Cytoplasmic Large DNA Viruses (NCLDV) constitute an apparently monophyletic group that consists of at least 6 families of viruses infecting a broad variety of eukaryotic hosts. A comprehensive genome comparison and maximum-likelihood reconstruction of the NCLDV evolution revealed a set of approximately 50 conserved, core genes that could be mapped to the genome of the common ancestor of this class of eukaryotic viruses.
Results
We performed a detailed phylogenetic analysis of these core NCLDV genes and applied the constrained tree approach to show that the majority of the core genes are unlikely to be monophyletic. Several of the core genes have been independently acquired from different sources by different NCLDV lineages whereas for the majority of these genes displacement by homologs from cellular organisms in one or more groups of the NCLDV was demonstrated.
Conclusions
A detailed study of the evolution of the genomic core of the NCLDV reveals substantial complexity and diversity of evolutionary scenarios that was largely unsuspected previously. The phylogenetic coherence between the core genes is sufficient to validate the hypothesis on the evolution of all NCLDV from a common ancestral virus although the set of ancestral genes might be smaller than previously inferred from patterns of gene presence-absence.
Journal Article
A distinct abundant group of microbial rhodopsins discovered using functional metagenomics
2018
Many organisms capture or sense sunlight using rhodopsin pigments
1
,
2
, which are integral membrane proteins that bind retinal chromophores. Rhodopsins comprise two distinct protein families
1
, type-1 (microbial rhodopsins) and type-2 (animal rhodopsins). The two families share similar topologies and contain seven transmembrane helices that form a pocket in which retinal is linked covalently as a protonated Schiff base to a lysine at the seventh transmembrane helix
2
,
3
. Type-1 and type-2 rhodopsins show little or no sequence similarity to each other, as a consequence of extensive divergence from a common ancestor or convergent evolution of similar structures
1
. Here we report a previously unknown and diverse family of rhodopsins—which we term the heliorhodopsins—that we identified using functional metagenomics and that are distantly related to type-1 rhodopsins. Heliorhodopsins are embedded in the membrane with their N termini facing the cell cytoplasm, an orientation that is opposite to that of type-1 or type-2 rhodopsins. Heliorhodopsins show photocycles that are longer than one second, which is suggestive of light-sensory activity. Heliorhodopsin photocycles accompany retinal isomerization and proton transfer, as in type-1 and type-2 rhodopsins, but protons are never released from the protein, even transiently. Heliorhodopsins are abundant and distributed globally; we detected them in Archaea, Bacteria, Eukarya and their viruses. Our findings reveal a previously unknown family of light-sensing rhodopsins that are widespread in the microbial world.
An analysis based on functional metagenomics reveals a previously unknown group of microbial light-sensory rhodopsins that are widespread among a diverse range of microorganisms.
Journal Article
Thousands of previously unknown phages discovered in whole-community human gut metagenomes
2021
Background
Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut virome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut.
Results
A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, “Flandersviridae” and “Quimbyviridae”, include some of the most common and abundant members of the human gut virome that infect
Bacteroides
,
Parabacteroides
, and
Prevotella
. The third proposed family, “Gratiaviridae,” consists of less abundant phages that are distantly related to the families
Autographiviridae
,
Drexlerviridae
, and
Chaseviridae
. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some “Quimbyviridae” phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several “Flandersviridae” phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The “Gratiaviridae” phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species.
Conclusions
Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of
Caudovirales
and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse, and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes.
E-7rced89cjMzTjfwKghHn
Video abstract
Journal Article
Eukaryotic large nucleo-cytoplasmic DNA viruses: Clusters of orthologous genes and reconstruction of viral genome evolution
by
Koonin, Eugene V
,
Wolf, Yuri I
,
Raoult, Didier
in
Asfarviridae - genetics
,
Biomedical and Life Sciences
,
Biomedicine
2009
Background
The Nucleo-Cytoplasmic Large DNA Viruses (NCLDV) comprise an apparently monophyletic class of viruses that infect a broad variety of eukaryotic hosts. Recent progress in isolation of new viruses and genome sequencing resulted in a substantial expansion of the NCLDV diversity, resulting in additional opportunities for comparative genomic analysis, and a demand for a comprehensive classification of viral genes.
Results
A comprehensive comparison of the protein sequences encoded in the genomes of 45 NCLDV belonging to 6 families was performed in order to delineate cluster of orthologous viral genes. Using previously developed computational methods for orthology identification, 1445 Nucleo-Cytoplasmic Virus Orthologous Groups (NCVOGs) were identified of which 177 are represented in more than one NCLDV family. The NCVOGs were manually curated and annotated and can be used as a computational platform for functional annotation and evolutionary analysis of new NCLDV genomes. A maximum-likelihood reconstruction of the NCLDV evolution yielded a set of 47 conserved genes that were probably present in the genome of the common ancestor of this class of eukaryotic viruses. This reconstructed ancestral gene set is robust to the parameters of the reconstruction procedure and so is likely to accurately reflect the gene core of the ancestral NCLDV, indicating that this virus encoded a complex machinery of replication, expression and morphogenesis that made it relatively independent from host cell functions.
Conclusions
The NCVOGs are a flexible and expandable platform for genome analysis and functional annotation of newly characterized NCLDV. Evolutionary reconstructions employing NCVOGs point to complex ancestral viruses.
Journal Article
Phylogenomics of Prokaryotic Ribosomal Proteins
2012
Archaeal and bacterial ribosomes contain more than 50 proteins, including 34 that are universally conserved in the three domains of cellular life (bacteria, archaea, and eukaryotes). Despite the high sequence conservation, annotation of ribosomal (r-) protein genes is often difficult because of their short lengths and biased sequence composition. We developed an automated computational pipeline for identification of r-protein genes and applied it to 995 completely sequenced bacterial and 87 archaeal genomes available in the RefSeq database. The pipeline employs curated seed alignments of r-proteins to run position-specific scoring matrix (PSSM)-based BLAST searches against six-frame genome translations, mitigating possible gene annotation errors. As a result of this analysis, we performed a census of prokaryotic r-protein complements, enumerated missing and paralogous r-proteins, and analyzed the distributions of ribosomal protein genes among chromosomal partitions. Phyletic patterns of bacterial and archaeal r-protein genes were mapped to phylogenetic trees reconstructed from concatenated alignments of r-proteins to reveal the history of likely multiple independent gains and losses. These alignments, available for download, can be used as search profiles to improve genome annotation of r-proteins and for further comparative genomics studies.
Journal Article
Origin and Evolution of Eukaryotic Large Nucleo-Cytoplasmic DNA Viruses
by
Koonin, Eugene V.
,
Yutin, Natalya
in
Bacteria - genetics
,
Bacteriophages - genetics
,
Cluster Analysis
2010
Background/Aims: The nucleo-cytoplasmic large DNA viruses (NCLDV) constitute an apparently monophyletic group that consists of 6 families of viruses infecting a broad variety of eukaryotes. A comprehensive genome comparison and maximum-likelihood reconstruction of NCLDV evolution reveal a set of approximately 50 conserved genes that can be tentatively mapped to the genome of the common ancestor of this class of eukaryotic viruses. We address the origins and evolution of NCLDV. Results: Phylogenetic analysis indicates that some of the major clades of NCLDV infect diverse animals and protists, suggestive of early radiation of the NCLDV, possibly concomitant with eukaryogenesis. The core NCLDV genes seem to have originated from different sources including homologous genes of bacteriophages, bacteria and eukaryotes. These observations are compatible with a scenario of the origin of the NCLDV at an early stage of the evolution of eukaryotes through extensive mixing of genes from widely different genomes. Conclusions: The common ancestor of the NCLDV probably evolved from a bacteriophage as a result of recruitment of numerous eukaryotic and some bacterial genes, and concomitant loss of the majority of phage genes except for a small core of genes coding for proteins essential for virus genome replication and virion formation.
Journal Article
Mriyaviruses: small relatives of giant viruses
by
Krupovic, Mart
,
Koonin, Eugene V.
,
Mutz, Pascal
in
AlphaFold
,
Amino acid sequence
,
Capsid Proteins - chemistry
2024
The origin of giant viruses of eukaryotes that belong to the phylum Nucleocytoviricota is not thoroughly understood and remains a matter of major interest and debate. Here, we combine metagenome database searches with extensive protein sequence and structure analysis to describe a distinct group of viruses with comparatively small genomes of 35–45 kilobases that appear to comprise a distinct class within the phylum Nucleocytoviricota that we provisionally named “ Mriyaviricetes .” Mriyaviruses appear to be the closest identified relatives of the ancestors of the Nucleocytoviricota . Analysis of proteins encoded in mriyavirus genomes suggests that they replicate their genome via the rolling circle mechanism that is unusual among viruses with double-stranded DNA genomes and so far not described for members of Nucleocytoviricota .
Journal Article