Catalogue Search | MBRL

SARS-CoV-2 historical global testing and genomic variability

by E. Boulos, Rasha , Khoueiry, Lea Maria , Akiki, Shadi in Analysis , Coronaviruses , COVID-19

2022

The world has been dealing with the SARS-COV-2 pandemic since December 2019 and a lot of effort has focused on tracking the spread of the virus by gathering information regarding testing statistics and generating viral genomic sequences. Unfortunately, there is neither a single comprehensive resource with global historical testing data nor a centralized database with summary statistics of the identified genomic variants. We merged different pre-aggregated historical testing data and complemented them with our manually extracted ones, which consist of 6852 historical test statistics from 76 countries/states unreported in any other dataset, at the date of submission, making our dataset the most comprehensive to date. We also analyzed all publicly deposited SARS-CoV-2 genomic sequences in GISAID and annotated their variants. Both datasets can be accessed through our interactive dashboard which also provides important insights on different outbreak trends across countries and states. The dashboard is available at https://bioinfo.lau.edu.lb/gkhazen/covid19 . A daily updated version of the datasets can be downloaded from github.com/KhazenLab/covid19-data.

Journal Article

Share this book

Add to My Shelf

Hidden biases in germline structural variant detection

by Carroll, Andrew , Pan, Bohu , Sedlazeck, Fritz J. in Animal Genetics and Genomics , Base Sequence , Bias

2021

Background Genomic structural variations (SV) are important determinants of genotypic and phenotypic changes in many organisms. However, the detection of SV from next-generation sequencing data remains challenging. Results In this study, DNA from a Chinese family quartet is sequenced at three different sequencing centers in triplicate. A total of 288 derivative data sets are generated utilizing different analysis pipelines and compared to identify sources of analytical variability. Mapping methods provide the major contribution to variability, followed by sequencing centers and replicates. Interestingly, SV supported by only one center or replicate often represent true positives with 47.02% and 45.44% overlapping the long-read SV call set, respectively. This is consistent with an overall higher false negative rate for SV calling in centers and replicates compared to mappers (15.72%). Finally, we observe that the SV calling variability also persists in a genotyping approach, indicating the impact of the underlying sequencing and preparation approaches. Conclusions This study provides the first detailed insights into the sources of variability in SV identification from next-generation sequencing and highlights remaining challenges in SV calling for large cohorts. We further give recommendations on how to reduce SV calling variability and the choice of alignment methodology.

Journal Article

Share this book

Add to My Shelf

Genetic variability, genotyping, and genomics of Mycobacterium leprae

by Ghosh, Arup , Dwivedi, Purna , Kumar, Roopendra in Artificial intelligence , Epidemiology , Gene deletion

2025

Abstract Leprosy, caused by Mycobacterium leprae and Mycobacterium lepromatosis, remains a significant global health issue despite a tremendous decline in its worldwide prevalence in the last four decades. Mycobacterium leprae strains possess very limited genetic variability, making it difficult to distinguish them using traditional genotyping tools. Successful genome sequencing of a considerable number of M. leprae strains in the recent past has allowed development of improved genotyping tools for the molecular epidemiology of leprosy. Comparative genomics has identified distinct M. leprae genotypes and revealed their characteristic genomic markers. This review summarizes the progress made in M. leprae genomics, with special emphasis on the development of genotyping schemes. Further, an updated genotyping scheme is introduced that also includes the newly reported genotypes 1B_Bangladesh, 1D_Malagasy, 3K-0/3K-1, 3Q and 4N/O. Additionally, genotype-specific markers (single nucleotide polymorphisms, Insertion/Deletion) have been incorporated into the typing scheme for the first time to enable differentiation of closely related strains. This will be particularly useful for geographic regions where M. leprae strains characterized by a small number of genotypes are predominant. The detailed compilation of genomic markers will also enable accurate identification of M. leprae genotypes, using targeted analysis of variable regions. Such markers are good candidates for developing artificial intelligence-based algorithms for classifying M. leprae genomic datasets. An updated genotyping scheme for the causative agent of leprosy, Mycobacterium leprae, has been compiled in this review, which also includes the newly reported M. leprae genotypes identified using recent whole genome sequencing studies from different parts of the world and thus will be useful for molecular epidemiological and phylogeographical investigations.

Journal Article

Share this book

Add to My Shelf

Whole genome sequencing of Trypanosoma cruzi field isolates reveals extensive genomic variability and complex aneuploidy patterns within TcII DTU

by de Almeida, Laila Viana , Dias, Fernando Hugo Cunha , Galvão, Lúcia M. C. in Aneuploidy , Animal Genetics and Genomics , Animals

2018

Background Trypanosoma cruzi , the etiologic agent of Chagas disease, is currently divided into six discrete typing units (DTUs), named TcI-TcVI. TcII is among the major DTUs enrolled in human infections in South America southern cone, where it is associated with severe cardiac and digestive symptoms. Despite the importance of TcII in Chagas disease epidemiology and pathology, so far, no genome-wide comparisons of the mitochondrial and nuclear genomes of TcII field isolates have been performed to track the variability and evolution of this DTU in endemic regions. Results In the present work, we have sequenced and compared the whole nuclear and mitochondrial genomes of seven TcII strains isolated from chagasic patients from the central and northeastern regions of Minas Gerais, Brazil, revealing an extensive genetic variability within this DTU. A comparison of the phylogeny based on the nuclear or mitochondrial genomes revealed that the majority of branches were shared by both sequences. The subtle divergences in the branches are probably consequence of mitochondrial introgression events between TcII strains. Two T. cruzi strains isolated from patients living in the central region of Minas Gerais, S15 and S162a, were clustered in the nuclear and mitochondrial phylogeny analysis. These two strains were isolated from the other five by the Espinhaço Mountains, a geographic barrier that could have restricted the traffic of insect vectors during T. cruzi evolution in the Minas Gerais state. Finally, the presence of aneuploidies was evaluated, revealing that all seven TcII strains have a different pattern of chromosomal duplication/loss. Conclusions Analysis of genomic variability and aneuploidies suggests that there is significant genomic variability within Minas Gerais TcII strains, which could be exploited by the parasite to allow rapid selection of favorable phenotypes. Also, the aneuploidy patterns vary among T. cruzi strains and does not correlate with the nuclear phylogeny, suggesting that chromosomal duplication/loss are recent and frequent events in the parasite evolution.

Journal Article

Share this book

Add to My Shelf

Antigenic and sequence variability of the human respiratory syncytial virus F glycoprotein compared to related viruses in a comprehensive dataset

by Nair, Harish , Williams, Thomas C. , Melero, Jose A. in Allergy and Immunology , Amino acid sequence , Amino acids

2018

A comprehensive analysis of sequence variation was carried out comparing the fusion (F) protein of human respiratory syncytial viruses (hRSV) from antigenic groups A and B with the prototype sequence of the A2 strain, also belonging to antigenic group A. The limited number of full bovine RSV F sequences available were included, as well as an extensive set of F sequences from the related human metapneumovirus (hMPV). The results were analysed in the context of the recently determined three dimensional F protein structures, with antigenic sites mapped to these. Although a high degree of sequence conservation in hRSV F exists, and sequence changes did not correlate with location of antigenic sites, preferential accumulation of amino acid changes in certain antigenic sites was noted. When the analysis was extended to hMPV F, a high number of changes was noticed, in agreement with the limited degree of sequence conservation. However, some conserved regions were noted, which may account for the limited number of cross-reactive monoclonal antibodies described between hRSV F and hMPV F. These results provide information about the degree of sequence and antigenic variation currently found in the F protein of circulating viruses. They highlight the importance of establishing a baseline dataset to monitor for future changes that might evolve should preventative immunological measures be made widely available.

Journal Article

Share this book

Add to My Shelf

Genomic insights into host-associated variants and transmission features of a ToBRFV isolate from Mexico

by Chavarín-Camacho, Claudia Yaritza , Ochoa-Martínez, Daniel Leobardo , Aviña-Padilla, Katia in Adaptability , Bioassays , Biological properties

2025

Tomato brown rugose fruit virus (ToBRFV) poses a global threat to tomato and pepper production due to its high transmissibility and adaptability. Understanding its genomic features and transmission mechanisms is critical for effective disease management. We characterized the genome and biological properties of a ToBRFV isolate from Mexico. Comparative genomic analyses were conducted using 100 global genomes, with particular focus on single nucleotide variants (SNVs) and their distribution across different host species. Phylogenetic analysis and experimental bioassays, including seed transmission tests, were also performed. Phylogenetic analysis revealed genetic proximity between the Mexican isolate and strains from Mexico, USA, Canada, Israel, and China, indicating shared transmission routes. Genomic comparisons confirmed general sequence stability, but SNVs were found in the 126-kDa replicase, particularly within the methyltransferase domain. These SNVs exhibited host-associated patterns, with conserved profiles in tomato and unique substitutions in Capsicum annum , Solanum nigrum , and Citrullus lanatus . Bioassays demonstrated susceptibility in additional solanaceous hosts, and seed transmission assays in Nicotiana rustica indicated reduced germination and low-frequency viral detection in seedlings. The study highlights the genomic conservation and host-specific variation in ToBRFV, suggesting that the methyltransferase domain may undergo differential evolutionary pressures. The findings provide valuable insights for improving risk assessment, seed health testing, and biosecurity measures.

Journal Article

Share this book

Add to My Shelf

Genomic Diversification, Structural Plasticity, and Hybridization in Leishmania (Viannia) braziliensis

by Muskus, Carlos , Muñoz, Marina , Cruz-Saavedra, Lissa in Archives & records , Cellular and Infection Microbiology , Chromosomes

2020

is an important species circulating in several Central and South American countries. Among species circulating in Brazil, Argentina and Colombia, has the highest genomic variability. However, genomic variability at the whole genome level has been only studied in Brazilian and Peruvian isolates; to date, no Colombian isolates have been studied. Considering that in Colombia, is a species with great clinical and therapeutic relevance, as well as the role of genetic variability in the epidemiology of leishmaniasis, we analyzed and evaluated intraspecific genomic variability of from Colombian and Bolivian isolates and compared them with Brazilian isolates. Twenty-one genomes were analyzed, six from Colombian patients, one from a Bolivian patient, and 14 Brazilian isolates downloaded from public databases. The results obtained of Phylogenomic analysis showed the existence of four well-supported clades, which evidenced intraspecific variability. The whole-genome analysis revealed structural variations in the somy, mainly in the Brazilian genomes (clade 1 and clade 3), low copy number variations, and a moderate number of single-nucleotide polymorphisms (SNPs) in all genomes analyzed. Interestingly, the genomes belonging to clades 2 and 3 from Colombia and Brazil, respectively, were characterized by low heterozygosity (~90% of SNP loci were homozygous) and regions suggestive of loss of heterozygosity (LOH). Additionally, we observed the drastic whole genome loss of heterozygosity and possible hybridization events in one genome belonging to clade 4. Unique/shared SNPs between and within the four clades were identified, revealing the importance of some of them in biological processes of . Our analyses demonstrate high genomic variability of in different regions of South America, mainly in Colombia and suggest that this species exhibits striking genomic diversity and a capacity of genomic hybridization; additionally, this is the first study to report whole-genome sequences of Colombian isolates.

Journal Article

Share this book

Add to My Shelf

Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis

by Ulyantsev, Vladimir I. , Dubinkina, Veronika B. , Ischenko, Dmitry S. in Algorithms , Bioinformatics , Biomedical and Life Sciences

2016

Background A rapidly increasing flow of genomic data requires the development of efficient methods for obtaining its compact representation. Feature extraction facilitates classification, clustering and model analysis for testing and refining biological hypotheses. “Shotgun” metagenome is an analytically challenging type of genomic data - containing sequences of all genes from the totality of a complex microbial community. Recently, researchers started to analyze metagenomes using reference-free methods based on the analysis of oligonucleotides ( k- mers) frequency spectrum previously applied to isolated genomes. However, little is known about their correlation with the existing approaches for metagenomic feature extraction, as well as the limits of applicability. Here we evaluated a metagenomic pairwise dissimilarity measure based on short k- mer spectrum using the example of human gut microbiota, a biomedically significant object of study. Results We developed a method for calculating pairwise dissimilarity (beta-diversity) of “shotgun” metagenomes based on short k- mer spectra (5≤ k ≤11). The method was validated on simulated metagenomes and further applied to a large collection of human gut metagenomes from the populations of the world ( n =281). The k- mer spectrum-based measure was found to behave similarly to one based on mapping to a reference gene catalog, but different from one using a genome catalog. This difference turned out to be associated with a significant presence of viral reads in a number of metagenomes. Simulations showed limited impact of bacterial genetic variability as well as sequencing errors on k- mer spectra. Specific differences between the datasets from individual populations were identified. Conclusions Our approach allows rapid estimation of pairwise dissimilarity between metagenomes. Though we applied this technique to gut microbiota, it should be useful for arbitrary metagenomes, even metagenomes with novel microbiota. Dissimilarity measure based on k- mer spectrum provides a wider perspective in comparison with the ones based on the alignment against reference sequence sets. It helps not to miss possible outstanding features of metagenomic composition, particularly related to the presence of an unknown bacteria, virus or eukaryote, as well as to technical artifacts (sample contamination, reads of non-biological origin, etc.) at the early stages of bioinformatic analysis. Our method is complementary to reference-based approaches and can be easily integrated into metagenomic analysis pipelines.

Journal Article

Share this book

Add to My Shelf

Genomic approach to manage genetic variability in dairy farms

by Bernini, Francesca , Punturiero, Chiara , Delledonne, Andrea in Agriculture , alleles , animal science

2023

In this study we investigated the genetic variability, the inbreeding and allele frequencies of monogenic traits in seven herds of Holstein breed and provided insight to farmers on the value of genomic management of reproduction in their herds. A total of 3,953 Holstein cows were sampled and genotyped with the Neogen GGP Bovine 100K SNP chip within the activities of the Regione Lombardia funded GO-PEI project ‘GENOmic tool for the management of reproduction in dairy cattle and for the control of inbreeding – GENORIP’. Principal component analysis was applied for analysing the genetic variability within and among farms using the SVS software of Golden Helix. Run of Homozygosity (ROH) and the genomic inbreeding were obtained with the detectRUNS package of the R software. Genotype frequencies for mendelian disease, fertility and production traits were also obtained. A total of 458,267 ROH were identified and ROH were distributed on all autosomes with an average length of 2,703,811 bp covering 12.7% of the genome. Several genomic regions appear under selection, while a specific region on BTA4 was identified in one herd, harbouring genes mainly related to the specific selection strategy of the farmer. The FROH values obtained considering ROH greater than 16 Mb, varied from 0.004 to 0.325, with the highest FROH average value of 0.136. Among mendelian heritable diseases, the Haplotype Cholesterol Deficiency was the one with the largest proportion of carrier animals, i.e. 5.6%. A herd-tailored process to assist farmers in genomic management of reproduction was released. The ROH distribution within herd, together with the genotype frequencies for disease, fertility and production mendelian traits, suggest that similar directional selection is occurring across herds. This study released to each farmer the genomic make-up of their herd used jointly with the gEBV estimated by their national breeders’ association (ANAFIBJ) for herd reproductive management.

Journal Article

Share this book

Add to My Shelf

Evolutionary Profile of Mayaro Virus in the Americas: An Update into Genome Variability

by Jardim, Ana Carolina Gomes , Ferreira, Giulia Magalhães , Marinho, Mikaela dos Santos in Adaptability , Alphavirus - classification , Alphavirus - genetics

2024

The Mayaro virus (MAYV) is an arbovirus with emerging potential, though with a limited understanding of its epidemiology and evolution due to the lack of studies and surveillance. Here, we investigated 71 MAYV genome sequences from the Americas available at GenBank and characterized the phylogenetic relationship among virus strains. A phylogenetic analysis showed that sequences were grouped according to the genotypes L, D, and N. Genotype D sequences were closely related to sequences collected in adjacent years and from their respective countries, suggesting that isolates may have originated from circulating lineages. The coalescent analysis demonstrated similar results, indicating the continuous circulation of the virus between countries as well. An unidentified sequence from the USA was grouped with genotype D, suggesting the insertion of this genotype in the country. Furthermore, the recombination analysis detected homologous and three heterologous hybrids which presented an insertion into the nsP3 protein. Amino acid substitutions among sequences indicated selective pressure sites, suggesting viral adaptability. This also impacted the binding affinity between the E1–E2 protein complex and the Mxra8 receptor, associated with MAYV entry into human cells. These results provide information for a better understanding of genotypes circulating in the Americas.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter