Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
395
result(s) for
"PacBio sequencing"
Sort by:
Long-read sequencing revealed cooccurrence, host range, and potential mobility of antibiotic resistome in cow feces
2021
While it is well recognized that the environmental resistome is global, diverse, and augmented by human activities, it has been difficult to assess risk because of the inability to culture many environmental organisms, and it is difficult to evaluate risk from current sequence-based environmental methods. The four most important criteria to determine risk are whether the antibioticresistance genes (ARGs) are a complete, potentially functional complement; if they are linked with other resistances; whether they are mobile; and the identity of their host. Long-read sequencing fills this important gap between culture and short sequencebased methods. To address these criteria, we collected feces from a ceftiofur-treated cow, enriched the samples in the presence of antibiotics to favor ARG functionality, and sequenced long reads using Nanopore and PacBio technologies. Multidrug-resistance genes comprised 58% of resistome abundance, but only 0.8% of them were plasmid associated; fluroquinolone-, aminoglycoside-, macrolide-lincosamide-streptogramin (MLS)-, and β-lactam–resistance genes accounted for 2.7 to 12.3% of resistome abundance but with 19 to 78%located on plasmids. A variety of plasmid types were assembled, some of which share low similarity to plasmids in current databases. Enterobacteriaceae were dominant hosts of antibiotic-resistant plasmids; physical linkage of extended-spectrum β-lactamase genes (CTX-M, TEM, CMY, and CARB) was largely found with aminoglycoside-, MLS-, tetracycline-, trimethoprim-, phenicol-, sulfonamide-, and mercury-resistance genes. A draft circular chromosome of Vagococcus lutrae was assembled; it carries MLS-, tetracycline- (including tetM and tetL on an integrative conjugative element), and trimethoprim-resistance genes flanked by many transposase genes and insertion sequences, implying that they remain transferrable.
Journal Article
Third-Generation Sequencing: The Spearhead towards the Radical Transformation of Modern Genomics
by
Scorilas, Andreas
,
Adamopoulos, Panagiotis G.
,
Boti, Michaela A.
in
Assembly
,
Bioinformatics
,
direct RNA sequencing
2021
Although next-generation sequencing (NGS) technology revolutionized sequencing, offering a tremendous sequencing capacity with groundbreaking depth and accuracy, it continues to demonstrate serious limitations. In the early 2010s, the introduction of a novel set of sequencing methodologies, presented by two platforms, Pacific Biosciences (PacBio) and Oxford Nanopore Sequencing (ONT), gave birth to third-generation sequencing (TGS). The innovative long-read technologies turn genome sequencing into an ease-of-handle procedure by greatly reducing the average time of library construction workflows and simplifying the process of de novo genome assembly due to the generation of long reads. Long sequencing reads produced by both TGS methodologies have already facilitated the decipherment of transcriptional profiling since they enable the identification of full-length transcripts without the need for assembly or the use of sophisticated bioinformatics tools. Long-read technologies have also provided new insights into the field of epitranscriptomics, by allowing the direct detection of RNA modifications on native RNA molecules. This review highlights the advantageous features of the newly introduced TGS technologies, discusses their limitations and provides an in-depth comparison regarding their scientific background and available protocols as well as their potential utility in research and clinical applications.
Journal Article
DNA barcoding, an effective tool for species identification: a review
2023
DNA barcoding is a powerful taxonomic tool to identify and discover species. DNA barcoding utilizes one or more standardized short DNA regions for taxon identification. With the emergence of new sequencing techniques, such as Next-generation sequencing (NGS), ONT MinION nanopore sequencing, and Pac Bio sequencing, DNA barcoding has become more accurate, fast, and reliable. Rapid species identification by DNA barcodes has been used in a variety of fields, including forensic science, control of the food supply chain, and disease understanding. The Consortium for Barcode of Life (CBOL) presents various working groups to identify the universal barcode gene, such as COI in metazoans; rbcL, matK, and ITS in plants; ITS in fungi; 16S rRNA gene in bacteria and archaea, and creating a reference DNA barcode library. In this article, an attempt has been made to analyze the various proposed DNA barcode for different organisms, strengths & limitations, recent advancements in DNA barcoding, and methods to speed up the DNA barcode reference library construction. This study concludes that constructing a reference library with high species coverage would be a major step toward identifying species by DNA barcodes. This can be achieved in a short period of time by using advanced sequencing and data analysis methods.
Journal Article
Transcriptome analysis of heat stress and drought stress in pearl millet based on Pacbio full-length transcriptome sequencing
by
Wang, Xiaoshan
,
Huang, Linkai
,
Zhang, Ailing
in
Agricultural production
,
Agriculture
,
Analysis
2020
Background
Heat and drought are serious threats for crop growth and development. As the sixth largest cereal crop in the world, pearl millet can not only be used for food and forage but also as a source of bioenergy. Pearl millet is highly tolerant to heat and drought. Given this, it is considered an ideal crop to study plant stress tolerance and can be used to identify heat-resistant genes.
Results
In this study, we used Pacbio sequencing data as a reference sequence to analyze the Illumina data of pearl millet that had been subjected to heat and drought stress for 48 h. By summarizing previous studies, we found 26,299 new genes and 63,090 new transcripts, and the number of gene annotations increased by 20.18%. We identified 2792 transcription factors and 1223 transcriptional regulators. There were 318 TFs and 149 TRs differentially expressed under heat stress, and 315 TFs and 128 TRs were differentially expressed under drought stress. We used RNA sequencing to identify 6920 genes and 6484 genes differentially expressed under heat stress and drought stress, respectively.
Conclusions
Through Pacbio sequencing, we have identified more new genes and new transcripts. On the other hand, comparing the differentially expressed genes under heat tolerance with the DEGs under drought stress, we found that even in the same pathway, pearl millet responds with a different protein.
Journal Article
Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms
by
Zapata, Luis
,
Koornneef, Maarten
,
Ding, Jia
in
Arabidopsis
,
Arabidopsis - genetics
,
Arabidopsis thaliana
2016
Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana. Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies will replace resequencing or other reference-dependent methods.
Journal Article
Complete Chloroplast Genome Sequences of Kaempferia Galanga and Kaempferia Elegans: Molecular Structures and Comparative Analysis
by
Li, Dong-Mei
,
Zhao, Chao-Yi
,
Liu, Xiao-Fei
in
chloroplast genome
,
Chloroplasts
,
comparative analysis
2019
Kaempferia galanga and Kaempferia elegans, which belong to the genus Kaempferia family Zingiberaceae, are used as valuable herbal medicine and ornamental plants, respectively. The chloroplast genomes have been used for molecular markers, species identification and phylogenetic studies. In this study, the complete chloroplast genome sequences of K. galanga and K. elegans are reported. Results show that the complete chloroplast genome of K. galanga is 163,811 bp long, having a quadripartite structure with large single copy (LSC) of 88,405 bp and a small single copy (SSC) of 15,812 bp separated by inverted repeats (IRs) of 29,797 bp. Similarly, the complete chloroplast genome of K. elegans is 163,555 bp long, having a quadripartite structure in which IRs of 29,773 bp length separates 88,020 bp of LSC and 15,989 bp of SSC. A total of 111 genes in K. galanga and 113 genes in K. elegans comprised 79 protein-coding genes and 4 ribosomal RNA (rRNA) genes, as well as 28 and 30 transfer RNA (tRNA) genes in K. galanga and K. elegans, respectively. The gene order, GC content and orientation of the two Kaempferia chloroplast genomes exhibited high similarity. The location and distribution of simple sequence repeats (SSRs) and long repeat sequences were determined. Eight highly variable regions between the two Kaempferia species were identified and 643 mutation events, including 536 single-nucleotide polymorphisms (SNPs) and 107 insertion/deletions (indels), were accurately located. Sequence divergences of the whole chloroplast genomes were calculated among related Zingiberaceae species. The phylogenetic analysis based on SNPs among eleven species strongly supported that K. galanga and K. elegans formed a cluster within Zingiberaceae. This study identified the unique characteristics of the entire K. galanga and K. elegans chloroplast genomes that contribute to our understanding of the chloroplast DNA evolution within Zingiberaceae species. It provides valuable information for phylogenetic analysis and species identification within genus Kaempferia.
Journal Article
Evaluation of PacBio sequencing for full-length bacterial 16S rRNA gene classification
by
Lawley, Trevor D.
,
Coupland, Paul
,
Parkhill, Julian
in
Accuracy
,
Bacteria
,
Bacterial genetics
2016
Background
Currently, bacterial 16S rRNA gene analyses are based on sequencing of individual variable regions of the 16S rRNA gene (Kozich, et al
Appl Environ Microbiol
79:5112–5120, 2013).This short read approach can introduce biases. Thus, full-length bacterial 16S rRNA gene sequencing is needed to reduced biases. A new alternative for full-length bacterial 16S rRNA gene sequencing is offered by PacBio single molecule, real-time (SMRT) technology. The aim of our study was to validate PacBio P6 sequencing chemistry using three approaches: 1) sequencing the full-length bacterial 16S rRNA gene from a single bacterial species
Staphylococcus aureus
to analyze error modes and to optimize the bioinformatics pipeline; 2) sequencing the full-length bacterial 16S rRNA gene from a pool of 50 different bacterial colonies from human stool samples to compare with full-length bacterial 16S rRNA capillary sequence; and 3) sequencing the full-length bacterial 16S rRNA genes from 11 vaginal microbiome samples and compare with in
silico
selected bacterial 16S rRNA V1V2 gene region and with bacterial 16S rRNA V1V2 gene regions sequenced using the Illumina MiSeq.
Results
Our optimized bioinformatics pipeline for PacBio sequence analysis was able to achieve an error rate of 0.007% on the S
taphylococcus aureus
full-length 16S rRNA gene. Capillary sequencing of the full-length bacterial 16S rRNA gene from the pool of 50 colonies from stool identified 40 bacterial species of which up to 80% could be identified by PacBio full-length bacterial 16S rRNA gene sequencing. Analysis of the human vaginal microbiome using the bacterial 16S rRNA V1V2 gene region on MiSeq generated 129 operational taxonomic units (OTUs) from which 70 species could be identified. For the PacBio, 36,000 sequences from over 58,000 raw reads could be assigned to a barcode, and the
in silico
selected bacterial 16S rRNA V1V2 gene region generated 154 OTUs grouped into 63 species, of which 62% were shared with the MiSeq dataset. The PacBio full-length bacterial 16S rRNA gene datasets generated 261 OTUs, which were grouped into 52 species, of which 54% were shared with the MiSeq dataset. Alpha diversity index reported a higher diversity in the MiSeq dataset.
Conclusion
The PacBio sequencing error rate is now in the same range of the previously widely used Roche 454 sequencing platform and current MiSeq platform. Species-level microbiome analysis revealed some inconsistencies between the full-length bacterial 16S rRNA gene capillary sequencing and PacBio sequencing.
Journal Article
The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination
by
Chen, Fei
,
Liu, Yanhui
,
Zhang, Shouzhou
in
Amborella
,
Angiosperms
,
Animal Genetics and Genomics
2018
Background
Mitochondrial genomes of flowering plants (angiosperms) are highly dynamic in genome structure. The mitogenome of the earliest angiosperm
Amborella
is remarkable in carrying rampant foreign DNAs, in contrast to
Liriodendron
, the other only known early angiosperm mitogenome that is described as ‘fossilized’. The distinctive features observed in the two early flowering plant mitogenomes add to the current confusions of what early flowering plants look like. Expanded sampling would provide more details in understanding the mitogenomic evolution of early angiosperms. Here we report the complete mitochondrial genome of water lily
Nymphaea colorata
from Nymphaeales, one of the three orders of the earliest angiosperms.
Results
Assembly of data from Pac-Bio long-read sequencing yielded a circular mitochondria chromosome of 617,195 bp with an average depth of 601×. The genome encoded 41 protein coding genes, 20 tRNA and three rRNA genes with 25 group II introns disrupting 10 protein coding genes. Nearly half of the genome is composed of repeated sequences, which contributed substantially to the intron size expansion, making the gross intron length of the
Nymphaea
mitochondrial genome one of the longest among angiosperms, including an 11.4-Kb intron in
cox2
, which is the longest organellar intron reported to date in plants. Nevertheless, repeat mediated homologous recombination is unexpectedly low in
Nymphaea
evidenced by 74 recombined reads detected from ten recombinationally active repeat pairs among 886,982 repeat pairs examined. Extensive gene order changes were detected in the three early angiosperm mitogenomes, i.e. 38 or 44 events of inversions and translocations are needed to reconcile the mitogenome of
Nymphaea
with
Amborella
or
Liriodendron
, respectively. In contrast to
Amborella
with six genome equivalents of foreign mitochondrial DNA, not a single horizontal gene transfer event was observed in the
Nymphaea
mitogenome.
Conclusions
The
Nymphaea
mitogenome resembles the other available early angiosperm mitogenomes by a similarly rich 64-coding gene set, and many conserved gene clusters, whereas stands out by its highly repetitive nature and resultant remarkable intron expansions. The low recombination level in
Nymphaea
provides evidence for the predominant master conformation in vivo with a highly substoichiometric set of rearranged molecules.
Journal Article
A chromosome‐level genome assembly provides insights into ascorbic acid accumulation and fruit softening in guava (Psidium guajava)
2021
Summary Guava (Psidium guajava) is an important fleshy‐fruited tree of the Myrtaceae family that is widely cultivated in tropical and subtropical areas of the world and has attracted considerable attention for the richness of ascorbic acid in its fruits. However, studies on the evolution and genetic breeding potential of guava are hindered by the lack of a reference genome. Here, we present a chromosome‐level genomic assembly of guava using PacBio sequencing and Hi‐C technology. We found that the genome assembly size was 443.8 Mb with a contig N50 of ~15.8 Mb. We annotated a total of 25 601 genes and 193.2 Mb of repetitive sequences for this genome. Comparative genomic analysis revealed that guava has undergone a recent whole‐genome duplication (WGD) event shared by all species in Myrtaceae. In addition, through metabolic analysis, we determined that the L‐galactose pathway plays a major role in ascorbic acid biosynthesis in guava fruits. Moreover, the softening of fruits of guava may result from both starch and cell wall degradation according to analyses of gene expression profiles and positively selected genes. Our data provide a foundational resource to support molecular breeding of guava and represent new insights into the evolution of soft, fleshy fruits in Myrtaceae.
Journal Article
Amplification-free long-read sequencing reveals unforeseen CRISPR-Cas9 off-target activity
by
den Hoed, Marcel
,
Ameur, Adam
,
Chin, Chen-Shan
in
Animal Genetics and Genomics
,
Base Sequence
,
Bioinformatics
2020
Background
One ongoing concern about CRISPR-Cas9 genome editing is that unspecific guide RNA (gRNA) binding may induce off-target mutations. However, accurate prediction of CRISPR-Cas9 off-target activity is challenging. Here, we present SMRT-OTS and Nano-OTS, two novel, amplification-free, long-read sequencing protocols for detection of gRNA-driven digestion of genomic DNA by Cas9 in vitro.
Results
The methods are assessed using the human cell line HEK293, re-sequenced at 18x coverage using highly accurate HiFi SMRT reads. SMRT-OTS and Nano-OTS are first applied to three different gRNAs targeting HEK293 genomic DNA, resulting in a set of 55 high-confidence gRNA cleavage sites identified by both methods. Twenty-five of these sites are not reported by off-target prediction software, either because they contain four or more single nucleotide mismatches or insertion/deletion mismatches, as compared with the human reference. Additional experiments reveal that 85% of Cas9 cleavage sites are also found by other in vitro-based methods and that on- and off-target sites are detectable in gene bodies where short-reads fail to uniquely align. Even though SMRT-OTS and Nano-OTS identify several sites with previously validated off-target editing activity in cells, our own CRISPR-Cas9 editing experiments in human fibroblasts do not give rise to detectable off-target mutations at the in vitro-predicted sites. However, indel and structural variation events are enriched at the on-target sites.
Conclusions
Amplification-free long-read sequencing reveals Cas9 cleavage sites in vitro that would have been difficult to predict using computational tools, including in dark genomic regions inaccessible by short-read sequencing.
Journal Article