Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
      More Filters
      Clear All
      More Filters
      Source
    • Language
193,555 result(s) for "DNA sequences"
Sort by:
DNA sequencing at 40: past, present and future
This review commemorates the 40th anniversary of DNA sequencing, a period in which we have already witnessed multiple technological revolutions and a growth in scale from a few kilobases to the first human genome, and now to millions of human and a myriad of other genomes. DNA sequencing has been extensively and creatively repurposed, including as a ‘counter’ for a vast range of molecular phenomena. We predict that in the long view of history, the impact of DNA sequencing will be on a par with that of the microscope. The history and future potential of DNA sequencing, including the development of the underlying technologies and the expansion of its areas of application, are reviewed. DNA sequencing at 40 This year marks the 40th anniversary of the Sanger method for DNA sequencing, the most widely used sequencing method, pioneered by Fred Sanger and his team in 1977. Jay Shendure and colleagues review the evolution of sequencing technologies since their inception, highlighting the major milestones in the development, analyses and applications of genome sequencing over the past 40 years. Despite multiple technological revolutions and growth in scale, the authors see DNA sequencing as a relatively nascent technology in the grand scheme of scientific history. They review current emerging applications and discuss the continued evolution and future of DNA sequencing from population-scale resequencing to networks of portable sensors used for real-time monitoring in environmental settings.
Rapid DNA Sequencing Technology Based on the Sanger Method for Bacterial Identification
Antimicrobial resistance, a global health concern, has been increasing due to inappropriate use of antibacterial agents. To facilitate early treatment of sepsis, rapid bacterial identification is imperative to determine appropriate antibacterial agent for better therapeutic outcomes. In this study, we developed a rapid PCR method, rapid cycle sequencing, and microchip electrophoresis, which are the three elemental technologies for DNA sequencing based on the Sanger sequencing method, for bacterial identification. We achieved PCR amplification within 13 min and cycle sequencing within 14 min using a rapid thermal cycle system applying microfluidic technology. Furthermore, DNA analysis was completed in 14 min by constructing an algorithm for analyzing and performing microchip electrophoresis. Thus, the three elemental Sanger-based DNA sequencing steps were accomplished within 41 min. Development of a rapid purification process subsequent to PCR and cycle sequence using a microchip would help realize the identification of causative bacterial agents within one hour, and facilitate early treatment of sepsis.
Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing
Background The rapid evolution of 454 GS-FLX sequencing technology has not been accompanied by a reassessment of the quality and accuracy of the sequences obtained. Current strategies for decision-making and error-correction are based on an initial analysis by Huse et al. in 2007, for the older GS20 system based on experimental sequences. We analyze here the quality of 454 sequencing data and identify factors playing a role in sequencing error, through the use of an extensive dataset for Roche control DNA fragments. Results We obtained a mean error rate for 454 sequences of 1.07%. More importantly, the error rate is not randomly distributed; it occasionally rose to more than 50% in certain positions, and its distribution was linked to several experimental variables. The main factors related to error are the presence of homopolymers, position in the sequence, size of the sequence and spatial localization in PT plates for insertion and deletion errors. These factors can be described by considering seven variables. No single variable can account for the error rate distribution, but most of the variation is explained by the combination of all seven variables. Conclusions The pattern identified here calls for the use of internal controls and error-correcting base callers, to correct for errors, when available (e.g. when sequencing amplicons). For shotgun libraries, the use of both sequencing primers and deep coverage, combined with the use of random sequencing primer sites should partly compensate for even high error rates, although it may prove more difficult than previous thought to distinguish between low-frequency alleles and errors.
Human Genome Sequencing Using Unchained Base Reads on Self-Assembling DNA Nanoarrays
Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45-to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of $4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.
Sequencing technologies — the next generation
Key Points The major advance offered by next-generation sequencing (NGS) technologies is the ability to produce, in some cases, in excess of one billion short reads per instrument run, which makes them useful for many biological applications. The variety of NGS features makes it likely that multiple platforms will coexist in the marketplace, with some having clear advantages for particular applications over others. The leading NGS platforms use clonally amplified templates, which are not affected by the arbitrary losses of genomic sequences that are inherent in bacterial cloning methods. An important advantage of single-molecule template platforms is that PCR is not required. PCR can create mutations that masquerade as sequence variants and amplification bias that underrepresents AT-rich and GC-rich regions in target sequences. There are four primary NGS chemistry methods: cyclic reversible termination, sequencing by ligation, pyrosequencing and real-time sequencing, which are described in this Review. To call sequence variants in genomes, NGS reads are aligned to a reference sequence using various bioinformatics mapping tools. Whole-genome sequencing using current NGS platforms is still expensive, but targeting regions of interest may provide an interim solution to analysing hundreds, if not thousands, of samples. To date, the sequences of twelve human genomes have been published using a number of NGS platforms, marking the beginning of personalized genomics. NGS costs will continue to drop in the foreseeable future, although the cost reduction should be weighed against the quality of the produced genome sequence. There is an increasing demand for next-generation sequencing technologies that rapidly deliver high volumes of accurate genome information at a low cost. This Review provides a guide to the features of the different platforms, and describes the recent advances in this fast-moving area. Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.
Real-time, portable genome sequencing for Ebola surveillance
A nanopore DNA sequencer is used for real-time genomic surveillance of the Ebola virus epidemic in the field in Guinea; the authors demonstrate that it is possible to pack a genomic surveillance laboratory in a suitcase and transport it to the field for on-site virus sequencing, generating results within 24 hours of sample collection. Ebola virus genomics surveillance This paper reports the use of nanopore DNA sequencers (known as MinIONs) for real-time genomic surveillance of the Ebola virus epidemic, in the field in Guinea. The authors demonstrate that it is possible to pack a genomic surveillance laboratory in a suitcase and transport it to the field for on-site virus sequencing, generating results within 24 hours of sample collection. The Ebola virus disease epidemic in West Africa is the largest on record, responsible for over 28,599 cases and more than 11,299 deaths 1 . Genome sequencing in viral outbreaks is desirable to characterize the infectious agent and determine its evolutionary rate. Genome sequencing also allows the identification of signatures of host adaptation, identification and monitoring of diagnostic targets, and characterization of responses to vaccines and treatments. The Ebola virus (EBOV) genome substitution rate in the Makona strain has been estimated at between 0.87 × 10 −3 and 1.42 × 10 −3 mutations per site per year. This is equivalent to 16–27 mutations in each genome, meaning that sequences diverge rapidly enough to identify distinct sub-lineages during a prolonged epidemic 2 , 3 , 4 , 5 , 6 , 7 . Genome sequencing provides a high-resolution view of pathogen evolution and is increasingly sought after for outbreak surveillance. Sequence data may be used to guide control measures, but only if the results are generated quickly enough to inform interventions 8 . Genomic surveillance during the epidemic has been sporadic owing to a lack of local sequencing capacity coupled with practical difficulties transporting samples to remote sequencing facilities 9 . To address this problem, here we devise a genomic surveillance system that utilizes a novel nanopore DNA sequencing instrument. In April 2015 this system was transported in standard airline luggage to Guinea and used for real-time genomic surveillance of the ongoing epidemic. We present sequence data and analysis of 142 EBOV samples collected during the period March to October 2015. We were able to generate results less than 24 h after receiving an Ebola-positive sample, with the sequencing process taking as little as 15–60 min. We show that real-time genomic surveillance is possible in resource-limited settings and can be established rapidly to monitor outbreaks.
Nanopore Sequencing: From Imagination to Reality
Information about the identity of an analyte is contained within the mean duration of the current blockades, the amplitude of the blockades, and additional characteristics of the blockades, such as an increase in current noise while the analyte is bound. Because signals from related analytes (e.g., various divalent metal ions or structurally related organic molecules) differ, the engineered binding site does not have to be absolutely selective for an analyte, which is often very difficult to achieve. [...]it should not be forgotten that nanopore sensing is a platform technology, readily adapted for the detection of virtually any water-soluble analyte (5 ).
Review on the Application of Machine Learning Algorithms in the Sequence Data Mining of DNA
Deoxyribonucleic acid (DNA) is a biological macromolecule. Its main function is information storage. At present, the advancement of sequencing technology had caused DNA sequence data to grow at an explosive rate, which has also pushed the study of DNA sequences in the wave of big data. Moreover, machine learning is a powerful technique for analyzing largescale data and learns spontaneously to gain knowledge. It has been widely used in DNA sequence data analysis and obtained a lot of research achievements. Firstly, the review introduces the development process of sequencing technology, expounds on the concept of DNA sequence data structure and sequence similarity. Then we analyze the basic process of data mining, summary several major machine learning algorithms, and put forward the challenges faced by machine learning algorithms in the mining of biological sequence data and possible solutions in the future. Then we review four typical applications of machine learning in DNA sequence data: DNA sequence alignment, DNA sequence classification, DNA sequence clustering, and DNA pattern mining. We analyze their corresponding biological application background and significance, and systematically summarized the development and potential problems in the field of DNA sequence data mining in recent years. Finally, we summarize the content of the review and look into the future of some research directions for the next step.
Genomic landscape of the global oak phylogeny
The tree of life is highly reticulate, with the history of population divergence emerging from populations of gene phylogenies that reflect histories of introgression, lineage sorting and divergence. In this study, we investigate global patterns of oak diversity and test the hypothesis that there are regions of the oak genome that are broadly informative about phylogeny.We utilize fossil data and restriction-site associated DNA sequencing (RAD-seq) for 632 individuals representing nearly 250 Quercus species to infer a time-calibrated phylogeny of the world's oaks. We use a reversible-jump Markov chain Monte Carlo method to reconstruct shifts in lineage diversification rates, accounting for among-clade sampling biases. We then map the > 20 000 RAD-seq loci back to an annotated oak genome and investigate genomic distribution of introgression and phylogenetic support across the phylogeny.Oak lineages have diversified among geographic regions, followed by ecological divergence within regions, in the Americas and Eurasia. Roughly 60% of oak diversity traces back to four clades that experienced increases in net diversification, probably in response to climatic transitions or ecological opportunity.The strong support for the phylogeny contrasts with high genomic heterogeneity in phylogenetic signal and introgression. Oaks are phylogenomic mosaics, and their diversity may in fact depend on the gene flow that shapes the oak genome.
Limitations of next-generation genome sequence assembly
High-throughput sequencing technologies promise to transform the fields of genetics and comparative biology by delivering tens of thousands of genomes in the near future. Although it is feasible to construct de novo genome assemblies in a few months, there has been relatively little attention to what is lost by sole application of short sequence reads. We compared the recent de novo assemblies using the short oligonucleotide analysis package (SOAP), generated from the genomes of a Han Chinese individual and a Yoruban individual, to experimentally validated genomic features. We found that de novo assemblies were 16.2% shorter than the reference genome and that 420.2 megabase pairs of common repeats and 99.1% of validated duplicated sequences were missing from the genome. Consequently, over 2,377 coding exons were completely missing. We conclude that high-quality sequencing approaches must be considered in conjunction with high-throughput sequencing for comparative genomics analyses and studies of genome evolution.