Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
110,410
result(s) for
"Sequence Analysis, DNA"
Sort by:
DNA sequencing at 40: past, present and future
2017
This review commemorates the 40th anniversary of DNA sequencing, a period in which we have already witnessed multiple technological revolutions and a growth in scale from a few kilobases to the first human genome, and now to millions of human and a myriad of other genomes. DNA sequencing has been extensively and creatively repurposed, including as a ‘counter’ for a vast range of molecular phenomena. We predict that in the long view of history, the impact of DNA sequencing will be on a par with that of the microscope.
The history and future potential of DNA sequencing, including the development of the underlying technologies and the expansion of its areas of application, are reviewed.
DNA sequencing at 40
This year marks the 40th anniversary of the Sanger method for DNA sequencing, the most widely used sequencing method, pioneered by Fred Sanger and his team in 1977. Jay Shendure and colleagues review the evolution of sequencing technologies since their inception, highlighting the major milestones in the development, analyses and applications of genome sequencing over the past 40 years. Despite multiple technological revolutions and growth in scale, the authors see DNA sequencing as a relatively nascent technology in the grand scheme of scientific history. They review current emerging applications and discuss the continued evolution and future of DNA sequencing from population-scale resequencing to networks of portable sensors used for real-time monitoring in environmental settings.
Journal Article
Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing
by
Malausa, Thibaut
,
Martin, Jean-François
,
Pech, Nicolas
in
Analysis
,
Animal Genetics and Genomics
,
DNA sequencing
2011
Background
The rapid evolution of 454 GS-FLX sequencing technology has not been accompanied by a reassessment of the quality and accuracy of the sequences obtained. Current strategies for decision-making and error-correction are based on an initial analysis by Huse
et al.
in 2007, for the older GS20 system based on experimental sequences. We analyze here the quality of 454 sequencing data and identify factors playing a role in sequencing error, through the use of an extensive dataset for Roche control DNA fragments.
Results
We obtained a mean error rate for 454 sequences of 1.07%. More importantly, the error rate is not randomly distributed; it occasionally rose to more than 50% in certain positions, and its distribution was linked to several experimental variables. The main factors related to error are the presence of homopolymers, position in the sequence, size of the sequence and spatial localization in PT plates for insertion and deletion errors. These factors can be described by considering seven variables. No single variable can account for the error rate distribution, but most of the variation is explained by the combination of all seven variables.
Conclusions
The pattern identified here calls for the use of internal controls and error-correcting base callers, to correct for errors, when available (e.g. when sequencing amplicons). For shotgun libraries, the use of both sequencing primers and deep coverage, combined with the use of random sequencing primer sites should partly compensate for even high error rates, although it may prove more difficult than previous thought to distinguish between low-frequency alleles and errors.
Journal Article
Human Genome Sequencing Using Unchained Base Reads on Self-Assembling DNA Nanoarrays
by
Reid, Clifford A.
,
Mutch, Karl
,
Peterson, Joe
in
assays
,
Base Sequence
,
Biological and medical sciences
2010
Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45-to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of $4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.
Journal Article
Sequencing technologies — the next generation
2010
Key Points
The major advance offered by next-generation sequencing (NGS) technologies is the ability to produce, in some cases, in excess of one billion short reads per instrument run, which makes them useful for many biological applications.
The variety of NGS features makes it likely that multiple platforms will coexist in the marketplace, with some having clear advantages for particular applications over others.
The leading NGS platforms use clonally amplified templates, which are not affected by the arbitrary losses of genomic sequences that are inherent in bacterial cloning methods.
An important advantage of single-molecule template platforms is that PCR is not required. PCR can create mutations that masquerade as sequence variants and amplification bias that underrepresents AT-rich and GC-rich regions in target sequences.
There are four primary NGS chemistry methods: cyclic reversible termination, sequencing by ligation, pyrosequencing and real-time sequencing, which are described in this Review.
To call sequence variants in genomes, NGS reads are aligned to a reference sequence using various bioinformatics mapping tools.
Whole-genome sequencing using current NGS platforms is still expensive, but targeting regions of interest may provide an interim solution to analysing hundreds, if not thousands, of samples.
To date, the sequences of twelve human genomes have been published using a number of NGS platforms, marking the beginning of personalized genomics.
NGS costs will continue to drop in the foreseeable future, although the cost reduction should be weighed against the quality of the produced genome sequence.
There is an increasing demand for next-generation sequencing technologies that rapidly deliver high volumes of accurate genome information at a low cost. This Review provides a guide to the features of the different platforms, and describes the recent advances in this fast-moving area.
Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.
Journal Article
Real-time, portable genome sequencing for Ebola surveillance
by
Koundouno, Raymond
,
Kosgey, Abigael
,
Severi, Ettore
in
631/181/735
,
631/208/325
,
631/208/514/2254
2016
A nanopore DNA sequencer is used for real-time genomic surveillance of the Ebola virus epidemic in the field in Guinea; the authors demonstrate that it is possible to pack a genomic surveillance laboratory in a suitcase and transport it to the field for on-site virus sequencing, generating results within 24 hours of sample collection.
Ebola virus genomics surveillance
This paper reports the use of nanopore DNA sequencers (known as MinIONs) for real-time genomic surveillance of the Ebola virus epidemic, in the field in Guinea. The authors demonstrate that it is possible to pack a genomic surveillance laboratory in a suitcase and transport it to the field for on-site virus sequencing, generating results within 24 hours of sample collection.
The Ebola virus disease epidemic in West Africa is the largest on record, responsible for over 28,599 cases and more than 11,299 deaths
1
. Genome sequencing in viral outbreaks is desirable to characterize the infectious agent and determine its evolutionary rate. Genome sequencing also allows the identification of signatures of host adaptation, identification and monitoring of diagnostic targets, and characterization of responses to vaccines and treatments. The Ebola virus (EBOV) genome substitution rate in the Makona strain has been estimated at between 0.87 × 10
−3
and 1.42 × 10
−3
mutations per site per year. This is equivalent to 16–27 mutations in each genome, meaning that sequences diverge rapidly enough to identify distinct sub-lineages during a prolonged epidemic
2
,
3
,
4
,
5
,
6
,
7
. Genome sequencing provides a high-resolution view of pathogen evolution and is increasingly sought after for outbreak surveillance. Sequence data may be used to guide control measures, but only if the results are generated quickly enough to inform interventions
8
. Genomic surveillance during the epidemic has been sporadic owing to a lack of local sequencing capacity coupled with practical difficulties transporting samples to remote sequencing facilities
9
. To address this problem, here we devise a genomic surveillance system that utilizes a novel nanopore DNA sequencing instrument. In April 2015 this system was transported in standard airline luggage to Guinea and used for real-time genomic surveillance of the ongoing epidemic. We present sequence data and analysis of 142 EBOV samples collected during the period March to October 2015. We were able to generate results less than 24 h after receiving an Ebola-positive sample, with the sequencing process taking as little as 15–60 min. We show that real-time genomic surveillance is possible in resource-limited settings and can be established rapidly to monitor outbreaks.
Journal Article
A head-to-head evaluation of the diagnostic efficacy and costs of trio versus singleton exome sequencing analysis
2019
Diagnostic exome sequencing (ES) can be performed on the proband only (singleton; sES) or with additional samples, often including both biological parents with the proband (trio; tES). In this study we sought to compare the efficiencies of exome sequencing (ES) by trio (tES) versus singleton (sES) approach, determine costs, and identify factors to consider when deciding on optimal implementation strategies for the diagnosis of monogenic disorders. We undertook ES in 30 trios and analysed each proband’s sES and tES data in parallel. Two teams were randomly allocated to either sES or tES analysis for each case and blinded to each other’s work. Each task was timed and cost analyses were based on time taken and diagnostic yield. We modelled three scenarios to determine the factors to consider in the implementation of tES. sES diagnosed 11/30 (36.7%) cases and tES identified one additional diagnosis (12/30 (40.0%)). tES obviated the need for Sanger segregation, reduced the number of variants for curation, and had lower cost-per-diagnosis when considering analysis alone. When sequencing costs were included, tES nearly doubled the cost of sES. Reflexing to tES in those who remain undiagnosed after sES was cost-saving over tES in all as first-line. This approach requires a large differential in diagnostic yield between sES and tES for maximal benefit given current sequencing costs. tES may be preferable when scaling up laboratory throughput due to efficiency gains and opportunity cost considerations. Our findings are relevant to clinicians, laboratories and health services considering tES over sES.
Journal Article
Limitations of next-generation genome sequence assembly
2011
High-throughput sequencing technologies promise to transform the fields of genetics and comparative biology by delivering tens of thousands of genomes in the near future. Although it is feasible to construct
de novo
genome assemblies in a few months, there has been relatively little attention to what is lost by sole application of short sequence reads. We compared the recent
de novo
assemblies using the short oligonucleotide analysis package (SOAP), generated from the genomes of a Han Chinese individual and a Yoruban individual, to experimentally validated genomic features. We found that
de novo
assemblies were 16.2% shorter than the reference genome and that 420.2 megabase pairs of common repeats and 99.1% of validated duplicated sequences were missing from the genome. Consequently, over 2,377 coding exons were completely missing. We conclude that high-quality sequencing approaches must be considered in conjunction with high-throughput sequencing for comparative genomics analyses and studies of genome evolution.
Journal Article
Nanopore Sequencing: From Imagination to Reality
2015
Information about the identity of an analyte is contained within the mean duration of the current blockades, the amplitude of the blockades, and additional characteristics of the blockades, such as an increase in current noise while the analyte is bound. Because signals from related analytes (e.g., various divalent metal ions or structurally related organic molecules) differ, the engineered binding site does not have to be absolutely selective for an analyte, which is often very difficult to achieve. [...]it should not be forgotten that nanopore sensing is a platform technology, readily adapted for the detection of virtually any water-soluble analyte (5 ).
Journal Article
Tackling the widespread and critical impact of batch effects in high-throughput data
by
Baggerly, Keith
,
Scharpf, Robert B.
,
Irizarry, Rafael A.
in
631/1647/1513
,
631/1647/48
,
Agriculture
2010
Batch effects can lead to incorrect biological conclusions but are not widely considered. The authors show that batch effects are relevant to a range of high-throughput 'omics' data sets and are crucial to address. They also explain how batch effects can be mitigated.
High-throughput technologies are widely used, for example to assay genetic variants, gene and protein expression, and epigenetic modifications. One often overlooked complication with such studies is batch effects, which occur because measurements are affected by laboratory conditions, reagent lots and personnel differences. This becomes a major problem when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. Using both published studies and our own analyses, we argue that batch effects (as well as other technical and biological artefacts) are widespread and critical to address. We review experimental and computational approaches for doing so.
Journal Article
The gut microbiota associated with high‐Gleason prostate cancer
2021
We have found that intestinal bacteria and their metabolites, short‐chain fatty acids (SCFAs), promote cancer growth in prostate cancer (PCa) mouse models. To clarify the association between gut microbiota and PCa in humans, we analyzed the gut microbiota profiles of men with suspected PCa. One hundred and fifty‐two Japanese men undergoing prostate biopsies (96 with cancer and 56 without cancer) were included in the study and randomly divided into two cohorts: a discovery cohort (114 samples) and a test cohort (38 samples). The gut microbiota was compared between two groups, a high‐risk group (men with Grade group 2 or higher PCa) and a negative + low‐risk group (men with negative biopsy or Grade group 1 PCa), using 16S rRNA gene sequencing. The relative abundances of Rikenellaceae, Alistipes, and Lachnospira, all SCFA‐producing bacteria, were significantly increased in high‐risk group. In receiver operating characteristic curve analysis, the index calculated from the abundance of 18 bacterial genera which were selected by least absolute shrinkage and selection operator regression detected high‐risk PCa in the discovery cohort with higher accuracy than the prostate specific antigen test (area under the curve [AUC] = 0.85 vs 0.74). Validation of the index in the test cohort showed similar results (AUC = 0.81 vs 0.67). The specific bacterial taxa were associated with high‐risk PCa. The gut microbiota profile could be a novel useful marker for the detection of high‐risk PCa and could contribute to the carcinogenesis of PCa. To clarify the association between the gut microbiota and prostate cancer, we analyzed the gut microbiota profiles of 152 men undergoing prostate biopsy. The abundances of Rikenellaceae, Alistipes, and Lachnospira were significantly increased in men with high‐risk cancer. The index calculated from the abundance of 18 bacterial genera detected high‐risk cancer with higher accuracy than the prostate specific antigen test.
Journal Article