Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Validation and assessment of variant calling pipelines for next-generation sequencing
by
Kramer, Melissa
, Zandi, Peter P
, Parla, Jennifer
, Potash, James B
, Pirooznia, Mehdi
, Goes, Fernando S
, McCombie, W Richard
in
Bioinformatics
/ Biomedical and Life Sciences
/ Biomedicine
/ Bipolar Disorder - genetics
/ Data Interpretation, Statistical
/ Exome
/ High-Throughput Nucleotide Sequencing
/ Human Genetics
/ Humans
/ Polymorphism, Single Nucleotide
/ Primary Research
/ Proteomics
/ Sequence Analysis, DNA
/ Software
2014
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Validation and assessment of variant calling pipelines for next-generation sequencing
by
Kramer, Melissa
, Zandi, Peter P
, Parla, Jennifer
, Potash, James B
, Pirooznia, Mehdi
, Goes, Fernando S
, McCombie, W Richard
in
Bioinformatics
/ Biomedical and Life Sciences
/ Biomedicine
/ Bipolar Disorder - genetics
/ Data Interpretation, Statistical
/ Exome
/ High-Throughput Nucleotide Sequencing
/ Human Genetics
/ Humans
/ Polymorphism, Single Nucleotide
/ Primary Research
/ Proteomics
/ Sequence Analysis, DNA
/ Software
2014
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Validation and assessment of variant calling pipelines for next-generation sequencing
by
Kramer, Melissa
, Zandi, Peter P
, Parla, Jennifer
, Potash, James B
, Pirooznia, Mehdi
, Goes, Fernando S
, McCombie, W Richard
in
Bioinformatics
/ Biomedical and Life Sciences
/ Biomedicine
/ Bipolar Disorder - genetics
/ Data Interpretation, Statistical
/ Exome
/ High-Throughput Nucleotide Sequencing
/ Human Genetics
/ Humans
/ Polymorphism, Single Nucleotide
/ Primary Research
/ Proteomics
/ Sequence Analysis, DNA
/ Software
2014
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Validation and assessment of variant calling pipelines for next-generation sequencing
Journal Article
Validation and assessment of variant calling pipelines for next-generation sequencing
2014
Request Book From Autostore
and Choose the Collection Method
Overview
Background
The processing and analysis of the large scale data generated by next-generation sequencing (NGS) experiments is challenging and is a burgeoning area of new methods development. Several new bioinformatics tools have been developed for calling sequence variants from NGS data. Here, we validate the variant calling of these tools and compare their relative accuracy to determine which data processing pipeline is optimal.
Results
We developed a unified pipeline for processing NGS data that encompasses four modules: mapping, filtering, realignment and recalibration, and variant calling. We processed 130 subjects from an ongoing whole exome sequencing study through this pipeline. To evaluate the accuracy of each module, we conducted a series of comparisons between the single nucleotide variant (SNV) calls from the NGS data and either gold-standard Sanger sequencing on a total of 700 variants or array genotyping data on a total of 9,935 single-nucleotide polymorphisms. A head to head comparison showed that Genome Analysis Toolkit (GATK) provided more accurate calls than SAMtools (positive predictive value of 92.55% vs. 80.35%, respectively). Realignment of mapped reads and recalibration of base quality scores before SNV calling proved to be crucial to accurate variant calling. GATK HaplotypeCaller algorithm for variant calling outperformed the UnifiedGenotype algorithm. We also showed a relationship between mapping quality, read depth and allele balance, and SNV call accuracy. However, if best practices are used in data processing, then additional filtering based on these metrics provides little gains and accuracies of >99% are achievable.
Conclusions
Our findings will help to determine the best approach for processing NGS data to confidently call variants for downstream analyses. To enable others to implement and replicate our results, all of our codes are freely available at
http://metamoodics.org/wes
.
Publisher
BioMed Central,Springer Nature B.V
This website uses cookies to ensure you get the best experience on our website.