Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing
by
Orenstein, Yaron
, Pellow, David
, Kingsford, Carl
, Marçais, Guillaume
, Shamir, Ron
in
Algorithms
/ Animals
/ Bioinformatics
/ Biology
/ Biology and Life Sciences
/ Caenorhabditis elegans - genetics
/ Computational Biology - methods
/ Computer and Information Sciences
/ Computer Heuristics
/ Computer science
/ Data structures
/ Docks
/ Funding
/ Genome, Bacterial
/ Genome, Human
/ Genomes
/ Genomics
/ Heuristic
/ High-Throughput Nucleotide Sequencing - methods
/ Humans
/ Integers
/ International conferences
/ Mathematical analysis
/ Methods
/ Next-generation sequencing
/ Physical Sciences
/ Problem solving
/ Research and Analysis Methods
/ Sequence Analysis, DNA - methods
/ Software
2017
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing
by
Orenstein, Yaron
, Pellow, David
, Kingsford, Carl
, Marçais, Guillaume
, Shamir, Ron
in
Algorithms
/ Animals
/ Bioinformatics
/ Biology
/ Biology and Life Sciences
/ Caenorhabditis elegans - genetics
/ Computational Biology - methods
/ Computer and Information Sciences
/ Computer Heuristics
/ Computer science
/ Data structures
/ Docks
/ Funding
/ Genome, Bacterial
/ Genome, Human
/ Genomes
/ Genomics
/ Heuristic
/ High-Throughput Nucleotide Sequencing - methods
/ Humans
/ Integers
/ International conferences
/ Mathematical analysis
/ Methods
/ Next-generation sequencing
/ Physical Sciences
/ Problem solving
/ Research and Analysis Methods
/ Sequence Analysis, DNA - methods
/ Software
2017
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing
by
Orenstein, Yaron
, Pellow, David
, Kingsford, Carl
, Marçais, Guillaume
, Shamir, Ron
in
Algorithms
/ Animals
/ Bioinformatics
/ Biology
/ Biology and Life Sciences
/ Caenorhabditis elegans - genetics
/ Computational Biology - methods
/ Computer and Information Sciences
/ Computer Heuristics
/ Computer science
/ Data structures
/ Docks
/ Funding
/ Genome, Bacterial
/ Genome, Human
/ Genomes
/ Genomics
/ Heuristic
/ High-Throughput Nucleotide Sequencing - methods
/ Humans
/ Integers
/ International conferences
/ Mathematical analysis
/ Methods
/ Next-generation sequencing
/ Physical Sciences
/ Problem solving
/ Research and Analysis Methods
/ Sequence Analysis, DNA - methods
/ Software
2017
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing
Journal Article
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing
2017
Request Book From Autostore
and Choose the Collection Method
Overview
With the rapidly increasing volume of deep sequencing data, more efficient algorithms and data structures are needed. Minimizers are a central recent paradigm that has improved various sequence analysis tasks, including hashing for faster read overlap detection, sparse suffix arrays for creating smaller indexes, and Bloom filters for speeding up sequence search. Here, we propose an alternative paradigm that can lead to substantial further improvement in these and other tasks. For integers k and L > k, we say that a set of k-mers is a universal hitting set (UHS) if every possible L-long sequence must contain a k-mer from the set. We develop a heuristic called DOCKS to find a compact UHS, which works in two phases: The first phase is solved optimally, and for the second we propose several efficient heuristics, trading set size for speed and memory. The use of heuristics is motivated by showing the NP-hardness of a closely related problem. We show that DOCKS works well in practice and produces UHSs that are very close to a theoretical lower bound. We present results for various values of k and L and by applying them to real genomes show that UHSs indeed improve over minimizers. In particular, DOCKS uses less than 30% of the 10-mers needed to span the human genome compared to minimizers. The software and computed UHSs are freely available at github.com/Shamir-Lab/DOCKS/ and acgt.cs.tau.ac.il/docks/, respectively.
Publisher
Public Library of Science,Public Library of Science (PLoS)
Subject
/ Animals
/ Biology
/ Caenorhabditis elegans - genetics
/ Computational Biology - methods
/ Computer and Information Sciences
/ Docks
/ Funding
/ Genomes
/ Genomics
/ High-Throughput Nucleotide Sequencing - methods
/ Humans
/ Integers
/ Methods
/ Research and Analysis Methods
/ Sequence Analysis, DNA - methods
/ Software
MBRLCatalogueRelatedBooks
Related Items
Related Items
This website uses cookies to ensure you get the best experience on our website.