Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
PhyloMissForest: a random forest framework to construct phylogenetic trees with missing data
by
Santander-Jimenéz, Sergio
, Pinheiro, Diogo
, Ilic, Aleksandar
in
Algorithms
/ Amino acid sequence
/ Amino Acids
/ Analysis
/ Animal Genetics and Genomics
/ Biodiversity
/ Biological evolution
/ Biomedical and Life Sciences
/ Deoxyribonucleic acid
/ DNA
/ Evolution
/ Genomics
/ Hypotheses
/ Learning algorithms
/ Life Sciences
/ Machine learning
/ Methods
/ Microarrays
/ Microbial Genetics and Genomics
/ Missing data
/ Missing data imputation
/ Missing observations (Statistics)
/ Nucleotide sequence
/ Phylogenetic tree
/ Phylogenetic trees
/ Phylogenetics
/ Phylogeny
/ Plant Genetics and Genomics
/ Proteomics
/ Random forest
/ Research methodology
/ Robustness
/ Search methods
/ Sequence Alignment
/ Severe acute respiratory syndrome coronavirus 2
/ Statistical analysis
/ Taxonomy
/ Wildlife conservation
2022
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
PhyloMissForest: a random forest framework to construct phylogenetic trees with missing data
by
Santander-Jimenéz, Sergio
, Pinheiro, Diogo
, Ilic, Aleksandar
in
Algorithms
/ Amino acid sequence
/ Amino Acids
/ Analysis
/ Animal Genetics and Genomics
/ Biodiversity
/ Biological evolution
/ Biomedical and Life Sciences
/ Deoxyribonucleic acid
/ DNA
/ Evolution
/ Genomics
/ Hypotheses
/ Learning algorithms
/ Life Sciences
/ Machine learning
/ Methods
/ Microarrays
/ Microbial Genetics and Genomics
/ Missing data
/ Missing data imputation
/ Missing observations (Statistics)
/ Nucleotide sequence
/ Phylogenetic tree
/ Phylogenetic trees
/ Phylogenetics
/ Phylogeny
/ Plant Genetics and Genomics
/ Proteomics
/ Random forest
/ Research methodology
/ Robustness
/ Search methods
/ Sequence Alignment
/ Severe acute respiratory syndrome coronavirus 2
/ Statistical analysis
/ Taxonomy
/ Wildlife conservation
2022
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
PhyloMissForest: a random forest framework to construct phylogenetic trees with missing data
by
Santander-Jimenéz, Sergio
, Pinheiro, Diogo
, Ilic, Aleksandar
in
Algorithms
/ Amino acid sequence
/ Amino Acids
/ Analysis
/ Animal Genetics and Genomics
/ Biodiversity
/ Biological evolution
/ Biomedical and Life Sciences
/ Deoxyribonucleic acid
/ DNA
/ Evolution
/ Genomics
/ Hypotheses
/ Learning algorithms
/ Life Sciences
/ Machine learning
/ Methods
/ Microarrays
/ Microbial Genetics and Genomics
/ Missing data
/ Missing data imputation
/ Missing observations (Statistics)
/ Nucleotide sequence
/ Phylogenetic tree
/ Phylogenetic trees
/ Phylogenetics
/ Phylogeny
/ Plant Genetics and Genomics
/ Proteomics
/ Random forest
/ Research methodology
/ Robustness
/ Search methods
/ Sequence Alignment
/ Severe acute respiratory syndrome coronavirus 2
/ Statistical analysis
/ Taxonomy
/ Wildlife conservation
2022
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
PhyloMissForest: a random forest framework to construct phylogenetic trees with missing data
Journal Article
PhyloMissForest: a random forest framework to construct phylogenetic trees with missing data
2022
Request Book From Autostore
and Choose the Collection Method
Overview
Background
In the pursuit of a better understanding of biodiversity, evolutionary biologists rely on the study of phylogenetic relationships to illustrate the course of evolution. The relationships among natural organisms, depicted in the shape of phylogenetic trees, not only help to understand evolutionary history but also have a wide range of additional applications in science. One of the most challenging problems that arise when building phylogenetic trees is the presence of missing biological data. More specifically, the possibility of inferring wrong phylogenetic trees increases proportionally to the amount of missing values in the input data. Although there are methods proposed to deal with this issue, their applicability and accuracy is often restricted by different constraints.
Results
We propose a framework, called PhyloMissForest, to impute missing entries in phylogenetic distance matrices and infer accurate evolutionary relationships. PhyloMissForest is built upon a random forest structure that infers the missing entries of the input data, based on the known parts of it. PhyloMissForest contributes with a robust and configurable framework that incorporates multiple search strategies and machine learning, complemented by phylogenetic techniques, to provide a more accurate inference of lost phylogenetic distances. We evaluate our framework by examining three real-world datasets, two DNA-based sequence alignments and one containing amino acid data, and two additional instances with simulated DNA data. Moreover, we follow a design of experiments methodology to define the hyperparameter values of our algorithm, which is a concise method, preferable in comparison to the well-known exhaustive parameters search. By varying the percentages of missing data from 5% to 60%, we generally outperform the state-of-the-art alternative imputation techniques in the tests conducted on real DNA data. In addition, significant improvements in execution time are observed for the amino acid instance. The results observed on simulated data also denote the attainment of improved imputations when dealing with large percentages of missing data.
Conclusions
By merging multiple search strategies, machine learning, and phylogenetic techniques, PhyloMissForest provides a highly customizable and robust framework for phylogenetic missing data imputation, with significant topological accuracy and effective speedups over the state of the art.
Publisher
BioMed Central,BioMed Central Ltd,Springer Nature B.V,BMC
This website uses cookies to ensure you get the best experience on our website.