Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
1,876
result(s) for
"phylogenomics"
Sort by:
Improved phylogenetic resolution within the Neotropical rainforest genus Zygia (Mimoseae, Fabaceae) using phylogenomic data
2026
IntroductionThe Neotropical legume genus Zygia consists of approximately 60 species of small, cauliflorous trees. Phylogenetic relationships within Zygia are not fully known since previous phylogenies were for the most part poorly supported and unresolved with non-monophyletic species. Previous studies have shown that the genus Marmaroxylon is nested within Zygia, but a full sample of Marmaroxylon has yet to be included in analyses. This study aims to resolve taxonomic limits of Zygia and to test if phylogenomic data and more sampling of taxa improve resolution within the genus.MethodsWe utilize data from 1,315 nuclear loci sequenced from 134 accessions, representing 47 species of Zygia, as well as four species of the closely related genus Macrosamanea, the poorly known Marmaroxylon magdalenae, endemic to the Magdalena Valley in Colombia, and Marmaroxylon eperuetorum, which is the last remaining species of Marmaroxylon not tested with DNA data. The data are analyzed using species tree reconstruction based on the multispecies coalescent model and maximum likelihood analyses of concatenated datasets.ResultsOur phylogenetic analysis showed that the core Zygia clade is sister to a clade consisting of Macrosamanea, Zygia turneri, M. magdalenae, Inga, Ingopsis, and Pseudocojoba. Zygia is non-monophyletic since Z. turneri is more closely related to M. magdalenae and Macrosamanea than to the remaining species of Zygia. Marmaroxylon eperuetorum is found within the core Zygia clade. Resolution within Zygia clearly improved with phylogenomic data and denser sampling of taxa, but non-monophyletic species are still present.DiscussionThe core Zygia clade is strongly supported statistically and relationships within it are well-resolved. However, 13 species of Zygia are non-monophyletic, and the reasons for this are not yet fully known but could be due to, e.g., misidentification, hybridization, or incomplete lineage sorting. Both species trees reconstructed based on the multispecies coalescent model and phylogenetic trees inferred from maximum likelihood analyses of the concatenated datasets show similar topologies. Gene tree discordance is high for many branches within Zygia and is, in most cases, due to incomplete lineage sorting. To render Zygia monophyletic, new combinations in Macrosamanea are given for Z. turneri and Marmaroxylon magdalenae.
Journal Article
IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era
2020
IQ-TREE (http://www.iqtree.org, last accessed February 6, 2020) is a user-friendly and widely used software package for phylogenetic inference using maximum likelihood. Since the release of version 1 in 2014, we have continuously expanded IQ-TREE to integrate a plethora of new models of sequence evolution and efficient computational approaches of phylogenetic inference to deal with genomic data. Here, we describe notable features of IQ-TREE version 2 and highlight the key advantages over other software.
Journal Article
MEGA12: Molecular Evolutionary Genetic Analysis Version 12 for Adaptive and Green Computing
2024
Abstract
We introduce the 12th version of the Molecular Evolutionary Genetics Analysis (MEGA12) software. This latest version brings many significant improvements by reducing the computational time needed for selecting optimal substitution models and conducting bootstrap tests on phylogenies using maximum likelihood (ML) methods. These improvements are achieved by implementing heuristics that minimize likely unnecessary computations. Analyses of empirical and simulated datasets show substantial time savings by using these heuristics without compromising the accuracy of results. MEGA12 also links-in an evolutionary sparse learning approach to identify fragile clades and associated sequences in evolutionary trees inferred through phylogenomic analyses. In addition, this version includes fine-grained parallelization for ML analyses, support for high-resolution monitors, and an enhanced Tree Explorer. MEGA12 can be downloaded from https://www.megasoftware.net.
Journal Article
New Methods to Calculate Concordance Factors for Phylogenomic Datasets
2020
We implement two measures for quantifying genealogical concordance in phylogenomic data sets: the gene concordance factor (gCF) and the novel site concordance factor (sCF). For every branch of a reference tree, gCF is defined as the percentage of “decisive” gene trees containing that branch. This measure is already in wide usage, but here we introduce a package that calculates it while accounting for variable taxon coverage among gene trees. sCF is a new measure defined as the percentage of decisive sites supporting a branch in the reference tree. gCF and sCF complement classical measures of branch support in phylogenetics by providing a full description of underlying disagreement among loci and sites. An easy to use implementation and tutorial is freely available in the IQ-TREE software package (http://www.iqtree.org/doc/Concordance-Factor, last accessed May 13, 2020).
Journal Article
High-Throughput Genomic Data in Systematics and Phylogenetics
by
Lemmon, Emily Moriarty
,
Lemmon, Alan R.
in
Accuracy
,
Biological taxonomies
,
Comparative analysis
2013
High-throughput genomic sequencing is rapidly changing the field of phylogenetics by decreasing the cost and increasing the quantity and rate of data collection by several orders of magnitude. This deluge of data is exerting tremendous pressure on downstream data-analysis methods providing new opportunities for method development. In this review, we present (a) recent advances in laboratory methods for collection of high-throughput phylogenetic data and (b) challenges and constraints for phylogenetic analysis of these data. We compare the merits of multiple laboratory approaches, compare methods of data analysis, and offer recommendations for the most promising protocols and data-analysis workflows currently available for phylogenetics. We also discuss several strategies for increasing accuracy, with an emphasis on locus selection and proper model choice.
Journal Article
Disentangling Sources of Gene Tree Discordance in Phylogenomic Data Sets
by
Kadereit, Gudrun
,
Tefarikis, Delphine T.
,
Yim, Won C.
in
Amaranthaceae
,
BASIC BIOLOGICAL SCIENCES
,
Data processing
2021
Gene tree discordance in large genomic data sets can be caused by evolutionary processes such as incomplete lineage sorting and hybridization, as well as model violation, and errors in data processing, orthology inference, and gene tree estimation. Species tree methods that identify and accommodate all sources of conflict are not available, but a combination of multiple approaches can help tease apart alternative sources of conflict. Here, using a phylotranscriptomic analysis in combination with reference genomes, we test a hypothesis of ancient hybridization events within the plant family Amaranthaceae s.l. thatwas previously supported bymorphological, ecological, and Sanger-based molecular data. The data set included seven genomes and 88 transcriptomes, 17 generated for this study. We examined gene-tree discordance using coalescent-based species trees and network inference, gene tree discordance analyses, site pattern tests of introgression, topology tests, synteny analyses, and simulations. We found that a combination of processes might have generated the high levels of gene tree discordance in the backbone of Amaranthaceae s.l. Furthermore, we found evidence that three consecutive short internal branches produce anomalous trees contributing to the discordance. Overall, our results suggest that Amaranthaceae s.l. might be a product of an ancient and rapid lineage diversification, and remains, and probably will remain, unresolved. This work highlights the potential problems of identifiability associated with the sources of gene tree discordance including, in particular, phylogenetic network methods. Our results also demonstrate the importance of thoroughly testing for multiple sources of conflict in phylogenomic analyses, especially in the context of ancient, rapid radiations. We provide several recommendations for exploring conflicting signals in such situations.
Journal Article
Revisiting metazoan phylogeny with genomic sampling of all phyla
by
Combosch, David
,
Laumer, Christopher E.
,
Fernández, Rosa
in
Animals
,
Classification
,
Evolution
2019
Proper biological interpretation of a phylogeny can sometimes hinge on the placement of key taxa—or fail when such key taxa are not sampled. In this light, we here present the first attempt to investigate (though not conclusively resolve) animal relationships using genome-scale data from all phyla. Results from the site-heterogeneous CAT + GTR model recapitulate many established major clades, and strongly confirm some recent discoveries, such as a monophyletic Lophophorata, and a sister group relationship between Gnathifera and Chaetognatha, raising continued questions on the nature of the spiralian ancestor. We also explore matrix construction with an eye towards testing specific relationships; this approach uniquely recovers support for Panarthropoda, and shows that Lophotrochozoa (a subclade of Spiralia) can be constructed in strongly conflicting ways using different taxon- and/or orthologue sets. Dayhoff-6 recoding sacrifices information, but can also reveal surprising outcomes, e.g. full support for a clade of Lophophorata and Entoprocta + Cycliophora, a clade of Placozoa + Cnidaria, and raising support for Ctenophora as sister group to the remaining Metazoa, in a manner dependent on the gene and/or taxon sampling of the matrix in question. Future work should test the hypothesis that the few remaining uncertainties in animal phylogeny might reflect violations of the various stationarity assumptions used in contemporary inference methods.
Journal Article
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees
2018
Background
Evolutionary histories can be discordant across the genome, and such discordances need to be considered in reconstructing the species phylogeny. ASTRAL is one of the leading methods for inferring species trees from gene trees while accounting for gene tree discordance. ASTRAL uses dynamic programming to search for the tree that shares the maximum number of quartet topologies with input gene trees, restricting itself to a predefined set of bipartitions.
Results
We introduce ASTRAL-III, which substantially improves the running time of ASTRAL-II and guarantees polynomial running time as a function of both the number of species (
n
) and the number of genes (
k
). ASTRAL-III limits the bipartition constraint set (
X
) to grow at most linearly with
n
and
k
. Moreover, it handles polytomies more efficiently than ASTRAL-II, exploits similarities between gene trees better, and uses several techniques to avoid searching parts of the search space that are mathematically guaranteed not to include the optimal tree. The asymptotic running time of ASTRAL-III in the presence of polytomies is
O
(
nk
)
1.726
D
where
D
=
O
(
nk
) is the sum of degrees of all
unique
nodes in input trees. The running time improvements enable us to test whether contracting low support branches in gene trees improves the accuracy by reducing noise. In extensive simulations, we show that removing branches with
very
low support (e.g., below 10%) improves accuracy while overly aggressive filtering is harmful. We observe on a biological avian phylogenomic dataset of 14K genes that contracting low support branches greatly improve results.
Conclusions
ASTRAL-III is a faster version of the ASTRAL method for phylogenetic reconstruction and can scale up to 10,000 species. With ASTRAL-III, low support branches can be removed, resulting in improved accuracy.
Journal Article
MACSE v2: Toolkit for the Alignment of Coding Sequences Accounting for Frameshifts and Stop Codons
by
Chantret, Nathalie
,
Ranwez, Vincent
,
Douzery, Emmanuel J P
in
Algorithms
,
Alignment
,
Amino acids
2018
Multiple sequence alignment is a prerequisite for many evolutionary analyses. Multiple Alignment of Coding Sequences (MACSE) is a multiple sequence alignment program that explicitly accounts for the underlying codon structure of protein-coding nucleotide sequences. Its unique characteristic allows building reliable codon alignments even in the presence of frameshifts. This facilitates downstream analyses such as selection pressure estimation based on the ratio of nonsynonymous to synonymous substitutions. Here, we present MACSE v2, a major update with an improved version of the initial algorithm enriched with a complete toolkit to handle multiple alignments of protein-coding sequences. A graphical interface now provides user-friendly access to the different subprograms.
Journal Article
Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data
by
Sun, Ying
,
Venkatesh, Byrappa
,
Zhou, Zhuocheng
in
Animals
,
Bioinformatics
,
Biological Sciences
2018
Our understanding of phylogenetic relationships among bony fishes has been transformed by analysis of a small number of genes, but uncertainty remains around critical nodes. Genome-scale inferences so far have sampled a limited number of taxa and genes. Here we leveraged 144 genomes and 159 transcriptomes to investigate fish evolution with an unparalleled scale of data: >0.5 Mb from 1,105 orthologous exon sequences from 303 species, representing 66 out of 72 ray-finned fish orders. We apply phylogenetic tests designed to trace the effect of whole-genome duplication events on gene trees and find paralogy-free loci using a bioinformatics approach. Genome-wide data support the structure of the fish phylogeny, and hypothesis-testing procedures appropriate for phylogenomic datasets using explicit gene genealogy interrogation settle some long-standing uncertainties, such as the branching order at the base of the teleosts and among early euteleosts, and the sister lineage to the acanthomorph and percomorph radiations. Comprehensive fossil calibrations date the origin of all major fish lineages before the end of the Cretaceous.
Journal Article