Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
355
result(s) for
"genetic reference database"
Sort by:
The global depth range of marine fishes and their genetic coverage for environmental DNA metabarcoding
by
MARine Biodiversity Exploitation and Conservation - MARBEC (UMR MARBEC) ; Institut de Recherche pour le Développement (IRD)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)
,
Duhamet, Agnès
,
Swiss Federal Institute for Forest, Snow and Landscape Research WSL
in
Applied Ecology
,
Biodiversity
,
Biodiversity and Ecology
2023
The bathymetric and geographical distribution of marine species represent a key information in biodiversity conservation. Yet, deep-sea ecosystems are among the least explored on Earth and are increasingly impacted by human activities. Environmental DNA (eDNA) metabarcoding has emerged as a promising method to study fish biodiversity but applications to the deep-sea are still scarce. A major limitation in the application of eDNA metabarcoding is the incompleteness of species sequences available in public genetic databases which reduces the extent of detected species. This incompleteness by depth is still unknown. Here, we built the global bathymetric and geographical distribution of 10,826 actinopterygian and 960 chondrichthyan fish species. We assessed their genetic coverage by depth and by ocean for three main metabarcoding markers used in the literature: teleo and MiFish-U/E. We also estimated the number of primer mismatches per species amplified by in silico polymerase chain reaction which influence the probability of species detection. Actinopterygians show a stronger decrease in species richness with depth than Chondrichthyans. These richness gradients are accompanied by a continuous species turnover between depths. Fish species coverage with the MiFish-U/E markers is higher than with teleo while threatened species are more sequenced than the others. “Deep-endemic” species, those not ascending to the shallow depth layer, are less sequenced than not threatened species. The number of primer mismatches is not higher for deep-sea species than for shallower ones. eDNA metabarcoding is promising for species detection in the deep-sea to better account for the 3-dimensional structure of the ocean in marine biodiversity monitoring and conservation. However, we argue that sequencing efforts on “deep-endemic” species are needed.
Journal Article
An efficient reverse genetics platform in the model legume Medicago truncatula
2014
Medicago truncatula is one of the model species for legume studies. In an effort to develop legume genetics resources, > 21 700 Tnt1 retrotransposon insertion lines have been generated.
To facilitate fast-growing needs in functional genomics, two reverse genetics approaches have been established: web-based database searching and PCR-based reverse screening. More than 840 genes have been reverse screened using the PCR-based approach over the past 6 yr to identify mutants in these genes. Overall, c. 84% (705 genes) success rate was achieved in identifying mutants with at least one Tnt1 insertion, of which c. 50% (358 genes) had three or more alleles.
To demonstrate the utility of the two reverse genetics platforms, two mutant alleles were isolated for each of the two floral homeotic MADS-box genes, MtPISTILATA and MtAGAMOUS. Molecular and genetic analyses indicate that Tnt1 insertions in exons of both genes are responsible for the defects in floral organ development.
In summary, we have developed two efficient reverse genetics platforms to facilitate functional characterization of M. truncatula genes.
Journal Article
Predicting disease genes using protein–protein interactions
2006
Background: The responsible genes have not yet been identified for many genetically mapped disease loci. Physically interacting proteins tend to be involved in the same cellular process, and mutations in their genes may lead to similar disease phenotypes. Objective: To investigate whether protein–protein interactions can predict genes for genetically heterogeneous diseases. Methods: 72 940 protein–protein interactions between 10 894 human proteins were used to search 432 loci for candidate disease genes representing 383 genetically heterogeneous hereditary diseases. For each disease, the protein interaction partners of its known causative genes were compared with the disease associated loci lacking identified causative genes. Interaction partners located within such loci were considered candidate disease gene predictions. Prediction accuracy was tested using a benchmark set of known disease genes. Results: Almost 300 candidate disease gene predictions were made. Some of these have since been confirmed. On average, 10% or more are expected to be genuine disease genes, representing a 10-fold enrichment compared with positional information only. Examples of interesting candidates are AKAP6 for arrythmogenic right ventricular dysplasia 3 and SYN3 for familial partial epilepsy with variable foci. Conclusions: Exploiting protein–protein interactions can greatly increase the likelihood of finding positional candidate disease genes. When applied on a large scale they can lead to novel candidate gene predictions.
Journal Article
RefSeq database growth influences the accuracy of k-mer-based lowest common ancestor species identification
by
Treangen, Todd J.
,
Phillippy, Adam M.
,
Koren, Sergey
in
ancestry
,
Animal Genetics and Genomics
,
Bayesian analysis
2018
In order to determine the role of the database in taxonomic sequence classification, we examine the influence of the database over time on
k
-mer-based lowest common ancestor taxonomic classification. We present three major findings: the number of new species added to the NCBI RefSeq database greatly outpaces the number of new genera; as a result, more reads are classified with newer database versions, but fewer are classified at the species level; and Bayesian-based re-estimation mitigates this effect but struggles with novel genomes. These results suggest a need for new classification approaches specially adapted for large databases.
Journal Article
Increasing African genomic data generation and sharing to resolve rare and undiagnosed diseases in Africa: a call-to-action by the H3Africa rare diseases working group
2022
The rich and diverse genomics of African populations is significantly underrepresented in reference and in disease-associated databases. This renders interpreting the Next Generation Sequencing (NGS) data and reaching a diagnostic more difficult in Africa and for the African diaspora. It increases chances for false positives with variants being misclassified as pathogenic due to their novelty or rarity. We can increase African genomic data by (1) making consent for sharing aggregate frequency data an essential component of research toolkit; (2) encouraging investigators with African data to share available data through public resources such as gnomAD, AVGD, ClinVar, DECIPHER and to use MatchMaker Exchange; (3) educating African research participants on the meaning and value of sharing aggregate frequency data; and (4) increasing funding to scale-up the production of African genomic data that will be more representative of the geographical and ethno-linguistic variation on the continent. The RDWG of H3Africa is hereby calling to action because this underrepresentation accentuates the health disparities. Applying the NGS to shorten the diagnostic odyssey or to guide therapeutic options for rare diseases will fully work for Africans only when public repositories include sufficient data from African subjects.
Journal Article
Cultivation and sequencing of rumen microbiome members from the Hungate1000 Collection
2018
Rumen microbiome biology gets a boost with the release of 410 high-quality reference genomes from the Hungate1000 project.
Productivity of ruminant livestock depends on the rumen microbiota, which ferment indigestible plant polysaccharides into nutrients used for growth. Understanding the functions carried out by the rumen microbiota is important for reducing greenhouse gas production by ruminants and for developing biofuels from lignocellulose. We present 410 cultured bacteria and archaea, together with their reference genomes, representing every cultivated rumen-associated archaeal and bacterial family. We evaluate polysaccharide degradation, short-chain fatty acid production and methanogenesis pathways, and assign specific taxa to functions. A total of 336 organisms were present in available rumen metagenomic data sets, and 134 were present in human gut microbiome data sets. Comparison with the human microbiome revealed rumen-specific enrichment for genes encoding
de novo
synthesis of vitamin B
12
, ongoing evolution by gene loss and potential vertical inheritance of the rumen microbiome based on underrepresentation of markers of environmental stress. We estimate that our Hungate genome resource represents ∼75% of the genus-level bacterial and archaeal taxa present in the rumen.
Journal Article
NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants
by
Shin, Jong-Yeon
,
Kim, Sungjae
,
Kim, Hie Lim
in
Asian Continental Ancestry Group - genetics
,
Asians
,
Base sequence
2019
Here, we present the Northeast Asian Reference Database (NARD), including whole-genome sequencing data of 1779 individuals from Korea, Mongolia, Japan, China, and Hong Kong. NARD provides the genetic diversity of Korean (
n
= 850) and Mongolian (
n
= 384) ancestries that were not present in the 1000 Genomes Project Phase 3 (1KGP3). We combined and re-phased the genotypes from NARD and 1KGP3 to construct a union set of haplotypes. This approach established a robust imputation reference panel for Northeast Asians, which yields the greatest imputation accuracy of rare and low-frequency variants compared with the existing panels. NARD imputation panel is available at
https://nard.macrogen.com/
.
Journal Article
The use of taxon-specific reference databases compromises metagenomic classification
by
R. Marcelino, Vanessa
,
Sorrell, Tania C.
,
Holmes, Edward C.
in
Amphibians
,
Amphibians - classification
,
Amphibians - genetics
2020
A recent article in
BMC Genomics
describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan’s fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
Journal Article
GAPeDNA: Assessing and mapping global species gaps in genetic databases for eDNA metabarcoding
by
MARine Biodiversity Exploitation and Conservation (UMR MARBEC) ; Institut de Recherche pour le Développement (IRD)-Institut Français de Recherche pour l'Exploitation de la Mer (IFREMER)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)
,
Dejean, Tony
,
Milhau, Tristan
in
Archives & records
,
Biodiversity
,
Biodiversity and Ecology
2021
Aim: Environmental DNA metabarcoding has recently emerged as a non-invasive tool for aquatic biodiversity inventories, frequently surpassing traditional methods for detecting a wide range of taxa in most habitats. The major limitation currently impairing the large-scale application of eDNA-based inventories is the lack of species sequences available in public genetic databases. Unfortunately, these gaps are still unknown spatially and taxonomically, hindering targeted future sequencing efforts. Innovation: We propose GAPeDNA, a user-friendly web interface that provides a global overview of genetic database completeness for a given taxon across space and conservation status. As an application, we synthetized data from regional checklists for marine and freshwater fishes along with their IUCN conservation status to provide global maps of species coverage using the European Nucleotide Archive public reference database for 19 metabarcoding primers. This tool automatizes the scanning of gaps in these databases to guide future sequencing efforts and support the deployment of eDNA inventories at larger scale. This tool is flexible and can be expanded to other taxa and primers upon data availability. Main conclusions: Using our global fish case study, we show that gaps increase towards the tropics where species diversity and the number of threatened species are the highest. It highlights priority areas for fish sequencing like the Congo, the Mekong and the Mississippi freshwater basins which host more than 60 non-sequenced threatened fish species. For marine fishes, the Caribbean and East Africa host up to 42 non-sequenced threatened species. By presenting the global genetic database completeness for several primers on any taxa and building an open-access, updatable and flexible tool, GAPeDNA appears as a valuable contribution to support any kind of eDNA metabarcoding study.
Journal Article