Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
      More Filters
      Clear All
      More Filters
      Source
    • Language
355 result(s) for "genetic reference database"
Sort by:
The global depth range of marine fishes and their genetic coverage for environmental DNA metabarcoding
The bathymetric and geographical distribution of marine species represent a key information in biodiversity conservation. Yet, deep-sea ecosystems are among the least explored on Earth and are increasingly impacted by human activities. Environmental DNA (eDNA) metabarcoding has emerged as a promising method to study fish biodiversity but applications to the deep-sea are still scarce. A major limitation in the application of eDNA metabarcoding is the incompleteness of species sequences available in public genetic databases which reduces the extent of detected species. This incompleteness by depth is still unknown. Here, we built the global bathymetric and geographical distribution of 10,826 actinopterygian and 960 chondrichthyan fish species. We assessed their genetic coverage by depth and by ocean for three main metabarcoding markers used in the literature: teleo and MiFish-U/E. We also estimated the number of primer mismatches per species amplified by in silico polymerase chain reaction which influence the probability of species detection. Actinopterygians show a stronger decrease in species richness with depth than Chondrichthyans. These richness gradients are accompanied by a continuous species turnover between depths. Fish species coverage with the MiFish-U/E markers is higher than with teleo while threatened species are more sequenced than the others. “Deep-endemic” species, those not ascending to the shallow depth layer, are less sequenced than not threatened species. The number of primer mismatches is not higher for deep-sea species than for shallower ones. eDNA metabarcoding is promising for species detection in the deep-sea to better account for the 3-dimensional structure of the ocean in marine biodiversity monitoring and conservation. However, we argue that sequencing efforts on “deep-endemic” species are needed.
An efficient reverse genetics platform in the model legume Medicago truncatula
Medicago truncatula is one of the model species for legume studies. In an effort to develop legume genetics resources, > 21 700 Tnt1 retrotransposon insertion lines have been generated. To facilitate fast-growing needs in functional genomics, two reverse genetics approaches have been established: web-based database searching and PCR-based reverse screening. More than 840 genes have been reverse screened using the PCR-based approach over the past 6 yr to identify mutants in these genes. Overall, c. 84% (705 genes) success rate was achieved in identifying mutants with at least one Tnt1 insertion, of which c. 50% (358 genes) had three or more alleles. To demonstrate the utility of the two reverse genetics platforms, two mutant alleles were isolated for each of the two floral homeotic MADS-box genes, MtPISTILATA and MtAGAMOUS. Molecular and genetic analyses indicate that Tnt1 insertions in exons of both genes are responsible for the defects in floral organ development. In summary, we have developed two efficient reverse genetics platforms to facilitate functional characterization of M. truncatula genes.
Predicting disease genes using protein–protein interactions
Background: The responsible genes have not yet been identified for many genetically mapped disease loci. Physically interacting proteins tend to be involved in the same cellular process, and mutations in their genes may lead to similar disease phenotypes. Objective: To investigate whether protein–protein interactions can predict genes for genetically heterogeneous diseases. Methods: 72 940 protein–protein interactions between 10 894 human proteins were used to search 432 loci for candidate disease genes representing 383 genetically heterogeneous hereditary diseases. For each disease, the protein interaction partners of its known causative genes were compared with the disease associated loci lacking identified causative genes. Interaction partners located within such loci were considered candidate disease gene predictions. Prediction accuracy was tested using a benchmark set of known disease genes. Results: Almost 300 candidate disease gene predictions were made. Some of these have since been confirmed. On average, 10% or more are expected to be genuine disease genes, representing a 10-fold enrichment compared with positional information only. Examples of interesting candidates are AKAP6 for arrythmogenic right ventricular dysplasia 3 and SYN3 for familial partial epilepsy with variable foci. Conclusions: Exploiting protein–protein interactions can greatly increase the likelihood of finding positional candidate disease genes. When applied on a large scale they can lead to novel candidate gene predictions.
RefSeq database growth influences the accuracy of k-mer-based lowest common ancestor species identification
In order to determine the role of the database in taxonomic sequence classification, we examine the influence of the database over time on k -mer-based lowest common ancestor taxonomic classification. We present three major findings: the number of new species added to the NCBI RefSeq database greatly outpaces the number of new genera; as a result, more reads are classified with newer database versions, but fewer are classified at the species level; and Bayesian-based re-estimation mitigates this effect but struggles with novel genomes. These results suggest a need for new classification approaches specially adapted for large databases.
Increasing African genomic data generation and sharing to resolve rare and undiagnosed diseases in Africa: a call-to-action by the H3Africa rare diseases working group
The rich and diverse genomics of African populations is significantly underrepresented in reference and in disease-associated databases. This renders interpreting the Next Generation Sequencing (NGS) data and reaching a diagnostic more difficult in Africa and for the African diaspora. It increases chances for false positives with variants being misclassified as pathogenic due to their novelty or rarity. We can increase African genomic data by (1) making consent for sharing aggregate frequency data an essential component of research toolkit; (2) encouraging investigators with African data to share available data through public resources such as gnomAD, AVGD, ClinVar, DECIPHER and to use MatchMaker Exchange; (3) educating African research participants on the meaning and value of sharing aggregate frequency data; and (4) increasing funding to scale-up the production of African genomic data that will be more representative of the geographical and ethno-linguistic variation on the continent. The RDWG of H3Africa is hereby calling to action because this underrepresentation accentuates the health disparities. Applying the NGS to shorten the diagnostic odyssey or to guide therapeutic options for rare diseases will fully work for Africans only when public repositories include sufficient data from African subjects.
Cultivation and sequencing of rumen microbiome members from the Hungate1000 Collection
Rumen microbiome biology gets a boost with the release of 410 high-quality reference genomes from the Hungate1000 project. Productivity of ruminant livestock depends on the rumen microbiota, which ferment indigestible plant polysaccharides into nutrients used for growth. Understanding the functions carried out by the rumen microbiota is important for reducing greenhouse gas production by ruminants and for developing biofuels from lignocellulose. We present 410 cultured bacteria and archaea, together with their reference genomes, representing every cultivated rumen-associated archaeal and bacterial family. We evaluate polysaccharide degradation, short-chain fatty acid production and methanogenesis pathways, and assign specific taxa to functions. A total of 336 organisms were present in available rumen metagenomic data sets, and 134 were present in human gut microbiome data sets. Comparison with the human microbiome revealed rumen-specific enrichment for genes encoding de novo synthesis of vitamin B 12 , ongoing evolution by gene loss and potential vertical inheritance of the rumen microbiome based on underrepresentation of markers of environmental stress. We estimate that our Hungate genome resource represents ∼75% of the genus-level bacterial and archaeal taxa present in the rumen.
NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants
Here, we present the Northeast Asian Reference Database (NARD), including whole-genome sequencing data of 1779 individuals from Korea, Mongolia, Japan, China, and Hong Kong. NARD provides the genetic diversity of Korean ( n  = 850) and Mongolian ( n  = 384) ancestries that were not present in the 1000 Genomes Project Phase 3 (1KGP3). We combined and re-phased the genotypes from NARD and 1KGP3 to construct a union set of haplotypes. This approach established a robust imputation reference panel for Northeast Asians, which yields the greatest imputation accuracy of rare and low-frequency variants compared with the existing panels. NARD imputation panel is available at https://nard.macrogen.com/ .
The use of taxon-specific reference databases compromises metagenomic classification
A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan’s fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
GAPeDNA: Assessing and mapping global species gaps in genetic databases for eDNA metabarcoding
Aim: Environmental DNA metabarcoding has recently emerged as a non-invasive tool for aquatic biodiversity inventories, frequently surpassing traditional methods for detecting a wide range of taxa in most habitats. The major limitation currently impairing the large-scale application of eDNA-based inventories is the lack of species sequences available in public genetic databases. Unfortunately, these gaps are still unknown spatially and taxonomically, hindering targeted future sequencing efforts. Innovation: We propose GAPeDNA, a user-friendly web interface that provides a global overview of genetic database completeness for a given taxon across space and conservation status. As an application, we synthetized data from regional checklists for marine and freshwater fishes along with their IUCN conservation status to provide global maps of species coverage using the European Nucleotide Archive public reference database for 19 metabarcoding primers. This tool automatizes the scanning of gaps in these databases to guide future sequencing efforts and support the deployment of eDNA inventories at larger scale. This tool is flexible and can be expanded to other taxa and primers upon data availability. Main conclusions: Using our global fish case study, we show that gaps increase towards the tropics where species diversity and the number of threatened species are the highest. It highlights priority areas for fish sequencing like the Congo, the Mekong and the Mississippi freshwater basins which host more than 60 non-sequenced threatened fish species. For marine fishes, the Caribbean and East Africa host up to 42 non-sequenced threatened species. By presenting the global genetic database completeness for several primers on any taxa and building an open-access, updatable and flexible tool, GAPeDNA appears as a valuable contribution to support any kind of eDNA metabarcoding study.