Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Language
      Language
      Clear All
      Language
  • Subject
      Subject
      Clear All
      Subject
  • Item Type
      Item Type
      Clear All
      Item Type
  • Discipline
      Discipline
      Clear All
      Discipline
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
31 result(s) for "Wadsworth, Mark E."
Sort by:
Genome-Wide Association Study of CSF Levels of 59 Alzheimer's Disease Candidate Proteins: Significant Associations with Proteins Involved in Amyloid Processing and Inflammation
Cerebrospinal fluid (CSF) 42 amino acid species of amyloid beta (Aβ42) and tau levels are strongly correlated with the presence of Alzheimer's disease (AD) neuropathology including amyloid plaques and neurodegeneration and have been successfully used as endophenotypes for genetic studies of AD. Additional CSF analytes may also serve as useful endophenotypes that capture other aspects of AD pathophysiology. Here we have conducted a genome-wide association study of CSF levels of 59 AD-related analytes. All analytes were measured using the Rules Based Medicine Human DiscoveryMAP Panel, which includes analytes relevant to several disease-related processes. Data from two independently collected and measured datasets, the Knight Alzheimer's Disease Research Center (ADRC) and Alzheimer's Disease Neuroimaging Initiative (ADNI), were analyzed separately, and combined results were obtained using meta-analysis. We identified genetic associations with CSF levels of 5 proteins (Angiotensin-converting enzyme (ACE), Chemokine (C-C motif) ligand 2 (CCL2), Chemokine (C-C motif) ligand 4 (CCL4), Interleukin 6 receptor (IL6R) and Matrix metalloproteinase-3 (MMP3)) with study-wide significant p-values (p<1.46×10-10) and significant, consistent evidence for association in both the Knight ADRC and the ADNI samples. These proteins are involved in amyloid processing and pro-inflammatory signaling. SNPs associated with ACE, IL6R and MMP3 protein levels are located within the coding regions of the corresponding structural gene. The SNPs associated with CSF levels of CCL4 and CCL2 are located in known chemokine binding proteins. The genetic associations reported here are novel and suggest mechanisms for genetic control of CSF and plasma levels of these disease-related proteins. Significant SNPs in ACE and MMP3 also showed association with AD risk. Our findings suggest that these proteins/pathways may be valuable therapeutic targets for AD. Robust associations in cognitively normal individuals suggest that these SNPs also influence regulation of these proteins more generally and may therefore be relevant to other diseases.
Genome annotations matter: characterizing Ensembl hg38 annotations from 2014 to 2023
Background An accurate genome annotation is essential in many contexts, including RNA sequencing studies. Annotations include known genes and isoforms, detailing their location (chromosome, start, and end) and coding sequence, among other important metadata. Results We characterized changes in human Ensembl annotations from 2014 to 2023 and the important gains in our biological understanding in recent years. While generally gene and isoform annotations increased (2014: 58,812 genes ; 2023: 62,710), some years dropped (e.g., 2016). A similar pattern exists for the gene and isoform biotypes; both 2015 (19,825) and 2017 (19,828) have fewer genes annotated as protein-coding than 2014 (19,953) and 2016 (19,961)— 2023 has the most (20,048). PCBP1-AS1 had the most annotated isoforms (296). We quantified expression for isoforms that were new between 2019 and 2023 across nine GTEx tissues (58 samples) to demonstrate our significant gains in understanding recently. We saw 2,054 of these ‘new’ isoforms expressed in cerebellar hemisphere (594 in liver). For many genes, we saw that the relative expression of the ‘new’ isoforms was much greater than the previously known isoforms. Conclusions This study demonstrates the importance of an accurate genome annotation to truly understand the underlying complexity of biology that is often oversimplified by ignoring transcriptional complexity.
Evaluating the necessity of PCR duplicate removal from next-generation sequencing data and a comparison of approaches
Background Analyzing next-generation sequencing data is difficult because datasets are large, second generation sequencing platforms have high error rates, and because each position in the target genome (exome, transcriptome, etc.) is sequenced multiple times. Given these challenges, numerous bioinformatic algorithms have been developed to analyze these data. These algorithms aim to find an appropriate balance between data loss, errors, analysis time, and memory footprint. Typical analysis pipelines require multiple steps. If one or more of these steps is unnecessary, it would significantly decrease compute time and data manipulation to remove the step. One step in many pipelines is PCR duplicate removal, where PCR duplicates arise from multiple PCR products from the same template molecule binding on the flowcell. These are often removed because there is concern they can lead to false positive variant calls. Picard (MarkDuplicates) and SAMTools (rmdup) are the two main softwares used for PCR duplicate removal. Results Approximately 92 % of the 17+ million variants called were called whether we removed duplicates with Picard or SAMTools, or left the PCR duplicates in the dataset. There were no significant differences between the unique variant sets when comparing the transition/transversion ratios ( p  = 1.0), percentage of novel variants ( p  = 0.99), average population frequencies ( p  = 0.99), and the percentage of protein-changing variants ( p  = 1.0). Results were similar for variants in the American College of Medical Genetics genes. Genotype concordance between NGS and SNP chips was above 99 % for all genotype groups (e.g., homozygous reference). Conclusions Our results suggest that PCR duplicate removal has minimal effect on the accuracy of subsequent variant calls.
A human breast cancer-derived xenograft and organoid platform for drug discovery and precision oncology
Models that recapitulate the complexity of human tumors are urgently needed to develop more effective cancer therapies. We report a bank of human patient-derived xenografts (PDXs) and matched organoid cultures from tumors that represent the greatest unmet need: endocrine-resistant, treatment-refractory and metastatic breast cancers. We leverage matched PDXs and PDX-derived organoids (PDxO) for drug screening that is feasible and cost-effective with in vivo validation. Moreover, we demonstrate the feasibility of using these models for precision oncology in real time with clinical care in a case of triple-negative breast cancer (TNBC) with early metastatic recurrence. Our results uncovered a Food and Drug Administration (FDA)-approved drug with high efficacy against the models. Treatment with this therapy resulted in a complete response for the individual and a progression-free survival (PFS) period more than three times longer than their previous therapies. This work provides valuable methods and resources for functional precision medicine and drug development for human breast cancer.
A bioinformatic survey of RNA isoform diversity and expression across 9 GTEx tissues using long-read sequencing data
Background Even though alternative RNA splicing was discovered nearly 50 years ago (1977), we still understand very little about most isoforms arising from a single gene, including in which tissues they are expressed and if their functions differ. Human gene annotations suggest remarkable transcriptional complexity, with approximately 252,798 distinct RNA isoform annotations from 62,710 gene bodies (Ensembl v109; 2023), emphasizing the need to understand their biological effects. For example, 256 gene bodies have ≥ 50 annotated isoforms, and 30 have ≥ 100, where one protein-coding gene ( MAPK10 ) even has 192 distinct RNA isoform annotations. Whether such isoform diversity results from biological redundancy or spurious alternative splicing (i.e., noise), or whether individual isoforms have specialized functions (even if subtle) remains a mystery for most genes. Three recent studies demonstrated that long-read RNAseq enables improved RNA isoform quantification for essentially any tissue, cell type, or biological condition (e.g., disease, development, aging, etc.), making it possible to better assess individual isoform expression and function. While each study provided important discoveries related to RNA isoform diversity, deeper exploration is needed. Results We sought to quantify and characterize real isoform usage across tissues (compared to annotations). We used long-read RNAseq data from 58 GTEx samples across nine tissues (three brain, two heart, muscle, lung, liver, and cultured fibroblasts) generated by Glinos et al. and found considerable isoform diversity within and across tissues. Cerebellar hemisphere was the most transcriptionally complex tissue (22,522 distinct isoforms; 3,726 unique); liver was the least diverse (12,435 distinct isoforms; 1,039 unique). We highlight gene clusters exhibiting high tissue-specific isoform diversity per tissue (e.g., TPM1 expresses 19 in heart’s atrial appendage). We also validated 447 of the 700 new isoforms discovered by Aguzzoli-Heberle et al. and found that 88 were expressed in all nine tissues, while 58 were specific to a single tissue. Conclusions This study represents a broad bioinformatic survey of the RNA isoform landscape, demonstrating isoform diversity across nine tissues and emphasizes the need for further verification, validation, and functional annotation research to better understand how individual isoforms from a single gene body contribute to human health and disease.
Variant Tool Chest: an improved tool to analyze and manipulate variant call format (VCF) files
Background Since the advent of next-generation sequencing many previously untestable hypotheses have been realized. Next-generation sequencing has been used for a wide range of studies in diverse fields such as population and medical genetics, phylogenetics, microbiology, and others. However, this novel technology has created unanticipated challenges such as the large numbers of genetic variants. Each caucasian genome has more than four million single nucleotide variants, insertions and deletions, copy number variants, and structural variants. Several formats have been suggested for storing these variants; however, the variant call format (VCF) has become the community standard. Results We developed new software called the Variant Tool Chest (VTC) to provide much needed tools to work with VCF files. VTC provides a variety of tools for manipulating, comparing, and analyzing VCF files beyond the functionality of existing tools. In addition, VTC was written to be easily extended with new tools. Conclusions Variant Tool Chest brings new and important functionality that complements and integrates well with existing software. VTC is available at https://github.com/mebbert/VariantToolChest
Genome annotations matter: characterizing Ensembl hg38 annotations from 2014 to 2023
An accurate genome annotation is essential in many contexts, including RNA sequencing studies. Annotations include known genes and isoforms, detailing their location (chromosome, start, and end) and coding sequence, among other important metadata. We characterized changes in human Ensembl annotations from 2014 to 2023 and the important gains in our biological understanding in recent years. While generally gene and isoform annotations increased (2014: 58,812 genes ; 2023: 62,710), some years dropped (e.g., 2016). A similar pattern exists for the gene and isoform biotypes; both 2015 (19,825) and 2017 (19,828) have fewer genes annotated as protein-coding than 2014 (19,953) and 2016 (19,961)-- 2023 has the most (20,048). PCBP1-AS1 had the most annotated isoforms (296). We quantified expression for isoforms that were new between 2019 and 2023 across nine GTEx tissues (58 samples) to demonstrate our significant gains in understanding recently. We saw 2,054 of these 'new' isoforms expressed in cerebellar hemisphere (594 in liver). For many genes, we saw that the relative expression of the 'new' isoforms was much greater than the previously known isoforms. This study demonstrates the importance of an accurate genome annotation to truly understand the underlying complexity of biology that is often oversimplified by ignoring transcriptional complexity.
A bioinformatic survey of RNA isoform diversity and expression across 9 GTEx tissues using long-read sequencing data
Even though alternative RNA splicing was discovered nearly 50 years ago (1977), we still understand very little about most isoforms arising from a single gene, including in which tissues they are expressed and if their functions differ. Human gene annotations suggest remarkable transcriptional complexity, with approximately 252,798 distinct RNA isoform annotations from 62,710 gene bodies (Ensembl v109; 2023), emphasizing the need to understand their biological effects. For example, 256 gene bodies have [greater than or equal to] 50 annotated isoforms, and 30 have [greater than or equal to] 100, where one protein-coding gene (MAPK10) even has 192 distinct RNA isoform annotations. Whether such isoform diversity results from biological redundancy or spurious alternative splicing (i.e., noise), or whether individual isoforms have specialized functions (even if subtle) remains a mystery for most genes. Three recent studies demonstrated that long-read RNAseq enables improved RNA isoform quantification for essentially any tissue, cell type, or biological condition (e.g., disease, development, aging, etc.), making it possible to better assess individual isoform expression and function. While each study provided important discoveries related to RNA isoform diversity, deeper exploration is needed. We sought to quantify and characterize real isoform usage across tissues (compared to annotations). We used long-read RNAseq data from 58 GTEx samples across nine tissues (three brain, two heart, muscle, lung, liver, and cultured fibroblasts) generated by Glinos et al. and found considerable isoform diversity within and across tissues. Cerebellar hemisphere was the most transcriptionally complex tissue (22,522 distinct isoforms; 3,726 unique); liver was the least diverse (12,435 distinct isoforms; 1,039 unique). We highlight gene clusters exhibiting high tissue-specific isoform diversity per tissue (e.g., TPM1 expresses 19 in heart's atrial appendage). We also validated 447 of the 700 new isoforms discovered by Aguzzoli-Heberle et al. and found that 88 were expressed in all nine tissues, while 58 were specific to a single tissue. This study represents a broad bioinformatic survey of the RNA isoform landscape, demonstrating isoform diversity across nine tissues and emphasizes the need for further verification, validation, and functional annotation research to better understand how individual isoforms from a single gene body contribute to human health and disease.
Sequencing the gaps: dark genomic regions persist in CHM13 despite long-read advances
Comprehensive genomic analysis is essential for advancing our understanding of human genetics and disease. However, short-read sequencing technologies are inherently limited in their ability to resolve highly repetitive, structurally complex, and low-mappability genomic regions, previously coined as \"dark\" regions. Long-read sequencing technologies, such as PacBio and Oxford Nanopore Technologies (ONT), offer improved resolution of these regions, yet they are not perfect. With the advent of the new Telomere-to-Telomere (T2T) CHM13 reference genome, exploring its effect on dark regions is prudent. In this study, we systematically analyze dark regions across four human genome references-HG19, HG38 (with and without alternate contigs), and CHM13-using both short- and long-read sequencing data. We found that dark regions increase as the reference becomes more complete, especially dark-by-MAPQ regions, but that long-read sequencing significantly reduces the number of dark regions in the genome, particularly within gene bodies. However, we identify potential alignment challenges in long-read data, such as centromeric regions. These findings highlight the importance of both reference genome selection and sequencing technology choice in achieving a truly comprehensive genomic analysis.
Surveying the landscape of RNA isoform diversity and expression across 9 GTEx tissues using long-read sequencing data
Even though alternative RNA splicing was discovered nearly 50 years ago (1977), we still understand very little about most isoforms arising from a single gene, including in which tissues they are expressed and if their functions differ. Human gene annotations suggest remarkable transcriptional complexity, with approximately 252,798 distinct RNA isoform annotations from 62,710 gene bodies (Ensembl v109; 2023), emphasizing the need to understand their biological effects. For example, 256 gene bodies have ≥50 annotated isoforms and 30 have ≥100, where one protein-coding gene ( ) even has 192 distinct RNA isoform annotations. Whether such isoform diversity results from biological redundancy or spurious alternative splicing (i.e., noise), or whether individual isoforms have specialized functions (even if subtle) remains a mystery for most genes. Recent studies by Aguzzoli-Heberle et al., Leung et al., and Glinos et al. demonstrated long-read RNAseq enables improved RNA isoform quantification for essentially any tissue, cell type, or biological condition ( disease, development, aging, etc.), making it possible to better assess individual isoform expression and function. While each study provided important discoveries related to RNA isoform diversity, deeper exploration is needed. We sought to quantify and characterize real isoform usage across tissues (compared to annotations). We used long-read RNAseq data from 58 GTEx samples across nine tissues (three brain, two heart, muscle, lung, liver, and cultured fibroblasts) generated by Glinos et al. and found considerable isoform diversity within and across tissues. Cerebellar hemisphere was the most transcriptionally complex tissue (22,522 distinct isoforms; 3,726 unique); liver was least diverse (12,435 distinct isoforms; 1,039 unique). We highlight gene clusters exhibiting high tissue-specific isoform diversity per tissue ( , expresses 19 in heart's atrial appendage). We also validated 447 of the 700 new isoforms discovered by Aguzzoli-Heberle et al. and found that 88 were expressed in all nine tissues, while 58 were specific to a single tissue. This study represents a broad survey of the RNA isoform landscape, demonstrating isoform diversity across nine tissues and emphasizes the need to better understand how individual isoforms from a single gene body contribute to human health and disease.