Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Language
      Language
      Clear All
      Language
  • Subject
      Subject
      Clear All
      Subject
  • Item Type
      Item Type
      Clear All
      Item Type
  • Discipline
      Discipline
      Clear All
      Discipline
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
83 result(s) for "Atwal, Gurinder S."
Sort by:
Equitability, mutual information, and the maximal information coefficient
How should one quantify the strength of association between two random variables without bias for relationships of a specific form? Despite its conceptual simplicity, this notion of statistical “equitability” has yet to receive a definitive mathematical formalization. Here we argue that equitability is properly formalized by a self-consistency condition closely related to Data Processing Inequality. Mutual information, a fundamental quantity in information theory, is shown to satisfy this equitability criterion. These findings are at odds with the recent work of Reshef et al. [Reshef DN, et al. (2011) Science 334(6062):1518–1524], which proposed an alternative definition of equitability and introduced a new statistic, the “maximal information coefficient” (MIC), said to satisfy equitability in contradistinction to mutual information. These conclusions, however, were supported only with limited simulation evidence, not with mathematical arguments. Upon revisiting these claims, we prove that the mathematical definition of equitability proposed by Reshef et al. cannot be satisfied by any (nontrivial) dependence measure. We also identify artifacts in the reported simulation evidence. When these artifacts are removed, estimates of mutual information are found to be more equitable than estimates of MIC. Mutual information is also observed to have consistently higher statistical power than MIC. We conclude that estimating mutual information provides a natural (and often practical) way to equitably quantify statistical associations in large datasets.
Designing meaningful continuous representations of T cell receptor sequences with deep generative models
T Cell Receptor (TCR) antigen binding underlies a key mechanism of the adaptive immune response yet the vast diversity of TCRs and the complexity of protein interactions limits our ability to build useful low dimensional representations of TCRs. To address the current limitations in TCR analysis we develop a capacity-controlled disentangling variational autoencoder trained using a dataset of approximately 100 million TCR sequences, that we name TCR-VALID. We design TCR-VALID such that the model representations are low-dimensional, continuous, disentangled, and sufficiently informative to provide high-quality TCR sequence de novo generation. We thoroughly quantify these properties of the representations, providing a framework for future protein representation learning in low dimensions. The continuity of TCR-VALID representations allows fast and accurate TCR clustering and is benchmarked against other state-of-the-art TCR clustering tools and pre-trained language models. Relating T cell receptor (TCR) sequencing to antigen specificity is a challenge especially when TCR specificity is unclear. Here the authors use a low dimensional generative approach to model TCR sequence similarity and to associate TCR sequences with the same specificity.
Optimized murine sample sizes for RNA sequencing studies revealed from large scale comparative analysis
Determining the appropriate sample size (N) for bulk RNA sequencing experiments is critical for obtaining reliable results. We show in two N  = 30 profiling studies, comparing wild-type mice and mice in which one copy of a gene has been deleted, the N required to minimize false positives and maximize true discoveries found in the N  = 30 experiment. Results from experiments with N  = 4 or less are shown to be highly misleading, given the high false positive rate and the lack of discovery of genes later found with higher N. For a cut-off of 2-fold expression differences, we find an N of 6-7 mice is required to consistently decrease the false positive rate to below 50%, and the detection sensitivity to above 50%. More is always better for both metrics – and an N of 8-12 is significantly better in recapitulating the full experiment.A common way to reduce the false discovery rate in underpowered experiments is to raise the fold cutoff. We show that this strategy is no substitute for increasing the N of the experiment: it results in consistently inflated effect sizes and causes a substantial drop in sensitivity of detection. These data should be helpful to scientists in choosing their Ns. Determining the appropriate sample size (N) for bulk RNA sequencing experiments is critical to ensure reliable results. Here the authors perform an unusually large N experiment ( N  = 30 per group), analyzing changes in gene expression in two genetically modified mice compared to controls. They find that a surprisingly high N is required to keep the false positive rate below 50% and detection sensitivity above 50%.
Single T Cell Sequencing Demonstrates the Functional Role of αβ TCR Pairing in Cell Lineage and Antigen Specificity
Although structural studies of individual T cell receptors (TCRs) have revealed important roles for both the α and β chain in directing MHC and antigen recognition, repertoire-level immunogenomic analyses have historically examined the β chain alone. To determine the amount of useful information about TCR repertoire function encoded within αβ pairings, we analyzed paired TCR sequences from nearly 100,000 unique CD4 and CD8 T cells captured using two different high-throughput, single-cell sequencing approaches. Our results demonstrate little overlap in the healthy CD4 and CD8 repertoires, with shared TCR sequences possessing significantly shorter CDR3 sequences corresponding to higher generation probabilities. We further utilized tools from information theory and machine learning to show that while α and β chains are only weakly associated with lineage, αβ pairings appear to synergistically drive TCR-MHC interactions. Vαβ gene pairings were found to be the TCR feature most informative of T cell lineage, supporting the existence of germline-encoded paired αβ TCR-MHC interaction motifs. Finally, annotating our TCR pairs using a database of sequences with known antigen specificities, we demonstrate that approximately a third of the T cells possess α and β chains that each recognize different known antigens, suggesting that αβ pairing is critical for the accurate inference of repertoire functionality. Together, these findings provide biological insight into the functional implications of αβ pairing and highlight the utility of single-cell sequencing in immunogenomics.
Age-related gene expression signatures from limb skeletal muscles and the diaphragm in mice and rats reveal common and species-specific changes
Background As a result of aging, skeletal muscle undergoes atrophy and a decrease in function. This age-related skeletal muscle weakness is known as “sarcopenia”. Sarcopenia is part of the frailty observed in humans. In order to discover treatments for sarcopenia, it is necessary to determine appropriate preclinical models and the genes and signaling pathways that change with age in these models. Methods and results To understand the changes in gene expression that occur as a result of aging in skeletal muscles, we generated a multi-time-point gene expression signature throughout the lifespan of mice and rats, as these are the most commonly used species in preclinical research and intervention testing. Gastrocnemius, tibialis anterior, soleus, and diaphragm muscles from male and female C57Bl/6J mice and male Sprague Dawley rats were analyzed at ages 6, 12, 18, 21, 24, and 27 months, plus an additional 9-month group was used for rats. More age-related genes were identified in rat skeletal muscles compared with mice; this was consistent with the finding that rat muscles undergo more robust age-related decline in mass. In both species, pathways associated with innate immunity and inflammation linearly increased with age. Pathways linked with extracellular matrix remodeling were also universally downregulated. Interestingly, late downregulated pathways were exclusively found in the rat limb muscles and these were linked to metabolism and mitochondrial respiration; this was not seen in the mouse. Conclusions This extensive, side-by-side transcriptomic profiling shows that the skeletal muscle in rats is impacted more by aging compared with mice, and the pattern of decline in the rat may be more representative of the human. The observed changes point to potential therapeutic interventions to avoid age-related decline in skeletal muscle function.
Proteogenomic identification of Hepatitis B virus (HBV) genotype-specific HLA-I restricted peptides from HBV-positive patient liver tissues
The presentation of virus-derived peptides by HLA class I molecules on the surface of an infected cell and the recognition of these HLA-peptide complexes by, and subsequent activation of, CD8 + cytotoxic T cells provides an important mechanism for immune protection against viruses. Recent advances in proteogenomics have allowed researchers to discover a growing number of unique HLA-restricted viral peptides, resulting in a rapidly expanding repertoire of targets for immunotherapeutics (i.e. bispecific antibodies, engineered T-cell receptors (TCRs), chimeric antigen receptor T-cells (CAR-Ts)) to infected tissues. However, genomic variability between viral strains, such as Hepatitis-B virus (HBV), in combination with differences in patient HLA alleles, make it difficult to develop therapeutics against these targets. To address this challenge, we developed a novel proteogenomics approach for generating patient-specific databases that enable the identification of viral peptides based on the viral transcriptomes sequenced from individual patient liver samples. We also utilized DNA sequencing of patient samples to identify HLA genotypes and assist in target selection. Liver samples from 48 HBV infected patients, primarily from Asia, were examined to reconstruct patient-specific HBV genomes, identify regions within the human chromosomes targeted by HBV integrations and obtain a comprehensive view of HBV peptide epitopes using our HLA class-I (HLA-I) immunopeptidomics discovery platform. Two previously reported HLA associated HBV-derived peptides, HLA-A02 binder FLLTRILTI (S 194-202 ) from the large surface antigen and HLA-A11 binder STLPETTVVRR (C 141-151 ) from the capsid protein were validated by our discovery platform, but both were detected at very low frequencies. In addition, we identified and validated, using heavy peptide analogues, novel strain-specific HBV-HLA associated peptides, such as GSLPQEHIVQK (P 606-616 ) and variants. Overall, our novel approach can guide the development of bispecific antibody, TCR-T, or CAR-T based therapeutics for the treatment of HBV-related HCC and inform vaccine development.
Microenvironment-dependent growth of Sezary cells in humanized IL-15 mice
Sezary syndrome (SS) is a rare, aggressive leukemic variant of cutaneous T-cell lymphoma (CTCL) that lacks adequate therapeutic options and representative small-animal models. Here, we demonstrate that IL-15 is a critical CTCL growth factor. Importantly, an immunodeficient knock-in mouse model genetically engineered to express human IL-15 uniquely supported the growth of SS patient samples relative to conventional immunodeficient mouse strains. SS patient-derived xenograft (PDX) models recapacitated key pathological features of the human disease, including skin infiltration and spread of leukemic cells to the periphery, and maintained the dependence on human IL-15 upon serial in vivo passaging. Detailed molecular characterization of the engrafted cells by single-cell transcriptomic analysis revealed congruent neoplastic gene expression signatures but distinct clonal engraftment patterns. Overall, we document an important dependence of Sezary cell survival and proliferation on IL-15 signaling and the utility of immunodeficient humanized IL-15 mice as hosts for SS – and potentially other T and NK cell-derived hematologic malignancies – PDX model generation. Furthermore, these studies advocate the thorough molecular understanding of the resultant PDX models to maximize their translational impact.
Digital wearable insole-based identification of knee arthropathies and gait signatures using machine learning
Gait is impaired in musculoskeletal conditions, such as knee arthropathy. Gait analysis is used in clinical practice to inform diagnosis and monitor disease progression or intervention response. However, clinical gait analysis relies on subjective visual observation of walking as objective gait analysis has not been possible within clinical settings due to the expensive equipment, large-scale facilities, and highly trained staff required. Relatively low-cost wearable digital insoles may offer a solution to these challenges. In this work, we demonstrate how a digital insole measuring osteoarthritis-specific gait signatures yields similar results to the clinical gait-lab standard. To achieve this, we constructed a machine learning model, trained on force plate data collected in participants with knee arthropathy and controls. This model was highly predictive of force plate data from a validation set (area under the receiver operating characteristics curve [auROC] = 0.86; area under the precision-recall curve [auPR] = 0.90) and of a separate, independent digital insole dataset containing control and knee osteoarthritis subjects (auROC = 0.83; auPR = 0.86). After showing that digital insole-derived gait characteristics are comparable to traditional gait measurements, we next showed that a single stride of raw sensor time-series data could be accurately assigned to each subject, highlighting that individuals using digital insoles can be identified by their gait characteristics. This work provides a framework for a promising alternative to traditional clinical gait analysis methods, adds to the growing body of knowledge regarding wearable technology analytical pipelines, and supports clinical development of at-home gait assessments, with the potential to improve the ease, frequency, and depth of patient monitoring. The way we walk – our ‘gait’ – is a key indicator of health. Gait irregularities like limping, shuffling or a slow pace can be signs of muscle or joint problems. Assessing a patient’s gait is therefore an important element in diagnosing these conditions, and in evaluating whether treatments are working. Gait is often assessed via a simple visual inspection, with patients being asked to walk back and forth in a doctor’s office. While quick and easy, this approach is highly subjective and therefore imprecise. ‘Objective gait analysis’ is a more accurate alternative, but it relies on tests being conducted in specialised laboratories with large-scale, expensive equipment operated by highly trained staff. Unfortunately, this means that gait laboratories are not accessible for everyday clinical use. In response, Wipperman et al. aimed to develop a low-cost alternative to the complex equipment used in gait laboratories. To do this, they harnessed wearable sensor technologies – devices that can directly measure physiological data while embedded in clothing or attached to the user. Wearable sensors have the advantage of being cheap, easy to use, and able to provide clinically useful information without specially trained staff. Wipperman et al. analysed data from classic gait laboratory devices, as well as ‘digital insoles’ equipped with sensors that captured foot movements and pressure as participants walked. The analysis first ‘trained’ on data from gait laboratories (called force plates) and then applied the method to gait measurements obtained from digital insoles worn by either healthy participants or patients with knee problems. Analysis of the pressure data from the insoles confirmed that they could accurately predict which measurements were from healthy individuals, and which were from patients. The gait characteristics detected by the insoles were also comparable to lab-based measurements – in other words, the insoles provided similar type and quality of data as a gait laboratory. Further analysis revealed that information from just a single step could reveal additional information about the subject’s walking. These results support the use of wearable devices as a simple and relatively inexpensive way to measure gait in everyday clinical practice, without the need for specialised laboratories and visits to the doctor’s office. Although the digital insoles will require further analytical and clinical study before they can be widely used, Wipperman et al. hope they will eventually make monitoring muscle and joint conditions easier and more affordable.
Identification of pan-cancer/testis genes and validation of therapeutic targeting in triple-negative breast cancer: Lin28a-based and Siglece-based vaccination induces antitumor immunity and inhibits metastasis
BackgroundCancer–testis (CT) genes are targets for tumor antigen-specific immunotherapy given that their expression is normally restricted to the immune-privileged testis in healthy individuals with aberrant expression in tumor tissues. While they represent targetable germ tissue antigens and play important functional roles in tumorigenesis, there is currently no standardized approach for identifying clinically relevant CT genes. Optimized algorithms and validated methods for accurate prediction of reliable CT antigens (CTAs) with high immunogenicity are also lacking.MethodsSequencing data from the Genotype-Tissue Expression (GTEx) and The Genomic Data Commons (GDC) databases was used for the development of a bioinformatic pipeline to identify CT exclusive genes. A CT germness score was calculated based on the number of CT genes expressed within a tumor type and their degree of expression. The impact of tumor germness on clinical outcome was evaluated using healthy GTEx and GDC tumor samples. We then used a triple-negative breast cancer mouse model to develop and test an algorithm that predicts epitope immunogenicity based on the identification of germline sequences with strong major histocompatibility complex class I (MHCI) and MHCII binding affinities. Germline sequences for CT genes were synthesized as long synthetic peptide vaccines and tested in the 4T1 triple-negative model of invasive breast cancer with Poly(I:C) adjuvant. Vaccine immunogenicity was determined by flow cytometric analysis of in vitro and in vivo T-cell responses. Primary tumor growth and lung metastasis was evaluated by histopathology, flow cytometry and colony formation assay.ResultsWe developed a new bioinformatic pipeline to reliably identify CT exclusive genes as immunogenic targets for immunotherapy. We identified CT genes that are exclusively expressed within the testis, lack detectable thymic expression, and are significantly expressed in multiple tumor types. High tumor germness correlated with tumor progression but not with tumor mutation burden, supporting CTAs as appealing targets in low mutation burden tumors. Importantly, tumor germness also correlated with markers of antitumor immunity. Vaccination of 4T1 tumor-bearing mice with Siglece and Lin28a antigens resulted in increased T-cell antitumor immunity and reduced primary tumor growth and lung metastases.ConclusionOur results present a novel strategy for the identification of highly immunogenic CTAs for the development of targeted vaccines that induce antitumor immunity and inhibit metastasis.