Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Language
      Language
      Clear All
      Language
  • Subject
      Subject
      Clear All
      Subject
  • Item Type
      Item Type
      Clear All
      Item Type
  • Discipline
      Discipline
      Clear All
      Discipline
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
91 result(s) for "Sun, Shiquan"
Sort by:
Statistical analysis of spatial expression patterns for spatially resolved transcriptomic studies
Identifying genes that display spatial expression patterns in spatially resolved transcriptomic studies is an important first step toward characterizing the spatial transcriptomic landscape of complex tissues. Here we present a statistical method, SPARK, for identifying spatial expression patterns of genes in data generated from various spatially resolved transcriptomic techniques. SPARK directly models spatial count data through generalized linear spatial models. It relies on recently developed statistical formulas for hypothesis testing, providing effective control of type I errors and yielding high statistical power. With a computationally efficient algorithm, which is based on penalized quasi-likelihood, SPARK is also scalable to datasets with tens of thousands of genes measured on tens of thousands of samples. Analyzing four published spatially resolved transcriptomic datasets using SPARK, we show it can be up to ten times more powerful than existing methods and disclose biological discoveries that otherwise cannot be revealed by existing approaches. A statistical method called SPARK for analyzing spatially resolved transcriptomic data can efficiently identify spatially expressed genes with effective control of type I errors and high statistical power.
SPARK-X: non-parametric modeling enables scalable and robust detection of spatial expression patterns for large spatial transcriptomic studies
Spatial transcriptomic studies are becoming increasingly common and large, posing important statistical and computational challenges for many analytic tasks. Here, we present SPARK-X, a non-parametric method for rapid and effective detection of spatially expressed genes in large spatial transcriptomic studies. SPARK-X not only produces effective type I error control and high power but also brings orders of magnitude computational savings. We apply SPARK-X to analyze three large datasets, one of which is only analyzable by SPARK-X. In these data, SPARK-X identifies many spatially expressed genes including those that are spatially expressed within the same cell type, revealing new biological insights.
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
Background Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. Results We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. Conclusions Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html .
Testing and controlling for horizontal pleiotropy with probabilistic Mendelian randomization in transcriptome-wide association studies
Integrating results from genome-wide association studies (GWASs) and gene expression studies through transcriptome-wide association study (TWAS) has the potential to shed light on the causal molecular mechanisms underlying disease etiology. Here, we present a probabilistic Mendelian randomization (MR) method, PMR-Egger, for TWAS applications. PMR-Egger relies on a MR likelihood framework that unifies many existing TWAS and MR methods, accommodates multiple correlated instruments, tests the causal effect of gene on trait in the presence of horizontal pleiotropy, and is scalable to hundreds of thousands of individuals. In simulations, PMR-Egger provides calibrated type I error control for causal effect testing in the presence of horizontal pleiotropic effects, is reasonably robust under various types of model misspecifications, is more powerful than existing TWAS/MR approaches, and can directly test for horizontal pleiotropy. We illustrate the benefits of PMR-Egger in applications to 39 diseases and complex traits obtained from three GWASs including the UK Biobank. Transcriptome-wide association studies integrate GWAS and transcriptome data to examine the molecular mechanisms underlying disease etiology. Here the authors present PMR-Egger, a powerful TWAS method based on probabilistic Mendelian Randomization.
Integrative differential expression and gene set enrichment analysis using summary statistics for scRNA-seq studies
Differential expression (DE) analysis and gene set enrichment (GSE) analysis are commonly applied in single cell RNA sequencing (scRNA-seq) studies. Here, we develop an integrative and scalable computational method, iDEA, to perform joint DE and GSE analysis through a hierarchical Bayesian framework. By integrating DE and GSE analyses, iDEA can improve the power and consistency of DE analysis and the accuracy of GSE analysis. Importantly, iDEA uses only DE summary statistics as input, enabling effective data modeling through complementing and pairing with various existing DE methods. We illustrate the benefits of iDEA with extensive simulations. We also apply iDEA to analyze three scRNA-seq data sets, where iDEA achieves up to five-fold power gain over existing GSE methods and up to 64% power gain over existing DE methods. The power gain brought by iDEA allows us to identify many pathways that would not be identified by existing approaches in these data. Differential expression (DE) and gene set enrichment (GSE) analysis tend to be carried out separately. Here, the authors present iDEA (integrative Differential expression and gene set Enrichment Analysis) for the analysis of scRNAseq data which uses a Baysian approach to jointly model DE and GSE for improved power in both tasks.
Powerful and accurate detection of temporal gene expression patterns from multi-sample multi-stage single-cell transcriptomics data with TDEseq
We present a non-parametric statistical method called TDEseq that takes full advantage of smoothing splines basis functions to account for the dependence of multiple time points in scRNA-seq studies, and uses hierarchical structure linear additive mixed models to model the correlated cells within an individual. As a result, TDEseq demonstrates powerful performance in identifying four potential temporal expression patterns within a specific cell type. Extensive simulation studies and the analysis of four published scRNA-seq datasets show that TDEseq can produce well-calibrated p -values and up to 20% power gain over the existing methods for detecting temporal gene expression patterns.
Delineating the dynamic evolution from preneoplasia to invasive lung adenocarcinoma by integrating single-cell RNA sequencing and spatial transcriptomics
The cell ecology and spatial niche implicated in the dynamic and sequential process of lung adenocarcinoma (LUAD) from adenocarcinoma in situ (AIS) to minimally invasive adenocarcinoma (MIA) and subsequent invasive adenocarcinoma (IAC) have not yet been elucidated. Here, we performed an integrative analysis of single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) to characterize the cell atlas of the invasion trajectory of LUAD. We found that the UBE2C + cancer cell subpopulation constantly increased during the invasive process of LUAD with remarkable elevation in IAC, and its spatial distribution was in the peripheral cancer region of the IAC, representing a more malignant phenotype. Furthermore, analysis of the TME cell type subpopulation showed a constant decrease in mast cells, monocytes, and lymphatic endothelial cells, which were implicated in the whole process of invasive LUAD, accompanied by an increase in NK cells and MALT B cells from AIS to MIA and an increase in Tregs and secretory B cells from MIA to IAC. Notably, for AIS, cancer cells, NK cells, and mast cells were colocalized in the cancer region; however, for IAC, Tregs colocalized with cancer cells. Finally, communication and interaction between cancer cells and TME cell-induced constitutive activation of TGF-β signaling were involved in the invasion of IAC. Therefore, our results reveal the specific cellular information and spatial architecture of cancer cells and TME subpopulations, as well as the cellular interaction between them, which will facilitate the identification and development of precision medicine in the invasive process of LUAD from AIS to IAC.Lung cancer: Mapping the invasionA map showing how lung cancer progresses from static to invasive carcinomas could inform new therapeutic approaches. The World Health Organisation defines three stages of lung adenocarcinoma: first, in situ lesions, then minimally invasive, then fully invasive. Details about the cell populations involved and their locations during cancer development have been limited. Jianfei Zhu at Air Force Medical University in Xi’an, China, and co-workers have combined the strengths of single-cell RNA sequencing and spatial transcriptomics to produce a map showing where and when different cell types become involved in lung cancer progression. They showed that UBE2C cells, well-known tumor-forming cells, are involved at early stages, and could be key clinical indicators of invasion. Later, regulatory T-cells, which can suppress the body’s immune responses to tumors, are recruited into cancer regions while other cell types are reduced.
Unveiling inflammatory and prehypertrophic cell populations as key contributors to knee cartilage degeneration in osteoarthritis using multi-omics data integration
ObjectivesSingle-cell and spatial transcriptomics analysis of human knee articular cartilage tissue to present a comprehensive transcriptome landscape and osteoarthritis (OA)-critical cell populations.MethodsSingle-cell RNA sequencing and spatially resolved transcriptomic technology have been applied to characterise the cellular heterogeneity of human knee articular cartilage which were collected from 8 OA donors, and 3 non-OA control donors, and a total of 19 samples. The novel chondrocyte population and marker genes of interest were validated by immunohistochemistry staining, quantitative real-time PCR, etc. The OA-critical cell populations were validated through integrative analyses of publicly available bulk RNA sequencing data and large-scale genome-wide association studies.ResultsWe identified 33 cell population-specific marker genes that define 11 chondrocyte populations, including 9 known populations and 2 new populations, that is, pre-inflammatory chondrocyte population (preInfC) and inflammatory chondrocyte population (InfC). The novel findings that make this an important addition to the literature include: (1) the novel InfC activates the mediator MIF-CD74; (2) the prehypertrophic chondrocyte (preHTC) and hypertrophic chondrocyte (HTC) are potentially OA-critical cell populations; (3) most OA-associated differentially expressed genes reside in the articular surface and superficial zone; (4) the prefibrocartilage chondrocyte (preFC) population is a major contributor to the stratification of patients with OA, resulting in both an inflammatory-related subtype and a non-inflammatory-related subtype.ConclusionsOur results highlight InfC, preHTC, preFC and HTC as potential cell populations to target for therapy. Also, we conclude that profiling of those cell populations in patients might be used to stratify patient populations for defining cohorts for clinical trials and precision medicine.
A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification
High dimensionality and small sample sizes, and their inherent risk of overfitting, pose great challenges for constructing efficient classifiers in microarray data classification. Therefore a feature selection technique should be conducted prior to data classification to enhance prediction performance. In general, filter methods can be considered as principal or auxiliary selection mechanism because of their simplicity, scalability, and low computational complexity. However, a series of trivial examples show that filter methods result in less accurate performance because they ignore the dependencies of features. Although few publications have devoted their attention to reveal the relationship of features by multivariate-based methods, these methods describe relationships among features only by linear methods. While simple linear combination relationship restrict the improvement in performance. In this paper, we used kernel method to discover inherent nonlinear correlations among features as well as between feature and target. Moreover, the number of orthogonal components was determined by kernel Fishers linear discriminant analysis (FLDA) in a self-adaptive manner rather than by manual parameter settings. In order to reveal the effectiveness of our method we performed several experiments and compared the results between our method and other competitive multivariate-based features selectors. In our comparison, we used two classifiers (support vector machine, [Formula: see text]-nearest neighbor) on two group datasets, namely two-class and multi-class datasets. Experimental results demonstrate that the performance of our method is better than others, especially on three hard-classify datasets, namely Wang's Breast Cancer, Gordon's Lung Adenocarcinoma and Pomeroy's Medulloblastoma.
The pyroptosis-related gene signature predicts prognosis and indicates immune activity in hepatocellular carcinoma
Background Hepatocellular carcinoma (HCC) remains one of the most common malignant tumors with poor survival. Pyroptosis is a kind of programmed cell death that can regulate the proliferation, invasion, and metastasis of tumor cells. However, the expression levels of pyroptosis-related genes (PRGs) in HCC and their relationship with prognosis are still unclear. Methods Our study identified 35 PRGs through bioinformatics analysis that were differentially expressed between tumor samples and nontumor samples. According to these differentially expressed genes, HCC patients could be divided into two groups, cluster 1 and cluster 2. The least absolute shrinkage and selection operator (LASSO) Cox regression method was performed to construct a 10-gene signature that classified HCC patients in the cancer genome atlas (TCGA) database into low-risk and high-risk groups. Results The results showed that the survival rate of HCC patients in the low-risk group was significantly higher than that in the high-risk group (p < 0.001). The validation cohort, the Gene Expression Omnibus (GEO) cohort, was divided into two risk groups based on the median risk score calculated by the TCGA cohort. The overall survival (OS) of the low-risk group was significantly better than that of the high-risk group (p = 0.007). Univariate and multivariate Cox regression analyses revealed that the risk score was an independent factor in predicting OS in HCC patients. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analyses showed that immune-related high-risk groups were rich in genes and had reduced immune status. Conclusions PRGs play a significant role in tumor immunity and have the potential capability to predict the prognosis of HCC patients.