Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
39
result(s) for
"Andrews, Tallulah"
Sort by:
Challenges in unsupervised clustering of single-cell RNA-seq data
by
Vladimir Yu Kiselev
,
Hemberg, Martin
,
Andrews, Tallulah S
in
Cells
,
Computer applications
,
Data processing
2019
Single-cell RNA sequencing (scRNA-seq) allows researchers to collect large catalogues detailing the transcriptomes of individual cells. Unsupervised clustering is of central importance for the analysis of these data, as it is used to identify putative cell types. However, there are many challenges involved. We discuss why clustering is a challenging problem from a computational point of view and what aspects of the data make it challenging. We also consider the difficulties related to the biological interpretation and annotation of the identified clusters.
Journal Article
Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data
by
McCarthy, Davis
,
Hemberg, Martin
,
Andrews, Tallulah S.
in
631/114/1314
,
631/1647/514/1949
,
Analysis
2021
Single-cell RNA sequencing (scRNA-seq) is a popular and powerful technology that allows you to profile the whole transcriptome of a large number of individual cells. However, the analysis of the large volumes of data generated from these experiments requires specialized statistical and computational methods. Here we present an overview of the computational workflow involved in processing scRNA-seq data. We discuss some of the most common tasks and the tools available for addressing central biological questions. In this article and our companion website (
https://scrnaseq-course.cog.sanger.ac.uk/website/index.html
), we provide guidelines regarding best practices for performing computational analyses. This tutorial provides a hands-on guide for experimentalists interested in analyzing their data as well as an overview for bioinformaticians seeking to develop new computational methods.
In this Tutorial Review, Hemberg et al. present an overview of the computational workflow involved in processing single-cell RNA sequencing data.
Journal Article
Tutorial: guidelines for annotating single-cell transcriptomic maps using automated and manual methods
by
Clarke, Zoe A.
,
MacParland, Sonya A.
,
Atif, Jawairia
in
631/114/1314
,
631/114/1386
,
631/114/2399
2021
Single-cell transcriptomics can profile thousands of cells in a single experiment and identify novel cell types, states and dynamics in a wide variety of tissues and organisms. Standard experimental protocols and analysis workflows have been developed to create single-cell transcriptomic maps from tissues. This tutorial focuses on how to interpret these data to identify cell types, states and other biologically relevant patterns with the objective of creating an annotated map of cells. We recommend a three-step workflow including automatic cell annotation (wherever possible), manual cell annotation and verification. Frequently encountered challenges are discussed, as well as strategies to address them. Guiding principles and specific recommendations for software tools and resources that can be used for each step are covered, and an R notebook is included to help run the recommended workflow. Basic familiarity with computer software is assumed, and basic knowledge of programming (e.g., in the R language) is recommended.
This tutorial provides guidelines for interpreting single-cell transcriptomic maps to identify cell types, states and other biologically relevant patterns.
Journal Article
False signals induced by single-cell imputation version 1; peer review: 4 approved with reservations
2018
Background: Single-cell RNASeq is a powerful tool for measuring gene expression at the resolution of individual cells. A significant challenge in the analysis of this data is the large amount of zero values, representing either missing data or no expression. Several imputation approaches have been proposed to deal with this issue, but since these methods generally rely on structure inherent to the dataset under consideration they may not provide any additional information.
Methods: We evaluated the risk of generating false positive or irreproducible results when imputing data with five different methods. We applied each method to a variety of simulated datasets as well as to permuted real single-cell RNASeq datasets and consider the number of false positive gene-gene correlations and differentially expressed genes. Using matched 10X Chromium and Smartseq2 data from the Tabula Muris database we examined the reproducibility of markers before and after imputation.
Results: The extent of false-positive signals introduced by imputation varied considerably by method. Data smoothing based methods, MAGIC and knn-smooth, generated a very high number of false-positives in both real and simulated data. Model-based imputation methods typically generated fewer false-positives but this varied greatly depending on how well datasets conformed to the underlying model. Furthermore, only SAVER exhibited reproducibility comparable to unimputed data across matched data.
Conclusions: Imputation of single-cell RNASeq data introduces circularity that can generate false-positive results. Thus, statistical tests applied to imputed data should be treated with care. Additional filtering by effect size can reduce but not fully eliminate these effects. Of the methods we considered, SAVER was the least likely to generate false or irreproducible results, thus should be favoured over alternatives if imputation is necessary.
Journal Article
EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data
by
Dao, The Phuong
,
Lun, Aaron T. L.
,
Riesenfeld, Samantha
in
Animal Genetics and Genomics
,
Bioinformatics
,
Biomarkers - metabolism
2019
Droplet-based single-cell RNA sequencing protocols have dramatically increased the throughput of single-cell transcriptomics studies. A key computational challenge when processing these data is to distinguish libraries for real cells from empty droplets. Here, we describe a new statistical method for calling cells from droplet-based data, based on detecting significant deviations from the expression profile of the ambient solution. Using simulations, we demonstrate that EmptyDrops has greater power than existing approaches while controlling the false discovery rate among detected cells. Our method also retains distinct cell types that would have been discarded by existing methods in several real data sets.
Journal Article
Single-cell atlas of the first intra-mammalian developmental stage of the human parasite Schistosoma mansoni
2020
Over 250 million people suffer from schistosomiasis, a tropical disease caused by parasitic flatworms known as schistosomes. Humans become infected by free-swimming, water-borne larvae, which penetrate the skin. The earliest intra-mammalian stage, called the schistosomulum, undergoes a series of developmental transitions. These changes are critical for the parasite to adapt to its new environment as it navigates through host tissues to reach its niche, where it will grow to reproductive maturity. Unravelling the mechanisms that drive intra-mammalian development requires knowledge of the spatial organisation and transcriptional dynamics of different cell types that comprise the schistomulum body. To fill these important knowledge gaps, we perform single-cell RNA sequencing on two-day old schistosomula of
Schistosoma mansoni
. We identify likely gene expression profiles for muscle, nervous system, tegument, oesophageal gland, parenchymal/primordial gut cells, and stem cells. In addition, we validate cell markers for all these clusters by in situ hybridisation in schistosomula and adult parasites. Taken together, this study provides a comprehensive cell-type atlas for the early intra-mammalian stage of this devastating metazoan parasite.
Schistosomes undergo several develepmental stages during infection of humans. Here, the authors perform single-cell RNA sequencing on the earliest intra-mammalian stage of
Schistosoma mansoni
and generate a comprehensive cell-type atlas for this human parasite.
Journal Article
Interpretable single-cell factor decomposition using sciRED
by
Pouyabahar, Delaram
,
Bader, Gary D.
,
Andrews, Tallulah
in
631/114/1305
,
631/114/2164
,
631/114/2415
2025
Single-cell RNA sequencing maps gene expression heterogeneity within a tissue. However, identifying biological signals in this data is challenging due to confounding technical factors, sparsity, and high dimensionality. Data factorization methods address this by separating and identifying signals in the data, such as gene expression programs, but the resulting factors must be manually interpreted. We developed Single-Cell Interpretable REsidual Decomposition (sciRED) to improve the interpretation of scRNA-seq factor analysis. sciRED removes known confounding effects, uses rotations to improve factor interpretability, maps factors to known covariates, identifies unexplained factors that may capture hidden biological phenomena, and determines the genes and biological processes represented by the resulting factors. We apply sciRED to multiple scRNA-seq datasets and identify sex-specific variation in a kidney map, discern strong and weak immune stimulation signals in a PBMC dataset, reduce ambient RNA contamination in a rat liver atlas to help identify strain variation and reveal rare cell type signatures and anatomical zonation gene programs in a healthy human liver map. These demonstrate that sciRED is useful in characterizing diverse biological signals within scRNA-seq data.
Single-cell RNA sequencing maps tissue-level gene expression heterogeneity but faces challenges in interpreting biological signals due to noise and technical confounders. Here, the authors present sciRED, a method that enhances the interpretation of scRNA-seq factorization by linking factors to known covariates and hidden biological phenomena.
Journal Article
Single‐Cell, Single‐Nucleus, and Spatial RNA Sequencing of the Human Liver Identifies Cholangiocyte and Mesenchymal Heterogeneity
by
Fischer, Sandra
,
MacParland, Sonya A.
,
Atif, Jawairia
in
Cell Nucleus - genetics
,
Cytoplasm
,
Datasets
2022
The critical functions of the human liver are coordinated through the interactions of hepatic parenchymal and non‐parenchymal cells. Recent advances in single‐cell transcriptional approaches have enabled an examination of the human liver with unprecedented resolution. However, dissociation‐related cell perturbation can limit the ability to fully capture the human liver’s parenchymal cell fraction, which limits the ability to comprehensively profile this organ. Here, we report the transcriptional landscape of 73,295 cells from the human liver using matched single‐cell RNA sequencing (scRNA‐seq) and single‐nucleus RNA sequencing (snRNA‐seq). The addition of snRNA‐seq enabled the characterization of interzonal hepatocytes at a single‐cell resolution, revealed the presence of rare subtypes of liver mesenchymal cells, and facilitated the detection of cholangiocyte progenitors that had only been observed during in vitro differentiation experiments. However, T and B lymphocytes and natural killer cells were only distinguishable using scRNA‐seq, highlighting the importance of applying both technologies to obtain a complete map of tissue‐resident cell types. We validated the distinct spatial distribution of the hepatocyte, cholangiocyte, and mesenchymal cell populations by an independent spatial transcriptomics data set and immunohistochemistry. Conclusion: Our study provides a systematic comparison of the transcriptomes captured by scRNA‐seq and snRNA‐seq and delivers a high‐resolution map of the parenchymal cell populations in the healthy human liver.
Journal Article
False signals induced by single-cell imputation version 2; peer review: 4 approved
2018
Background: Single-cell RNA-seq is a powerful tool for measuring gene expression at the resolution of individual cells. A challenge in the analysis of this data is the large amount of zero values, representing either missing data or no expression. Several imputation approaches have been proposed to address this issue, but they generally rely on structure inherent to the dataset under consideration they may not provide any additional information, hence, are limited by the information contained therein and the validity of their assumptions.
Methods: We evaluated the risk of generating false positive or irreproducible differential expression when imputing data with six different methods. We applied each method to a variety of simulated datasets as well as to permuted real single-cell RNA-seq datasets and consider the number of false positive gene-gene correlations and differentially expressed genes. Using matched 10X and Smart-seq2 data we examined whether cell-type specific markers were reproducible across datasets derived from the same tissue before and after imputation.
Results: The extent of false-positives introduced by imputation varied considerably by method. Data smoothing based methods, MAGIC, knn-smooth and dca, generated many false-positives in both real and simulated data. Model-based imputation methods typically generated fewer false-positives but this varied greatly depending on the diversity of cell-types in the sample. All imputation methods decreased the reproducibility of cell-type specific markers, although this could be mitigated by selecting markers with large effect size and significance.
Conclusions: Imputation of single-cell RNA-seq data introduces circularity that can generate false-positive results. Thus, statistical tests applied to imputed data should be treated with care. Additional filtering by effect size can reduce but not fully eliminate these effects. Of the methods we considered, SAVER was the least likely to generate false or irreproducible results, thus should be favoured over alternatives if imputation is necessary.
Journal Article
Publisher Correction: Challenges in unsupervised clustering of single-cell RNA-seq data
2019
During typesetting of this article, errors were inadvertently introduced to the hyperlinked URLs of some of the clustering tools in table 1 (Seurat, CIDR, pcaReduce and mpath), as well as to the numbering of the bold-text annotations in the reference list. The article has now been corrected online. The editors apologize for this error.
Journal Article