Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
LanguageLanguage
-
SubjectSubject
-
Item TypeItem Type
-
DisciplineDiscipline
-
YearFrom:-To:
-
More FiltersMore FiltersIs Peer Reviewed
Done
Filters
Reset
13,135
result(s) for
"RNA‐seq"
Sort by:
Current best practices in single‐cell RNA‐seq analysis: a tutorial
by
Luecken, Malte D
,
Theis, Fabian J
in
analysis pipeline development
,
computational biology
,
data analysis tutorial
2019
Single‐cell RNA‐seq has enabled gene expression to be studied at an unprecedented resolution. The promise of this technology is attracting a growing user base for single‐cell analysis methods. As more analysis tools are becoming available, it is becoming increasingly difficult to navigate this landscape and produce an up‐to‐date workflow to analyse one's data. Here, we detail the steps of a typical single‐cell RNA‐seq analysis, including pre‐processing (quality control, normalization, data correction, feature selection, and dimensionality reduction) and cell‐ and gene‐level downstream analysis. We formulate current best‐practice recommendations for these steps based on independent comparison studies. We have integrated these best‐practice recommendations into a workflow, which we apply to a public dataset to further illustrate how these steps work in practice. Our documented case study can be found at
https://www.github.com/theislab/single-cell-tutorial
. This review will serve as a workflow tutorial for new entrants into the field, and help established users update their analysis pipelines.
Graphical Abstract
This Tutorial details the steps of a typical single‐cell RNA‐seq analysis. Best‐practice recommendations are provided and illustrated with a workflow provided in the form of an open source code repository.
Journal Article
Single human oocyte transcriptome analysis reveals distinct maturation stage‐dependent pathways impacted by age
by
Barragán, Montserrat
,
Zambelli, Filippo
,
Heyn, Holger
in
Adolescent
,
Adult
,
advanced maternal age
2021
Female fertility is inversely correlated with maternal age due to a depletion of the oocyte pool and a reduction in oocyte developmental competence. Few studies have addressed the effect of maternal age on the human mature oocyte (MII) transcriptome, which is established during oocyte growth and maturation, however, the pathways involved remain unclear. Here, we characterize and compare the transcriptomes of a large cohort of fully grown germinal vesicle stage (GV) and in vitro matured (IVM‐MII) oocytes from women of varying reproductive age. First, we identified two clusters of cells reflecting the oocyte maturation stage (GV and IVM‐MII) with 4445 and 324 putative marker genes, respectively. Furthermore, we identified genes for which transcript representation either progressively increased or decreased with age. Our results indicate that the transcriptome is more affected by age in IVM‐MII oocytes (1219 genes) than in GV oocytes (596 genes). In particular, we found that transcripts of genes involved in chromosome segregation and RNA splicing significantly increased representation with age, while genes related to mitochondrial activity showed a lower representation. Gene regulatory network analysis facilitated the identification of potential upstream master regulators of the genes involved in those biological functions. Our analysis suggests that advanced maternal age does not globally affect the oocyte transcriptome at GV or IVM‐MII stages. Nonetheless, hundreds of genes displayed altered transcript representation, particularly in IVM‐MII oocytes, which might contribute to the age‐related quality decline in human oocytes. Here we used single‐oocyte RNA‐Seq to study the transcriptome changes occurring during human oocyte aging. We sampled Germinal Vesicle (GV) stage oocytes from women of age 18–43 and analyzed them directly (GV) or after in vitro maturation (IVM‐MII). We observed that age has a bigger impact on the IVM‐MII transcriptome. Transcripts belonging to biological pathways critical for oocyte maturation and function either increased (green, top) or decreased (red, bottom) in representation with age.
Journal Article
Best practices on the differential expression analysis of multi-species RNA-seq
by
Shetty, Amol C.
,
Dunning Hotopp, Julie C.
,
Chung, Matthew
in
Animal Genetics and Genomics
,
Animals
,
Best practice
2021
Advances in transcriptome sequencing allow for simultaneous interrogation of differentially expressed genes from multiple species originating from a single RNA sample, termed dual or multi-species transcriptomics. Compared to single-species differential expression analysis, the design of multi-species differential expression experiments must account for the relative abundances of each organism of interest within the sample, often requiring enrichment methods and yielding differences in total read counts across samples. The analysis of multi-species transcriptomics datasets requires modifications to the alignment, quantification, and downstream analysis steps compared to the single-species analysis pipelines. We describe best practices for multi-species transcriptomics and differential gene expression.
Journal Article
A new era of long-read sequencing for cancer genomics
by
Sereewattanawoot, Sarun
,
Sakamoto, Yoshitaka
,
Suzuki, Ayako
in
Cancer
,
DNA methylation
,
DNA sequencing
2020
Cancer is a disease largely caused by genomic aberrations. Utilizing many rapidly emerging sequencing technologies, researchers have studied cancer genomes to understand the molecular statuses of cancer cells and to reveal their vulnerabilities, such as driver mutations or gene expression. Long-read technologies enable us to identify and characterize novel types of cancerous mutations, including complicated structural variants in haplotype resolution. In this review, we introduce three representative platforms for long-read sequencing and research trends of cancer genomics with long-read data. Further, we describe that aberrant transcriptome and epigenome statuses, namely, fusion transcripts, as well as aberrant transcript isoforms and the phase information of DNA methylation, are able to be elucidated by long-read sequencers. Long-read sequencing may shed light on novel types of aberrations in cancer genomics that are being missed by conventional short-read sequencing analyses.
Journal Article
A benchmark of batch-effect correction methods for single-cell RNA sequencing data
by
Ang, Kok Siong
,
Goh, Michelle
,
Zhang, Xiaomeng
in
Algorithms
,
Animal Genetics and Genomics
,
Animals
2020
Background
Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal.
Results
We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression.
Conclusion
Based on our results, Harmony, LIGER, and Seurat 3 are the recommended methods for batch integration. Due to its significantly shorter runtime, Harmony is recommended as the first method to try, with the other methods as viable alternatives.
Journal Article
A systematic performance evaluation of clustering methods for single-cell RNA-seq data
by
Duò, Angelo
,
Robinson, Mark D.
,
Soneson, Charlotte
in
Algorithms
,
Clustering
,
Data processing
2018
Subpopulation identification, usually via some form of unsupervised clustering, is a fundamental step in the analysis of many single-cell RNA-seq data sets. This has motivated the development and application of a broad range of clustering methods, based on various underlying algorithms. Here, we provide a systematic and extensible performance evaluation of 14 clustering algorithms implemented in R, including both methods developed explicitly for scRNA-seq data and more general-purpose methods. The methods were evaluated using nine publicly available scRNA-seq data sets as well as three simulations with varying degree of cluster separability. The same feature selection approaches were used for all methods, allowing us to focus on the investigation of the performance of the clustering algorithms themselves. We evaluated the ability of recovering known subpopulations, the stability and the run time and scalability of the methods. Additionally, we investigated whether the performance could be improved by generating consensus partitions from multiple individual clustering methods. We found substantial differences in the performance, run time and stability between the methods, with SC3 and Seurat showing the most favorable results. Additionally, we found that consensus clustering typically did not improve the performance compared to the best of the combined methods, but that several of the top-performing methods already perform some type of consensus clustering. All the code used for the evaluation is available on GitHub ( https://github.com/markrobinsonuzh/scRNAseq_clustering_comparison ). In addition, an R package providing access to data and clustering results, thereby facilitating inclusion of new methods and data sets, is available from Bioconductor ( https://bioconductor.org/packages/DuoClustering2018 ).
Journal Article
Scalable, multimodal profiling of chromatin accessibility, gene expression and protein levels in single cells
by
Nazor, Kristopher L.
,
Sakaguchi, Shimon
,
Wing, James Badger
in
631/1647/2217/2218
,
631/61/212/177
,
631/61/514/2254
2021
Recent technological advances have enabled massively parallel chromatin profiling with scATAC-seq (single-cell assay for transposase accessible chromatin by sequencing). Here we present ATAC with select antigen profiling by sequencing (ASAP-seq), a tool to simultaneously profile accessible chromatin and protein levels. Our approach pairs sparse scATAC-seq data with robust detection of hundreds of cell surface and intracellular protein markers and optional capture of mitochondrial DNA for clonal tracking, capturing three distinct modalities in single cells. ASAP-seq uses a bridging approach that repurposes antibody:oligonucleotide conjugates designed for existing technologies that pair protein measurements with single-cell RNA sequencing. Together with DOGMA-seq, an adaptation of CITE-seq (cellular indexing of transcriptomes and epitopes by sequencing) for measuring gene activity across the central dogma of gene regulation, we demonstrate the utility of systematic multi-omic profiling by revealing coordinated and distinct changes in chromatin, RNA and surface proteins during native hematopoietic differentiation and peripheral blood mononuclear cell stimulation and as a combinatorial decoder and reporter of multiplexed perturbations in primary T cells.
Chromatin accessibility, gene expression and protein levels are measured in the same single cell.
Journal Article
A deep proteome and transcriptome abundance atlas of 29 healthy human tissues
2019
Genome‐, transcriptome‐ and proteome‐wide measurements provide insights into how biological systems are regulated. However, fundamental aspects relating to which human proteins exist, where they are expressed and in which quantities are not fully understood. Therefore, we generated a quantitative proteome and transcriptome abundance atlas of 29 paired healthy human tissues from the Human Protein Atlas project representing human genes by 18,072 transcripts and 13,640 proteins including 37 without prior protein‐level evidence. The analysis revealed that hundreds of proteins, particularly in testis, could not be detected even for highly expressed mRNAs, that few proteins show tissue‐specific expression, that strong differences between mRNA and protein quantities within and across tissues exist and that protein expression is often more stable across tissues than that of transcripts. Only 238 of 9,848 amino acid variants found by exome sequencing could be confidently detected at the protein level showing that proteogenomics remains challenging, needs better computational methods and requires rigorous validation. Many uses of this resource can be envisaged including the study of gene/protein expression regulation and biomarker specificity evaluation.
Synopsis
Proteome and transcriptome quantification across tissues reveals which human genes exist as transcripts and proteins, where they are expressed and in which approximate quantities. Tissue‐specific protein expression is found to be a rare and quantitative rather than qualitative characteristic.
The study presents the most comprehensive atlas of protein expression to date, across 29 healthy human tissues.
Protein level evidence is provided for 13,640 genes and 15,257 isoforms, including 37 missing proteins.
Tissue‐specific protein expression is rare and quantitative rather than qualitative characteristic.
Proteogenomics is still challenging and needs rigorous validation by synthetic peptides.
Graphical Abstract
Proteome and transcriptome quantification across tissues reveals which human genes exist as transcripts and proteins, where they are expressed and in which approximate quantities. Tissue‐specific protein expression is found to be a rare and quantitative rather than qualitative characteristic.
Journal Article
Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes
by
Hemberg, Martin
,
Gaffney, Daniel J.
,
Imaz, Maria
in
631/114/2785
,
631/114/794
,
631/208/514/1949
2020
Methods to deconvolve single-cell RNA-sequencing (scRNA-seq) data are necessary for samples containing a mixture of genotypes, whether they are natural or experimentally combined. Multiplexing across donors is a popular experimental design that can avoid batch effects, reduce costs and improve doublet detection. By using variants detected in scRNA-seq reads, it is possible to assign cells to their donor of origin and identify cross-genotype doublets that may have highly similar transcriptional profiles, precluding detection by transcriptional profile. More subtle cross-genotype variant contamination can be used to estimate the amount of ambient RNA. Ambient RNA is caused by cell lysis before droplet partitioning and is an important confounder of scRNA-seq analysis. Here we develop souporcell, a method to cluster cells using the genetic variants detected within the scRNA-seq reads. We show that it achieves high accuracy on genotype clustering, doublet detection and ambient RNA estimation, as demonstrated across a range of challenging scenarios.
Souporcell clusters single-cell RNA-seq data using genotype information without the use of a genotype reference.
Journal Article
Decontamination of ambient RNA in single-cell RNA-seq with DecontX
by
Wang, Zhe
,
Johnson, W Evan
,
Corbett, Sean E.
in
Animal Genetics and Genomics
,
Animals
,
Bayes Theorem
2020
Droplet-based microfluidic devices have become widely used to perform single-cell RNA sequencing (scRNA-seq). However, ambient RNA present in the cell suspension can be aberrantly counted along with a cell’s native mRNA and result in cross-contamination of transcripts between different cell populations. DecontX is a novel Bayesian method to estimate and remove contamination in individual cells. DecontX accurately predicts contamination levels in a mouse-human mixture dataset and removes aberrant expression of marker genes in PBMC datasets. We also compare the contamination levels between four different scRNA-seq protocols. Overall, DecontX can be incorporated into scRNA-seq workflows to improve downstream analyses.
Journal Article