Catalogue Search | MBRL

DEBrowser: interactive differential expression analysis and visualization tool for count data

by Ozata, Deniz M. , Kucukural, Alper , Garber, Manuel in Animal Genetics and Genomics , Biochemistry , Biomedical and Life Sciences

2019

Background Sequencing data has become a standard measure of diverse cellular activities. For example, gene expression is accurately measured by RNA sequencing (RNA-Seq) libraries, protein-DNA interactions are captured by chromatin immunoprecipitation sequencing (ChIP-Seq), protein-RNA interactions by crosslinking immunoprecipitation sequencing (CLIP-Seq) or RNA immunoprecipitation (RIP-Seq) sequencing, DNA accessibility by assay for transposase-accessible chromatin (ATAC-Seq), DNase or MNase sequencing libraries. The processing of these sequencing techniques involves library-specific approaches. However, in all cases, once the sequencing libraries are processed, the result is a count table specifying the estimated number of reads originating from each genomic locus. Differential analysis to determine which loci have different cellular activity under different conditions starts with the count table and iterates through a cycle of data assessment, preparation and analysis. Such complex analysis often relies on multiple programs and is therefore a challenge for those without programming skills. Results We developed DEBrowser as an R bioconductor project to interactively visualize every step of the differential analysis, without programming. The application provides a rich and interactive web based graphical user interface built on R’s shiny infrastructure. DEBrowser allows users to visualize data with various types of graphs that can be explored further by selecting and re-plotting any desired subset of data. Using the visualization approaches provided, users can determine and correct technical variations such as batch effects and sequencing depth that affect differential analysis. We show DEBrowser’s ease of use by reproducing the analysis of two previously published data sets. Conclusions DEBrowser is a flexible, intuitive, web-based analysis platform that enables an iterative and interactive analysis of count data without any requirement of programming knowledge.

Journal Article

Share this book

Add to My Shelf

VAPOR: A Visualization Package Tailored to Analyze Simulation Data in Earth System Science

by Li, Shaomeng , Orf, Leigh , Pearse, Scott in Algorithms , Communication , Computer simulation

2019

Visualization is an essential tool for analysis of data and communication of findings in the sciences, and the Earth System Sciences (ESS) are no exception. However, within ESS, specialized visualization requirements and data models, particularly for those data arising from numerical models, often make general purpose visualization packages difficult, if not impossible, to use effectively. This paper presents VAPOR: a domain-specific visualization package that targets the specialized needs of ESS modelers, particularly those working in research settings where highly-interactive exploratory visualization is beneficial. We specifically describe VAPOR’s ability to handle ESS simulation data from a wide variety of numerical models, as well as a multi-resolution representation that enables interactive visualization on very large data while using only commodity computing resources. We also describe VAPOR’s visualization capabilities, paying particular attention to features for geo-referenced data and advanced rendering algorithms suitable for time-varying, 3D data. Finally, we illustrate VAPOR’s utility in the study of a numerically- simulated tornado. Our results demonstrate both ease-of-use and the rich capabilities of VAPOR in such a use case.

Journal Article

Share this book

Add to My Shelf

GeneTonic: an R/Bioconductor package for streamlining the interpretation of RNA-seq data

by Marini, Federico , Linke, Jan , Strauch, Konstantin in Algorithms , Base Sequence , Bioinformatics

2021

Background The interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task, where the essential information is distributed among different tabular and list formats—normalized expression values, results from differential expression analysis, and results from functional enrichment analyses. A number of tools and databases are widely used for the purpose of identification of relevant functional patterns, yet often their contextualization within the data and results at hand is not straightforward, especially if these analytic components are not combined together efficiently. Results We developed the GeneTonic software package, which serves as a comprehensive toolkit for streamlining the interpretation of functional enrichment analyses, by fully leveraging the information of expression values in a differential expression context. GeneTonic is implemented in R and Shiny, leveraging packages that enable HTML-based interactive visualizations for executing drilldown tasks seamlessly, viewing the data at a level of increased detail. GeneTonic is integrated with the core classes of existing Bioconductor workflows, and can accept the output of many widely used tools for pathway analysis, making this approach applicable to a wide range of use cases. Users can effectively navigate interlinked components (otherwise available as flat text or spreadsheet tables), bookmark features of interest during the exploration sessions, and obtain at the end a tailored HTML report, thus combining the benefits of both interactivity and reproducibility. Conclusion GeneTonic is distributed as an R package in the Bioconductor project ( https://bioconductor.org/packages/GeneTonic/ ) under the MIT license. Offering both bird’s-eye views of the components of transcriptome data analysis and the detailed inspection of single genes, individual signatures, and their relationships, GeneTonic aims at simplifying the process of interpretation of complex and compelling RNA-seq datasets for many researchers with different expertise profiles.

Journal Article

Share this book

Add to My Shelf

ideal: an R/Bioconductor package for interactive differential expression analysis

by Marini, Federico , Linke, Jan , Binder, Harald in Algorithms , Analysis , Applications programs

2020

Background RNA sequencing (RNA-seq) is an ever increasingly popular tool for transcriptome profiling. A key point to make the best use of the available data is to provide software tools that are easy to use but still provide flexibility and transparency in the adopted methods. Despite the availability of many packages focused on detecting differential expression, a method to streamline this type of bioinformatics analysis in a comprehensive, accessible, and reproducible way is lacking. Results We developed the ideal software package, which serves as a web application for interactive and reproducible RNA-seq analysis, while producing a wealth of visualizations to facilitate data interpretation. ideal is implemented in R using the Shiny framework, and is fully integrated with the existing core structures of the Bioconductor project. Users can perform the essential steps of the differential expression analysis workflow in an assisted way, and generate a broad spectrum of publication-ready outputs, including diagnostic and summary visualizations in each module, all the way down to functional analysis. ideal also offers the possibility to seamlessly generate a full HTML report for storing and sharing results together with code for reproducibility. Conclusion ideal is distributed as an R package in the Bioconductor project ( http://bioconductor.org/packages/ideal/ ), and provides a solution for performing interactive and reproducible analyses of summarized RNA-seq expression data, empowering researchers with many different profiles (life scientists, clinicians, but also experienced bioinformaticians) to make the ideal use of the data at hand.

Journal Article

Share this book

Add to My Shelf

SeqExpressionAnalyser: An R Package for Automated End-to-End RNA-Seq Analysis From Reads to Differential Expression

by Radouani, Fouzia , Maamar, Jaouhara , Lamzouri, Afaf in Method and Protocol

2026

Differential gene expression analysis of RNA Sequencing (RNA-Seq) data is crucial for understanding key patterns of gene regulation and enhancing our knowledge of biological processes and diseases. The workflow of this analysis comprises quality control, filtering of low-quality data, alignment, read counting, and final differential analysis. In this case, users often need to manually combine several tools and write multiple scripts to cover the entire pipeline. This fragmented approach is time-consuming and not user-friendly, especially for non-expert users. There is a need for an integrated, automated and accessible solution that unifies the entire analysis process within a single, easy-to-use platform. To address this need, we developed SeqExpressionAnalyser, an R package that provides a web application for interactive differential gene-expression analysis of RNA-seq data, making it accessible to R users for the first time. Built on the Shiny framework, SeqExpressionAnalyser enables users to read FASTQ files and perform analyses, including quality control, filtering, alignment, read counting, and differential expression analysis. The tool generates multiple outputs, including data tables, an HTML report and visualisations. The source code is available on GitHub ( https://github.com/sanaeesskhayry/SeqExpressionAnalyser ) and is licensed under the GPLv3 license. Also available as a Docker image at https://hub.docker.com/repository/docker/biomix/seq-expression-analyser/general .

Journal Article

Share this book

Add to My Shelf

oposSOM-Browser: an interactive tool to explore omics data landscapes in health science

by Wagner, Jonas , Loeffler-Wirth, Henry , Hakobyan, Siras in Algorithms , B cells , Bioinformatics

2020

Background oposSOM is a comprehensive, machine learning based open-source data analysis software combining functionalities such as diversity analyses, biomarker selection, function mining, and visualization. Results These functionalities are now available as interactive web-browser application for a broader user audience interested in extracting detailed information from high-throughput omics data sets pre-processed by oposSOM. It enables interactive browsing of single-gene and gene set profiles, of molecular ‘portrait landscapes’, of associated phenotype diversity, and signalling pathway activation patterns. Conclusion The oposSOM-Browser makes available interactive data browsing for five transcriptome data sets of cancer (melanomas, B-cell lymphomas, gliomas) and of peripheral blood (sepsis and healthy individuals) at www.izbi.uni-leipzig.de/opossom-browser .

Journal Article

Share this book

Add to My Shelf

Immunopeptidomics toolkit library (IPTK): a python-based modular toolbox for analyzing immunopeptidomics data

by Degenhardt, Frauke , Bacher, Petra , Wendorff, Mareike in Adaptive systems , Algorithms , Analysis

2021

Background The human leukocyte antigen (HLA) proteins play a fundamental role in the adaptive immune system as they present peptides to T cells. Mass-spectrometry-based immunopeptidomics is a promising and powerful tool for characterizing the immunopeptidomic landscape of HLA proteins, that is the peptides presented on HLA proteins. Despite the growing interest in the technology, and the recent rise of immunopeptidomics-specific identification pipelines, there is still a gap in data-analysis and software tools that are specialized in analyzing and visualizing immunopeptidomics data. Results We present the IPTK library which is an open-source Python-based library for analyzing, visualizing, comparing, and integrating different omics layers with the identified peptides for an in-depth characterization of the immunopeptidome. Using different datasets, we illustrate the ability of the library to enrich the result of the identified peptidomes. Also, we demonstrate the utility of the library in developing other software and tools by developing an easy-to-use dashboard that can be used for the interactive analysis of the results. Conclusion IPTK provides a modular and extendable framework for analyzing and integrating immunopeptidomes with different omics layers. The library is deployed into PyPI at https://pypi.org/project/IPTKL/ and into Bioconda at https://anaconda.org/bioconda/iptkl , while the source code of the library and the dashboard, along with the online tutorials are available at https://github.com/ikmb/iptoolkit .

Journal Article

Share this book

Add to My Shelf

Visualization of Traditional Chinese Medicine Formulas: Development and Usability Study

by Peng, Suyuan , Wu, Zhiyue , Zhou, Liang in Data visualization , Design , Herbal medicine

2023

Traditional Chinese medicine (TCM) formulas are combinations of Chinese herbal medicines. Knowledge of classic medicine formulas is the basis of TCM diagnosis and treatment and is the core of TCM inheritance. The large number and flexibility of medicine formulas make memorization difficult, and understanding their composition rules is even more difficult. The multifaceted and multidimensional properties of herbal medicines are important for understanding the formula; however, these are usually separated from the formula information. Furthermore, these data are presented as text and cannot be analyzed jointly and interactively. We aimed to devise a visualization method for TCM formulas that shows the composition of medicine formulas and the multidimensional properties of herbal medicines involved and supports the comparison of medicine formulas. A TCM formula visualization method with multiple linked views is proposed and implemented as a web-based tool after close collaboration between visualization and TCM experts. The composition of medicine formulas is visualized in a formula view with a similarity-based layout supporting the comparison of compositing herbs; a shared herb view complements the formula view by showing all overlaps of pair-wise formulas; and a dimensionality-reduction plot of herbs enables the visualization of multidimensional herb properties. The usefulness of the tool was evaluated through a usability study with TCM experts. Our method was applied to 2 typical categories of medicine formulas, namely tonic formulas and heat-clearing formulas, which contain 20 and 26 formulas composed of 58 and 73 herbal medicines, respectively. Each herbal medicine has a 23-dimensional characterizing attribute. In the usability study, TCM experts explored the 2 data sets with our web-based tool and quickly gained insight into formulas and herbs of interest, as well as the overall features of the formula groups that are difficult to identify with the traditional text-based method. Moreover, feedback from the experts indicated the usefulness of the proposed method. Our TCM formula visualization method is able to visualize and compare complex medicine formulas and the multidimensional attributes of herbal medicines using a web-based tool. TCM experts gained insights into 2 typical medicine formula categories using our method. Overall, the new method is a promising first step toward new TCM formula education and analysis methodologies.

Journal Article

Share this book

Add to My Shelf

GeoBrick: exploration of spatiotemporal data

by Nadeem, Saad , Park, Ji Hwan , Kaufman, Arie in Artificial Intelligence , Computer Graphics , Computer Science

2019

We present GeoBrick, an interactive technique for exploring spatiotemporal data. In GeoBrick, each region is comprised of multivariate data, which is encoded into simple shapes with colors. Additionally, users can adjust the resolution of data values to get an overview as well as details of the data. GeoBrick allows users to (1) juxtapose data and spatial profiles of discontiguous regions, (2) identify temporal patterns of user-defined classes of regions, and (3) comparatively evaluate across distinct configurations of regions. We demonstrate the effectiveness and efficacy of GeoBrick using two case studies.

Journal Article

Share this book

Add to My Shelf

ProSecCo: progressive sequence mining with convergence guarantees

by Servan-Schreiber Sacha , Riondato Matteo , Zgraggen Emanuel in Algorithms , Approximation , Collection

2020

We present ProSecCo, an algorithm for the progressive mining of frequent sequences from large transactional datasets: It processes the dataset in blocks and it outputs, after having analyzed each block, a high-quality approximation of the collection of frequent sequences. ProSecCo can be used for interactive data exploration, as the intermediate results enable the user to make informed decisions as the computation proceeds. These intermediate results have strong probabilistic approximation guarantees and the final output is the exact collection of frequent sequences. Our correctness analysis uses the Vapnik–Chervonenkis (VC) dimension, a key concept from statistical learning theory. The results of our experimental evaluation of ProSecCo on real and artificial datasets show that it produces fast-converging high-quality results almost immediately. Its practical performance is even better than what is guaranteed by the theoretical analysis, and ProSecCo can even be faster than existing state-of-the-art non-progressive algorithms. Additionally, our experimental results show that ProSecCo uses a constant amount of memory, and orders of magnitude less than other standard, non-progressive, sequential pattern mining algorithms.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter