Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
7,382
result(s) for
"gene set analysis"
Sort by:
Gene set enrichment analysis for genome-wide DNA methylation data
by
Maksimovic, Jovana
,
Phipson, Belinda
,
Oshlack, Alicia
in
Animal Genetics and Genomics
,
Annotations
,
Arrays
2021
DNA methylation is one of the most commonly studied epigenetic marks, due to its role in disease and development. Illumina methylation arrays have been extensively used to measure methylation across the human genome. Methylation array analysis has primarily focused on preprocessing, normalization, and identification of differentially methylated CpGs and regions. GOmeth and GOregion are new methods for performing unbiased gene set testing following differential methylation analysis. Benchmarking analyses demonstrate GOmeth outperforms other approaches, and GOregion is the first method for gene set testing of differentially methylated regions. Both methods are publicly available in the
missMethyl
Bioconductor R package.
Journal Article
Functional Enrichment Analysis of Regulatory Elements
by
López-Domínguez, Raul
,
Villatoro-García, Juan Antonio
,
Garcia-Moreno, Adrian
in
Algorithms
,
Annotations
,
Arrhythmia
2022
Statistical methods for enrichment analysis are important tools to extract biological information from omics experiments. Although these methods have been widely used for the analysis of gene and protein lists, the development of high-throughput technologies for regulatory elements demands dedicated statistical and bioinformatics tools. Here, we present a set of enrichment analysis methods for regulatory elements, including CpG sites, miRNAs, and transcription factors. Statistical significance is determined via a power weighting function for target genes and tested by the Wallenius noncentral hypergeometric distribution model to avoid selection bias. These new methodologies have been applied to the analysis of a set of miRNAs associated with arrhythmia, showing the potential of this tool to extract biological information from a list of regulatory elements. These new methods are available in GeneCodis 4, a web tool able to perform singular and modular enrichment analysis that allows the integration of heterogeneous information.
Journal Article
Pan-Cancer Analysis of Immune Cell Infiltration Identifies a Prognostic Immune-Cell Characteristic Score (ICCS) in Lung Adenocarcinoma
by
Dong, Jie
,
Wang, Shiqun
,
Zuo, Shuguang
in
Adenocarcinoma
,
Adenocarcinoma of Lung - genetics
,
Adenocarcinoma of Lung - immunology
2020
The tumor microenvironment (TME) consists of heterogeneous cell populations, including malignant cells and nonmalignant cells that support tumor proliferation, invasion, and metastasis through extensive cross talk. The intra-tumor immune landscape is a critical factor influencing patient survival and response to immunotherapy.
Gene expression data were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus databases. Immune cell infiltration was determined by single-sample Gene Set Enrichment Analysis (ssGSEA) depending on the integrated immune gene sets from published studies. Univariate analysis was used to determine the prognostic value of the infiltrated immune cells. Least absolute shrinkage and selection operator (LASSO) regression was performed to screen for the most survival-relevant immune cells. An immune-cell characteristic score (ICCS) model was constructed by using multivariate Cox regression analysis.
The immune cell infiltration patterns across 32 cancer types were identified, and patients in the high immune cell infiltration cluster had worse overall survival (OS) but better progression-free interval (PFI) compared to the low immune cell infiltration cluster. However, immune cell infiltration showed inconsistent prognostic value depending on the cancer type. High immune cell infiltration (High CI) indicated a worse prognosis in brain lower grade glioma (LGG), glioblastoma multiforme (GBM), and uveal melanoma (UVM), and favorable prognosis in adrenocortical carcinoma (ACC), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), cholangiocarcinoma (CHOL), head and neck squamous cell carcinoma (HNSC), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), sarcoma (SARC), and skin cutaneous melanoma (SKCM). LUAD prognosis was significantly influenced by the infiltration of 13 immune cell types, with high infiltration of all but Type 2 T helper (Th2) cells correlating with a favorable prognosis. The ICCS model based on six most survival-relevant immune cell populations was generated that classified patients into low- and high-ICCS groups with good and poor prognoses, respectively. The multivariate and stratified analyses further revealed that the ICCS was an independent prognostic factor for LUAD.
The infiltration of immune cells in 32 cancer types was quantified, and considerable heterogeneity was observed in the prognostic relevance of these cells in different cancer types. An ICCS model was constructed for LUAD with competent prognostic performance, which can further deepen our understanding of the TME of LUAD and can have implications for immunotherapy.
Journal Article
GeneSetCluster 2.0: a comprehensive toolset for summarizing and integrating gene-sets analysis
by
Pahlevan Kakhki, Majid
,
López-Pérez, Ana Rosa
,
Maillo, Alberto
in
Accessibility
,
Algorithms
,
Analysis
2025
Background
Gene-Set Analysis (GSA) is commonly used to analyze high-throughput experiments. However, GSA cannot readily disentangle clusters or pathways due to redundancies in upstream knowledge bases, which hinders comprehensive exploration and interpretation of biological findings. To address this challenge, we developed GeneSetCluster, an R package designed to summarize and integrate GSA results. Over time, we and users as well identified limitations in the original version, such as difficulties in managing redundancies across multiple gene-sets, large computational times, and its lack of accessibility for users without programming expertise.
Results
We present GeneSetCluster 2.0, a comprehensive upgrade that delivers methodological, computational, interpretative, and user-experience enhancements. Methodologically, GeneSetCluster 2.0 introduces a novel approach to address duplicated gene-sets and implements a seriation-based clustering algorithm that reorders results, aiding pattern identification. Computationally, the package is optimized for parallel processing, significantly reducing execution time. GeneSetCluster 2.0 enhances cluster annotations by associating clusters with relevant tissues and biological processes to improve biological interpretation, particularly for human and mouse data. To broaden accessibility, we have developed a user-friendly web application enabling non-programmers to use it. This version also ensures seamless integration between the R package, catering to users with programming expertise, and the web application for broader audiences. We evaluated the updates in a single-cell RNA public dataset.
Conclusion
GeneSetCluster 2.0 offers substantial improvements over its predecessor. Furthermore, by bridging the gap between bioinformaticians and clinicians in multidisciplinary teams, GeneSetCluster 2.0 facilitates collaborative research. The R package and web application, along with detailed installation and usage guides, are available on GitHub (
https://github.com/TranslationalBioinformaticsUnit/GeneSetCluster2.0
), and the web application can be accessed at
https://translationalbio.shinyapps.io/genesetcluster/
.
Journal Article
A Novel Immune-Related Gene Signature to Identify the Tumor Microenvironment and Prognose Disease Among Patients With Oral Squamous Cell Carcinoma Patients Using ssGSEA: A Bioinformatics and Biological Validation Study
2022
Oral squamous cell carcinoma (OSCC) is the most invasive oral malignancy in adults and is associated with a poor prognosis. Accurate prognostic models are urgently needed, however, knowledge of the probable mechanisms behind OSCC tumorigenesis and prognosis remain limited. The clinical importance of the interplay between the immune system and tumor microenvironment has become increasingly evident. This study explored immune-related alterations at the multi-omics level to extract accurate prognostic markers linked to the immune response and presents a more accurate landscape of the immune genomic map during OSCC. The Cancer Genome Atlas (TCGA) OSCC cohort (n = 329) was used to detect the immune infiltration pattern of OSCC and categorize patients into two immunity groups using single-sample gene set enrichment analysis (ssGSEA) and hierarchical clustering analysis. Multiple strategies, including lasso regression (LASSO), Cox proportional hazards regression, and principal component analysis (PCA) were used to screen clinically significant signatures and identify an incorporated prognosis model with robust discriminative power on the survival status of both the training and testing set. We identified two OSCC subtypes based on immunological characteristics: Immunity-high and immunity low, and verified that the categorization was accurate and repeatable. Immunity_ high cluster with a higher immunological and stromal score. 1047 differential genes (DEGs) integrate with immune genes to obtain 319 immue-related DEGs. A robust model with five signatures for OSCC patient prognosis was established. The GEO cohort (n = 97) were used to validate the risk model’s predictive value. The low-risk group had a better overall survival (OS) than the high-risk group. Significant prognostic potential for OSCC patients was found using ROC analysis and immune checkpoint gene expression was lower in the low-risk group. We also investigated at the therapeutic sensitivity of a number of frequently used chemotherapeutic drugs in patients with various risk factors. The underlying biological behavior of the OSCC cell line was preliminarily validated. This study characterizes a reliable marker of OSCC disease progression and provides a new potential target for immunotherapy against this disease.
Journal Article
Roastgsa: a comparison of rotation-based scores for gene set enrichment analysis
by
Stephan-Otto Attolini, Camille
,
Berenguer-Llergo, Antoni
,
Caballé-Mestres, Adrià
in
Algorithms
,
Bioinformatics
,
Biomedical and Life Sciences
2023
Background
Gene-wise differential expression is usually the first major step in the statistical analysis of high-throughput data obtained from techniques such as microarrays or RNA-sequencing. The analysis at gene level is often complemented by interrogating the data in a broader biological context that considers as unit of measure groups of genes that may have a common function or biological trait. Among the vast number of publications about gene set analysis (GSA), the rotation test for gene set analysis, also referred to as roast, is a general sample randomization approach that maintains the integrity of the intra-gene set correlation structure in defining the null distribution of the test.
Results
We present
roastgsa
, an R package that contains several enrichment score functions that feed the roast algorithm for hypothesis testing. These implemented methods are evaluated using both simulated and benchmarking data in microarray and RNA-seq datasets. We find that computationally intensive measures based on Kolmogorov-Smirnov (KS) statistics fail to improve the rates of simpler measures of GSA like mean and maxmean scores. We also show the importance of accounting for the gene linear dependence structure of the testing set, which is linked to the loss of effective signature size. Complete graphical representation of the results, including an approximation for the effective signature size, can be obtained as part of the
roastgsa
output.
Conclusions
We encourage the usage of the absmean (non-directional), mean (directional) and maxmean (directional) scores for roast GSA analysis as these are simple measures of enrichment that have presented dominant results in all provided analyses in comparison to the more complex KS measures.
Journal Article
vissE: a versatile tool to identify and visualise higher-order molecular phenotypes from functional enrichment analysis
by
Liu, Ning
,
Papachristos, Nicholas
,
Whitfield, Holly J.
in
Adaptability
,
Algorithms
,
Bioinformatics
2024
Functional analysis of high throughput experiments using pathway analysis is now ubiquitous. Though powerful, these methods often produce thousands of redundant results owing to knowledgebase redundancies upstream. This scale of results hinders extensive exploration by biologists and can lead to investigator biases due to previous knowledge and expectations. To address this issue, we present vissE, a flexible network-based analysis and visualisation tool that organises information into semantic categories and provides various visualisation modules to characterise them with respect to the underlying data, thus providing a comprehensive view of the biological system. We demonstrate vissE’s versatility by applying it to three different technologies: bulk, single-cell and spatial transcriptomics. Applying vissE to a factor analysis of a breast cancer spatial transcriptomic data, we identified stromal phenotypes that support tumour dissemination. Its adaptability allows vissE to enhance all existing gene-set enrichment and pathway analysis workflows, empowering biologists during molecular discovery.
Journal Article
Identification of molecular targets and small drug candidates for Huntington's disease via bioinformatics and a network‐based screening approach
by
Hossain, Md Ridoy
,
Tareq, Md. Mohaimenul Islam
,
Zilani, Md. Nazmul Hasan
in
Bioinformatics
,
Biomarkers
,
Biomarkers - metabolism
2024
Huntington's disease (HD) is a gradually severe neurodegenerative ailment characterised by an increase of a specific trinucleotide repeat sequence (cytosine–adenine–guanine, CAG). It is passed down as a dominant characteristic that worsens over time, creating a significant risk. Despite being monogenetic, the underlying mechanisms as well as biomarkers remain poorly understood. Furthermore, early detection of HD is challenging, and the available diagnostic procedures have low precision and accuracy. The research was conducted to provide knowledge of the biomarkers, pathways and therapeutic targets involved in the molecular processes of HD using informatic based analysis and applying network‐based systems biology approaches. The gene expression profile datasets GSE97100 and GSE74201 relevant to HD were studied. As a consequence, 46 differentially expressed genes (DEGs) were identified. 10 hub genes (TPM1, EIF2S3, CCN2, ACTN1, ACTG2, CCN1, CSRP1, EIF1AX, BEX2 and TCEAL5) were further differentiated in the protein–protein interaction (PPI) network. These hub genes were typically down‐regulated. Additionally, DEGs‐transcription factors (TFs) connections (e.g. GATA2, YY1 and FOXC1), DEG‐microRNA (miRNA) interactions (e.g. hsa‐miR‐124‐3p and has‐miR‐26b‐5p) were also comprehensively forecast. Additionally, related gene ontology concepts (e.g. sequence‐specific DNA binding and TF activity) connected to DEGs in HD were identified using gene set enrichment analysis (GSEA). Finally, in silico drug design was employed to find candidate drugs for the treatment HD, and while the possible modest therapeutic compounds (e.g. cortistatin A, 13,16‐Epoxy‐25‐hydroxy‐17‐cheilanthen‐19,25‐olide, Hecogenin) against HD were expected. Consequently, the results from this study may give researchers useful resources for the experimental validation of Huntington's diagnosis and therapeutic approaches.
Journal Article
Identification of Regeneration and Hub Genes and Pathways at Different Time Points after Spinal Cord Injury
by
Wang, An-quan
,
Zhang, Hui
,
Yin, Zong-Sheng
in
Animals
,
Biomedical and Life Sciences
,
Biomedicine
2021
Spinal cord injury (SCI) is a neurological injury that can cause neuronal loss around the lesion site and leads to locomotive and sensory deficits. However, the underlying molecular mechanisms remain unclear. This study aimed to verify differential gene time-course expression in SCI and provide new insights for gene-level studies. We downloaded two rat expression profiles (GSE464 and GSE45006) from the Gene Expression Omnibus database, including 1 day, 3 days, 7 days, and 14 days post-SCI, along with thoracic spinal cord data for analysis. At each time point, gene integration was performed using “batch normalization.” The raw data were standardized, and differentially expressed genes at the different time points versus the control were analyzed by Gene Ontology enrichment analysis, the Kyoto Encyclopedia of Genes and Genomes pathway analysis, and gene set enrichment analysis. A protein-protein interaction network was then built and visualized. In addition, ten hub genes were identified at each time point. Among them,
Gnb5
,
Gng8
,
Agt
,
Gnai1
, and
Psap
lack correlation studies in SCI and deserve further investigation. Finally, we screened and analyzed genes for tissue repair, reconstruction, and regeneration and found that
Anxa1
,
Snap25
, and
Spp1
were closely related to repair and regeneration after SCI. In conclusion, hub genes, signaling pathways, and regeneration genes involved in secondary SCI were identified in our study. These results may be useful for understanding SCI-related biological processes and the development of targeted intervention strategies.
Journal Article
Screening of hub genes for sepsis-induced myopathy by weighted gene co-expression network analysis and protein-protein interaction network construction
2024
Sepsis-induced myopathy is one of the serious complications of sepsis, which severely affects the respiratory and peripheral motor systems of patients, reduces their quality of life, and jeopardizes their lives, as evidenced by muscle atrophy, loss of strength, and impaired regeneration after injury. The pathogenesis of sepsis-induced myopathy is complex, mainly including cytokine action, enhances free radical production in muscle, increases muscle protein hydrolysis, and decreases skeletal muscle protein synthesis, etc. The above mechanisms have been demonstrated in existing studies. However, it is still unclear how the overall pattern of gene co-expression affects the pathological process of sepsis-induced myopathy. Therefore, we intend to identify hub genes and signaling pathways. Weighted gene co-expression network analysis was our main approach to study gene expression profiles: skeletal muscle transcriptome in ICU patients with sepsis-induced multi-organ failure (GSE13205). After data pre-processing, about 15,181 genes were used to identify 13 co-expression modules. Then, 16 genes (FEM1B, KLHDC3, GPX3, NIFK, GNL2, EBNA1BP2, PES1, FBP2, PFKP, BYSL, HEATR1, WDR75, TBL3, and WDR43) were selected as the hub genes including 3 up-regulated genes and 13 down-regulated genes. Then, Gene Set Enrichment Analysis was performed to show that the hub genes were closely associated with skeletal muscle dysfunction, necrotic and apoptotic skeletal myoblasts, and apoptosis in sepsis-induced myopathy. Overall, 16 candidate biomarkers were certified as reliable features for more in-depth exploration of sepsis-induced myopathy in basic and clinical studies.
Journal Article