Catalogue Search | MBRL

Mining Thermophile Photosynthesis Genes: A Synthetic Operon Expressing Chloroflexota Species Reaction Center Genes in Rhodobacter sphaeroides

by Blaby-Haas, Crysten E. , Blaby, Ian K. , Kim, Younghoon in Analysis , anoxygenic , Bacteria

2025

Photosynthesis is the foundation of the vast majority of life systems, and is therefore the most important bioenergetic process on earth. The greatest diversity of photosynthetic systems is found in microorganisms. However, our understanding of the biophysical and biochemical processes that transduce light into chemical energy is derived from a relatively small subset of proteins from microbes that are amenable to cultivation, in contrast to the huge number of predicted proteins that catalyze the initial photochemical reactions deposited in databases, such as from metagenomics. We describe the use of a Rhodobacter sphaeroides laboratory strain for the expression of heterologous photosynthesis genes to demonstrate the feasibility of mining this resource, focusing on hot spring Chloroflexota gene sequences. Using a synthetic operon of genes, we produced a photochemically active complex of reaction center proteins in our biological system. We also present bioinformatic analyses of anoxygenic type II reaction center sequences from metagenomic samples collected from hot (42–90 °C) springs available through the JGI IMG database, to generate a resource of diverse sequences that are potentially adapted to photosynthesis at such temperatures. These data provide a view into the natural diversity of anoxygenic photosynthesis, through a lens focused on high-temperature environments. The approach we took to express such genes can be applied for potential biotechnology purposes as well as for studies of fundamental catalytic properties of these heretofore inaccessible protein complexes.

Journal Article

High-Throughput Sequencing-Based Immune Repertoire Study during Infectious Disease

by Chen, Cuicui , Song, Yuanlin , Chen, Shujing in Adaptive immunity , Algorithms , Antibodies

2016

The selectivity of the adaptive immune response is based on the enormous diversity of T and B cell antigen-specific receptors. The immune repertoire, the collection of T and B cells with functional diversity in the circulatory system at any given time, is dynamic and reflects the essence of immune selectivity. In this article, we review the recent advances in immune repertoire study of infectious diseases, which were achieved by traditional techniques and high-throughput sequencing (HTS) techniques. HTS techniques enable the determination of complementary regions of lymphocyte receptors with unprecedented efficiency and scale. This progress in methodology enhances the understanding of immunologic changes during pathogen challenge and also provides a basis for further development of novel diagnostic markers, immunotherapies, and vaccines.

Journal Article

Repurposing haloperidol for the treatment of rheumatoid arthritis: an integrative approach using data mining techniques

by Yokoyama, Satoshi , Takada, Mitsutaka , Hosomi, Kouichi in Bioinformatics , Data mining , Drugs

2021

Introduction: Treatment of rheumatoid arthritis (RA) has advanced with the introduction of biological disease-modifying antirheumatic drugs. However, more than 20% of patients with RA still have moderate or severe disease activity. Hence, novel antirheumatic drugs are required. Recently, drug repurposing, a process of identifying new indications for existing drugs, has received great attention. Furthermore, a few reports have shown that antipsychotics are capable of affecting several cytokines that are also modulated by existing antirheumatic drugs. Therefore, we investigated the association between antipsychotics and RA by data mining using real-world data and bioinformatics databases. Methods: Disproportionality and sequence symmetry analyses were employed to identify the associations between the investigational drugs and RA using the US Food and Drug Administration Adverse Event Reporting System (2004–2016) and JMDC administrative claims database (January 2005–April 2017; JMDC Inc., Tokyo, Japan), respectively. The reporting odds ratio (ROR) and information component (IC) were used in the disproportionality analysis to indicate a signal. The adjusted sequence ratio (SR) was used in the sequence symmetry analysis to indicate a signal. The bioinformatics analysis suite, BaseSpace Correlation Engine (Illumina, CA, USA) was employed to explore the molecular mechanisms associated with the potential candidates identified by the drug-repurposing approach. Results: A potential inverse association between the antipsychotic haloperidol and RA, which exhibited significant inverse signals with ROR, IC, and adjusted SR, was found. Furthermore, the results suggested that haloperidol may exert antirheumatic effects by modulating various signaling pathways, including cytokine and chemokine signaling, major histocompatibility complex class-II antigen presentation, and Toll-like receptor cascade pathways. 
Conclusion: Our drug-repurposing approach using data mining techniques identified haloperidol as a potential antirheumatic drug candidate.
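The reporting odds ratio named in the Methods can be illustrated with a minimal sketch. It assumes the standard 2x2 contingency-table definition of the ROR with a log-normal 95% confidence interval; the function name and the report counts are hypothetical and are not taken from the study.

```python
import math


def reporting_odds_ratio(a, b, c, d):
    """Return (ROR, lower 95% CI, upper 95% CI) for a 2x2 report table.

    a: reports mentioning the drug of interest and the event of interest
    b: reports mentioning the drug of interest without the event
    c: reports mentioning other drugs with the event
    d: reports mentioning other drugs without the event
    All four counts must be positive.
    """
    ror = (a * d) / (b * c)
    # Standard error of ln(ROR) under the usual log-normal approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(ror) - 1.96 * se)
    upper = math.exp(math.log(ror) + 1.96 * se)
    return ror, lower, upper


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    ror, lower, upper = reporting_odds_ratio(10, 990, 200, 8800)
    print(f"ROR = {ror:.3f}, 95% CI = ({lower:.3f}, {upper:.3f})")
```

Under this convention, an upper confidence bound below 1 is commonly read as an inverse signal, which is the kind of association the abstract reports for haloperidol.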

Journal Article

In silico discovery of novel transcription factors regulated by mTOR-pathway activities

by Agnieszka Jablonska , Natalia Polouliakh in Amino acids , Autophagy , bioinformatic tools and databases

2014

The mammalian target of rapamycin (mTOR) pathway is a key regulator of cellular growth, development, and ageing, and unraveling its control is essential for understanding the life and death of biological organisms. A motif-discovery workbench including nine tools was used to identify transcription factors involved in five basic (Insulin, MAPK, VEGF, Hypoxia, and mTOR core) activities of the mTOR pathway. Discovered transcription factors are classified as "process-specific" or "pathway-ubiquitous", with highlights on their regulating/regulated activities within the mTOR pathway. Our transcription regulation results will facilitate further research on the control mechanism of the mTOR pathway.

Journal Article

Evaluation and comparison of bioinformatic tools for the enrichment analysis of metabolomics data

by Andrés Lacueva, Ma. Cristina , Marco Ramell, Anna , Alay, Ania in Algorithms , Bioinformatic tools , Bioinformatics

2018

Background: Bioinformatic tools for the enrichment of 'omics' datasets facilitate interpretation and understanding of data. To date few are suitable for metabolomics datasets. The main objective of this work is to give a critical overview, for the first time, of the performance of these tools. To that aim, datasets from metabolomic repositories were selected and enriched data were created. Both types of data were analysed with these tools and outputs were thoroughly examined. Results: An exploratory multivariate analysis of the most used tools for the enrichment of metabolite sets, based on a non-metric multidimensional scaling (NMDS) of Jaccard's distances, was performed and mirrored their diversity. Codes (identifiers) of the metabolites of the datasets were searched in different metabolite databases (HMDB, KEGG, PubChem, ChEBI, BioCyc/HumanCyc, LipidMAPS, ChemSpider, METLIN and Recon2). The databases that presented more identifiers of the metabolites of the dataset were PubChem, followed by METLIN and ChEBI. However, these databases had duplicated entries and might present false positives. The performance of over-representation analysis (ORA) tools, including BioCyc/HumanCyc, ConsensusPathDB, IMPaLA, MBRole, MetaboAnalyst, Metabox, MetExplore, MPEA, PathVisio and Reactome and the mapping tool KEGGREST, was examined. Results were mostly consistent among tools and between real and enriched data despite the variability of the tools. Nevertheless, a few controversial results such as differences in the total number of metabolites were also found. Disease-based enrichment analyses were also assessed, but they were not found to be accurate probably due to the fact that metabolite disease sets are not up-to-date and the difficulty of predicting diseases from a list of metabolites. 
Conclusions: We have extensively reviewed the state-of-the-art of the available range of tools for metabolomic datasets, the completeness of metabolite databases, the performance of ORA methods and disease-based analyses. Despite the variability of the tools, they provided consistent results independent of their analytic approach. However, more work on the completeness of metabolite and pathway databases is required, which strongly affects the accuracy of enrichment analyses. Improvements will be translated into more accurate and global insights of the metabolome.
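Over-representation analysis (ORA), as evaluated above, is commonly based on a one-sided hypergeometric test. The sketch below assumes that formulation; it is not the implementation of any of the listed tools, and the function name and the example numbers are hypothetical.

```python
from math import comb


def ora_p_value(background_size, set_size, query_size, overlap):
    """One-sided hypergeometric p-value for over-representation.

    background_size: number of metabolites in the background
    set_size:        background metabolites annotated to the pathway
    query_size:      metabolites in the query list
    overlap:         query metabolites annotated to the pathway
    Returns P(X >= overlap) when query_size metabolites are drawn
    from the background without replacement.
    """
    total = comb(background_size, query_size)
    max_overlap = min(set_size, query_size)
    favourable = sum(
        comb(set_size, i) * comb(background_size - set_size, query_size - i)
        for i in range(overlap, max_overlap + 1)
    )
    return favourable / total


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    print(ora_p_value(background_size=1000, set_size=40, query_size=50, overlap=8))
```

Because the p-value depends directly on the background size and on which metabolites are annotated to each set, incomplete databases shift the result, which is consistent with the concern about database completeness raised in the Conclusions.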

Journal Article

Bioinformatics analyses of differentially expressed genes associated with spinal cord injury: A microarray-based analysis in a mouse model

by Guo, Lei , Lv, Jing , Hao, Ding-Jun in Biochemistry , Bioinformatics , Cellular signal transduction

2019

Gene spectrum analysis has shown that gene expression and signaling pathways change dramatically after spinal cord injury, which may affect the microenvironment of the damaged site. Microarray analysis provides a new opportunity for investigating diagnosis, treatment, and prognosis of spinal cord injury. However, differentially expressed genes are not consistent among studies, and many key genes and signaling pathways have not yet been accurately studied. GSE5296 was retrieved from the Gene Expression Omnibus DataSet. Differentially expressed genes were obtained using R/Bioconductor software (expression changed at least two-fold; P < 0.05). Database for Annotation, Visualization and Integrated Discovery was used for functional annotation of differentially expressed genes and Animal Transcription Factor Database for predicting potential transcription factors. The resulting transcription regulatory protein interaction network was mapped to screen representative genes and investigate their diagnostic and therapeutic value for disease. In total, this study identified 109 genes that were upregulated and 30 that were downregulated at 0.5, 4, and 24 hours, and 3, 7, and 28 days after spinal cord injury. The number of downregulated genes was smaller than the number of upregulated genes at each time point. Database for Annotation, Visualization and Integrated Discovery analysis found that many inflammation-related pathways were upregulated in injured spinal cord. Additionally, expression levels of these inflammation-related genes were maintained for at least 28 days. Moreover, 399 regulation modes and 77 nodes were shown in the protein-protein interaction network of upregulated differentially expressed genes. Among the 10 upregulated differentially expressed genes with the highest degrees of distribution, six genes were transcription factors. Among these transcription factors, ATF3 showed the greatest change. ATF3 was upregulated within 30 minutes, and its expression levels remained high at 28 days after spinal cord injury. These key genes screened by bioinformatics tools can be used as biological markers to diagnose diseases and provide a reference for identifying therapeutic targets.

Journal Article

New Challenges for Biological Text-Mining in the Next Decade

by Hong-Jie Dai , Yen-Ching Chang , Richard Tzong-Han Tsai , Wen-Lian Hsu in Academic publications , Algorithms , Artificial Intelligence

2010

The massive flow of scholarly publications from traditional paper journals to online outlets has benefited biologists because of the ease of access it offers. However, due to the sheer volume of available biological literature, researchers are finding it increasingly difficult to locate needed information. As a result, recent biology contests, notably JNLPBA and BioCreAtIvE, have focused on evaluating various methods by which the literature may be navigated. Among these methods, text-mining technology has shown the most promise. With recent advances in text-mining technology and the fact that publishers are now making the full texts of articles available in XML format, text-mining systems (TMSs) can be adapted to accelerate literature curation, maintain the integrity of information, and ensure proper linkage of data to other resources. Even so, several new challenges have emerged in relation to full text analysis, life-science terminology, complex relation extraction, and information fusion. These challenges must be overcome in order for text-mining to be more effective. In this paper, we identify the challenges, discuss how they might be overcome, and consider the resources that may be helpful in achieving that goal.

Journal Article

OysterDB: A Genome Database for Ostreidae

by Li, Xinchun , Bai, Yitian , Li, Qi in Annotations , Biotechnology , Content management software

2024

The molluscan family Ostreidae, commonly known as oysters, is an important molluscan group due to its economic and ecological importance. In recent years, an abundance of genomic data of Ostreidae species has been generated and made available in the public domain. However, there is still a lack of a high-efficiency database platform to store and distribute these data with comprehensive tools. In this study, we developed an oyster genome database (OysterDB) to consolidate oyster genomic data. This database includes eight oyster genomes and 208,923 protein-coding gene annotations. Bioinformatic tools, such as BLAST and JBrowse, are integrated into the database to provide a user-friendly platform for homologous sequence searching, visualization of genomes, and screening for candidate gene information. Moreover, OysterDB will be continuously updated with ever-growing oyster genomic resources and facilitate future studies for comparative and functional genomic analysis of oysters (http://oysterdb.com.cn/).

Journal Article

circVAR database: genome-wide archive of genetic variants for human circular RNAs

by Zhao, Min , Qu, Hong in Animal Genetics and Genomics , Annotations , Binding

2020

Background: Circular RNAs (circRNAs) play important roles in regulating gene expression through binding miRNAs and RNA binding proteins. Genetic variation of circRNAs may affect complex traits/diseases by changing their binding efficiency to target miRNAs and proteins. There is a growing demand for investigations of the functions of genetic changes using large-scale experimental evidence. However, there is no online genetic resource for circRNA genes. Results: We performed extensive genetic annotation of 295,526 circRNAs integrated from circBase, circNet and circRNAdb. All pre-computed genetic variants were presented at our online resource, circVAR, with data browsing and search functionality. We explored the chromosome-based distribution of circRNAs and their associated variants. We found that, based on mapping to the 1000 Genomes and ClinVAR databases, chromosome 17 has a relatively large number of circRNAs and associated common and health-related genetic variants. Following the annotation of genome-wide association study (GWAS)-based circRNA variants, we found many non-coding variants within circRNAs, suggesting novel mechanisms for common diseases reported from GWAS. For cancer-based somatic variants, we found that chromosome 7 has many highly complex mutations that have been overlooked in previous research. Conclusion: We used the circVAR database to collect SNPs and small insertions and deletions (INDELs) in putative circRNA regions and to identify their potential phenotypic information. To provide a reusable resource for the circRNA research community, we have published all the pre-computed genetic data concerning circRNAs and associated genes together with data query and browsing functions at http://soft.bioinfo-minzhao.org/circvar .

Journal Article

On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types

by Phillips, Charles A , Baker, Erich J , Langston, Michael A in Algorithms , Analysis , Animals

2014

Background: Integrating and analyzing heterogeneous genome-scale data is a huge algorithmic challenge for modern systems biology. Bipartite graphs can be useful for representing relationships across pairs of disparate data types, with the interpretation of these relationships accomplished through an enumeration of maximal bicliques. Most previously-known techniques are generally ill-suited to this foundational task, because they are relatively inefficient and without effective scaling. In this paper, a powerful new algorithm is described that produces all maximal bicliques in a bipartite graph. Unlike most previous approaches, the new method neither places undue restrictions on its input nor inflates the problem size. Efficiency is achieved through an innovative exploitation of bipartite graph structure, and through computational reductions that rapidly eliminate non-maximal candidates from the search space. An iterative selection of vertices for consideration based on non-decreasing common neighborhood sizes boosts efficiency and leads to more balanced recursion trees. Results: The new technique is implemented and compared to previously published approaches from graph theory and data mining. Formal time and space bounds are derived. Experiments are performed on both random graphs and graphs constructed from functional genomics data. It is shown that the new method substantially outperforms the best previous alternatives. Conclusions: The new method is streamlined, efficient, and particularly well-suited to the study of huge and diverse biological data. A robust implementation has been incorporated into GeneWeaver, an online tool for integrating and analyzing functional genomics experiments, available at http://geneweaver.org . The enormous increase in scalability it provides empowers users to study complex and previously unassailable gene-set associations between genes and their biological functions in a hierarchical fashion and on a genome-wide scale. This practical computational resource is adaptable to almost any applications environment in which bipartite graphs can be used to model relationships between pairs of heterogeneous entities.
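To make the notion of a maximal biclique concrete, here is a small brute-force sketch. It is not the algorithm described in the article: it enumerates every subset of one vertex side and is therefore exponential, suitable only for toy inputs. The function name and the example graph are hypothetical.

```python
from itertools import combinations


def maximal_bicliques(adj):
    """Enumerate maximal bicliques of a small bipartite graph by brute force.

    adj maps each left-side vertex to a set (not frozenset) of its
    right-side neighbours. Returns a set of (left, right) frozenset pairs.
    """
    left = sorted(adj)
    found = set()
    for size in range(1, len(left) + 1):
        for subset in combinations(left, size):
            # Right vertices adjacent to every vertex in the subset.
            common = set.intersection(*(adj[u] for u in subset))
            if not common:
                continue
            # Close the left side: every left vertex adjacent to all of `common`.
            closed_left = frozenset(u for u in left if common <= adj[u])
            found.add((closed_left, frozenset(common)))
    return found


if __name__ == "__main__":
    # Hypothetical gene-to-annotation graph, for illustration only.
    graph = {"g1": {"a", "b"}, "g2": {"a", "b"}, "g3": {"b", "c"}}
    for left_side, right_side in maximal_bicliques(graph):
        print(sorted(left_side), sorted(right_side))
```

Each reported pair is closed on both sides, so no vertex can be added to either side without breaking the biclique. Avoiding this kind of exhaustive subset search is the motivation for the pruning and vertex-ordering strategies the abstract describes.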

Journal Article
