Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
17
result(s) for
"Wilson, Laurence O. W."
Sort by:
Unsupervised machine learning framework for discriminating major variants of concern during COVID-19
by
Bansal, Chaarvi
,
Blau, Tom
,
Vasan, Seshadri
in
Acute respiratory distress syndrome
,
Algorithms
,
Analysis
2023
Due to the high mutation rate of the virus, the COVID-19 pandemic evolved rapidly. Certain variants of the virus, such as Delta and Omicron emerged with altered viral properties leading to severe transmission and death rates. These variants burdened the medical systems worldwide with a major impact to travel, productivity, and the world economy. Unsupervised machine learning methods have the ability to compress, characterize, and visualize unlabelled data. This paper presents a framework that utilizes unsupervised machine learning methods to discriminate and visualize the associations between major COVID-19 variants based on their genome sequences. These methods comprise a combination of selected dimensionality reduction and clustering techniques. The framework processes the RNA sequences by performing a k -mer analysis on the data and further visualises and compares the results using selected dimensionality reduction methods that include principal component analysis (PCA), t-distributed stochastic neighbour embedding (t-SNE), and uniform manifold approximation projection (UMAP). Our framework also employs agglomerative hierarchical clustering to visualize the mutational differences among major variants of concern and country-wise mutational differences for selected variants (Delta and Omicron) using dendrograms. We also provide country-wise mutational differences for selected variants via dendrograms. We find that the proposed framework can effectively distinguish between the major variants and has the potential to identify emerging variants in the future.
Journal Article
Evolutionary Insights from Association Rule Mining of Co-Occurring Mutations in Influenza Hemagglutinin and Neuraminidase
by
Monaghan, Michael T.
,
Bauer, Denis C.
,
Wilson, Laurence O. W.
in
Antigenic drift
,
Antigenic Drift and Shift - genetics
,
Antigens, Viral - genetics
2024
Seasonal influenza viruses continuously evolve via antigenic drift. This leads to recurring epidemics, globally significant mortality rates, and the need for annually updated vaccines. Co-occurring mutations in hemagglutinin (HA) and neuraminidase (NA) are suggested to have synergistic interactions where mutations can increase the chances of immune escape and viral fitness. Association rule mining was used to identify temporal relationships of co-occurring HA–NA mutations of influenza virus A/H3N2 and its role in antigenic evolution. A total of 64 clusters were found. These included well-known mutations responsible for antigenic drift, as well as previously undiscovered groups. A majority (41/64) were associated with known antigenic sites, and 38/64 involved mutations across both HA and NA. The emergence and disappearance of N-glycosylation sites in the pattern of N-X-[S/T] were also identified, which are crucial post-translational processes to maintain protein stability and functional balance (e.g., emergence of NA:339ASP and disappearance of HA:187ASP). Our study offers an alternative approach to the existing mutual-information and phylogenetic methods used to identify co-occurring mutations, enabling faster processing of large amounts of data. Our approach can facilitate the prediction of critical mutations given their occurrence in a previous season, facilitating vaccine development for the next flu season and leading to better preparation for future pandemics.
Journal Article
VARSCOT: variant-aware detection and scoring enables sensitive and personalized off-target detection for CRISPR-Cas9
by
Reinert, Knut
,
Pockrandt, Christopher
,
Bauer, Denis C.
in
adverse effects
,
Applied Microbiology
,
artificial intelligence
2019
Background
Natural variations in a genome can drastically alter the CRISPR-Cas9 off-target landscape by creating or removing sites. Despite the resulting potential side-effects from such unaccounted for sites, current off-target detection pipelines are not equipped to include variant information. To address this, we developed VARiant-aware detection and SCoring of Off-Targets (VARSCOT).
Results
VARSCOT identifies only 0.6% of off-targets to be common between 4 individual genomes and the reference, with an average of 82% of off-targets unique to an individual. VARSCOT is the most sensitive detection method for off-targets, finding 40 to 70% more experimentally verified off-targets compared to other popular software tools and its machine learning model allows for CRISPR-Cas9 concentration aware off-target activity scoring.
Conclusions
VARSCOT allows researchers to take genomic variation into account when designing individual or population-wide targeting strategies. VARSCOT is available from
https://github.com/BauerLab/VARSCOT
.
Journal Article
The Current State and Future of CRISPR-Cas9 gRNA Design Tools
by
Bauer, Denis C.
,
Wilson, Laurence O. W.
,
O’Brien, Aidan R.
in
activity prediction
,
bioinformatics
,
chromatin
2018
Recent years have seen the development of computational tools to assist researchers in performing CRISPR-Cas9 experiment optimally. More specifically, these tools aim to maximize on-target activity (guide efficiency) while also minimizing potential off-target effects (guide specificity) by analyzing the features of the target site. Nonetheless, currently available tools cannot robustly predict experimental success as prediction accuracy depends on the approximations of the underlying model and how closely the experimental setup matches the data the model was trained on. Here, we present an overview of the available computational tools, their current limitations and future considerations. We discuss new trends around personalized health by taking genomic variants into account when predicting target sites as well as discussing other governing factors that can improve prediction accuracy.
Journal Article
Unlocking HDR-mediated nucleotide editing by identifying high-efficiency target sites using machine learning
2019
Editing individual nucleotides is a crucial component for validating genomic disease association. It is currently hampered by CRISPR-Cas-mediated “base editing” being limited to certain nucleotide changes, and only achievable within a small window around CRISPR-Cas target sites. The more versatile alternative, HDR (homology directed repair), has a 3-fold lower efficiency with known optimization factors being largely immutable in experiments. Here, we investigated the variable efficiency-governing factors on a novel mouse dataset using machine learning. We found the sequence composition of the single-stranded oligodeoxynucleotide (ssODN), i.e. the repair template, to be a governing factor. Furthermore, different regions of the ssODN have variable influence, which reflects the underlying mechanism of the repair process. Our model improves HDR efficiency by 83% compared to traditionally chosen targets. Using our findings, we developed CUNE (Computational Universal Nucleotide Editor), which enables users to identify and design the optimal targeting strategy using traditional base editing or – for-the-first-time – HDR-mediated nucleotide changes.
Journal Article
Scalable genomic data exchange and analytics with sBeacon
by
Grimes, John
,
Lin, Victor San Kho
,
Lawley, Michael
in
631/114/2401
,
631/114/2785
,
692/308/2056
2023
Journal Article
Democratising high performance computing for bioinformatics through serverless cloud computing: A case study on CRISPR-Cas9 guide RNA design with Crackling Cloud
by
Winsen, Mattias
,
Joy, Divya
,
Wilkins, Mackenzie
in
Cloud Computing
,
Computational Biology - methods
,
CRISPR-Cas Systems - genetics
2025
Organisations are challenged when meeting the computational requirements of large-scale bioinformatics analyses using their own resources. Cloud computing has democratised large-scale resources, and to reduce the barriers of working with large-scale compute, leading cloud vendors offer serverless computing, a low-maintenance and low-cost model that provides ample resources for highly scalable software applications. While serverless computing has broad use, its adoption in bioinformatics remains poor. Here, we demonstrate the most extensive use of high-performance serverless computing for bioinformatics by applying the available technologies to CRISPR-Cas9 guide RNA (gRNA) design. Our adaptation of the established gRNA design tool, named Crackling, implements a novel, cloud-native and serverless-based, high-performance computing environment using technologies made available by Amazon Web Services (AWS). The architecture, compatible with technologies from all leading cloud vendors, and the AWS implementation, contributes to an effort of reducing the barrier to large computational capacity in bioinformatics and for CRISPR-Cas9 gRNA design. Crackling Cloud can be deployed to any AWS account, and is freely available on GitHub under the BSD 3-clause license: https://github.com/bmds-lab/Crackling-AWS.
Journal Article
Synsor: a tool for alignment-free detection of engineered DNA sequences
by
Tay, Aidan P.
,
Didi, Kieran
,
Wickramarachchi, Anuradha
in
alignment-free
,
Biological & chemical terrorism
,
biosecurity
2024
DNA sequences of nearly any desired composition, length, and function can be synthesized to alter the biology of an organism for purposes ranging from the bioproduction of therapeutic compounds to invasive pest control. Yet despite offering many great benefits, engineered DNA poses a risk due to their possible misuse or abuse by malicious actors, or their unintentional introduction into the environment. Monitoring the presence of engineered DNA in biological or environmental systems is therefore crucial for routine and timely detection of emerging biological threats, and for improving public acceptance of genetic technologies. To address this, we developed Synsor, a tool for identifying engineered DNA sequences in high-throughput sequencing data. Synsor leverages the k-mer signature differences between naturally occurring and engineered DNA sequences and uses an artificial neural network to classify whether a DNA sequence is natural or engineered. By querying suspected sequences against the model, Synsor can identify sequences that are likely to have been engineered. Using natural plasmid and engineered vector sequences, we showed that Synsor identifies engineered DNA with >99% accuracy. We demonstrate how Synsor can be used to detect potential genetically engineered organisms and locate where engineered DNA is being introduced into the environment by analysing genomic and metagenomic data from yeast and wastewater samples, respectively. Synsor is therefore a powerful tool that will streamline the process of identifying engineered DNA in poorly characterized biological or environmental systems, thereby allowing for enhanced monitoring of emerging biological threats.
Journal Article
The histone chaperone HJURP is a new independent prognostic marker for luminal A breast carcinoma
by
Martel, Elise
,
Asselain, Bernard
,
Montes de Oca, Rocío
in
Autoantigens - metabolism
,
Biomarkers
,
Biomarkers, Tumor - metabolism
2015
Breast cancer is a heterogeneous disease with different molecular subtypes that have varying responses to therapy. An ongoing challenge in breast cancer research is to distinguish high-risk patients from good prognosis patients. This is particularly difficult in the low-grade, ER-positive luminal A tumors, where robust diagnostic tools to aid clinical treatment decisions are lacking. Recent data implicating chromatin regulators in cancer initiation and progression offers a promising avenue to develop new tools to help guide clinical decisions.
Here we exploit a published transcriptome dataset and an independent validation cohort to correlate the mRNA expression of selected chromatin regulators with respect to the four intrinsic breast cancer molecular subtypes. We then perform univariate and multivariate analyses to compare the prognostic value of a panel of chromatin regulators to Ki67, a currently utilized proliferation marker.
Unsupervised hierarchical clustering revealed a gene cluster containing several histone chaperones and histone variants highly-expressed in the proliferative subtypes (basal-like, HER2-positive, luminal B) but not in the luminal A subtype. Several chromatin regulators, including the histone chaperones CAF-1 (subunits p150 and p60), ASF1b, and HJURP, and the centromeric histone variant CENP-A, associated with local and metastatic relapse and poor patient outcome. Importantly, we find that HJURP can discriminate favorable and unfavorable outcome within the luminal A subtype, outperforming the currently utilized proliferation marker Ki67, as an independent prognostic marker for luminal A patients.
The integration of chromatin regulators as clinical biomarkers, in particular the histone chaperone HJURP, will help guide patient substratification and treatment options for low-risk luminal A breast carcinoma patients.
•Specific chromatin regulators are overexpressed in aggressive breast tumors.•CAF-1, ASF1b, HJURP, MCM2, and EZH2 expression differentiates between ER+ subtypes.•HJURP outperforms MKI67 for prognostic value within the luminal A subtype.•HJURP is an independent marker of disease outcome in luminal A patients.
Journal Article
Unsupervised machine learning framework for discriminating major variants of concern during COVID-19
2023
Due to the high mutation rate of the virus, the COVID-19 pandemic evolved rapidly. Certain variants of the virus, such as Delta and Omicron, emerged with altered viral properties leading to severe transmission and death rates. These variants burdened the medical systems worldwide with a major impact to travel, productivity, and the world economy. Unsupervised machine learning methods have the ability to compress, characterize, and visualize unlabelled data. This paper presents a framework that utilizes unsupervised machine learning methods to discriminate and visualize the associations between major COVID-19 variants based on their genome sequences. These methods comprise a combination of selected dimensionality reduction and clustering techniques. The framework processes the RNA sequences by performing a k-mer analysis on the data and further visualises and compares the results using selected dimensionality reduction methods that include principal component analysis (PCA), t-distributed stochastic neighbour embedding (t-SNE), and uniform manifold approximation projection (UMAP). Our framework also employs agglomerative hierarchical clustering to visualize the mutational differences among major variants of concern and country-wise mutational differences for selected variants (Delta and Omicron) using dendrograms. We also provide country-wise mutational differences for selected variants via dendrograms. We find that the proposed framework can effectively distinguish between the major variants and has the potential to identify emerging variants in the future.