Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
10
result(s) for
"Devaurs, Didier"
Sort by:
3pHLA-score improves structure-based peptide-HLA binding affinity prediction
by
Antunes, Dinler Amaral
,
Rigo, Mauricio Menegatti
,
Conev, Anja
in
631/114
,
631/114/1305
,
631/114/2397
2022
Binding of peptides to Human Leukocyte Antigen (HLA) receptors is a prerequisite for triggering immune response. Estimating peptide-HLA (pHLA) binding is crucial for peptide vaccine target identification and epitope discovery pipelines. Computational methods for binding affinity prediction can accelerate these pipelines. Currently, most of those computational methods rely exclusively on sequence-based data, which leads to inherent limitations. Recent studies have shown that structure-based data can address some of these limitations. In this work we propose a novel machine learning (ML) structure-based protocol to predict binding affinity of peptides to HLA receptors. For that, we engineer the input features for ML models by decoupling energy contributions at different residue positions in peptides, which leads to our novel per-peptide-position protocol. Using Rosetta’s ref2015 scoring function as a baseline we use this protocol to develop 3pHLA-score. Our per-peptide-position protocol outperforms the standard training protocol and leads to an increase from 0.82 to 0.99 of the area under the precision-recall curve. 3pHLA-score outperforms widely used scoring functions (AutoDock4, Vina, Dope, Vinardo, FoldX, GradDock) in a structural virtual screening task. Overall, this work brings structure-based methods one step closer to epitope discovery pipelines and could help advance the development of cancer and viral vaccines.
Journal Article
Using parallelized incremental meta-docking can solve the conformational sampling issue when docking large ligands to proteins
by
Hall-Swan, Sarah
,
Kavraki, Lydia E
,
Devaurs, Didier
in
Algorithms
,
Binding sites
,
Biological Microscopy
2019
Background
Docking large ligands, and especially peptides, to protein receptors is still considered a challenge in computational structural biology. Besides the issue of accurately scoring the binding modes of a protein-ligand complex produced by a molecular docking tool, the conformational sampling of a large ligand is also often considered a challenge because of its underlying combinatorial complexity. In this study, we evaluate the impact of using parallelized and incremental paradigms on the accuracy and performance of conformational sampling when docking large ligands. We use five datasets of protein-ligand complexes involving ligands that could not be accurately docked by classical protein-ligand docking tools in previous similar studies.
Results
Our computational evaluation shows that simply increasing the amount of conformational sampling performed by a protein-ligand docking tool, such as Vina, by running it for longer is rarely beneficial. Instead, it is more efficient and advantageous to run several short instances of this docking tool in parallel and group their results together, in a straightforward parallelized docking protocol. Even greater accuracy and efficiency are achieved by our parallelized incremental meta-docking tool, DINC, showing the additional benefits of its incremental paradigm. Using DINC, we could accurately reproduce the vast majority of the protein-ligand complexes we considered.
Conclusions
Our study suggests that, even when trying to dock large ligands to proteins, the conformational sampling of the ligand should no longer be considered an issue, as simple docking protocols using existing tools can solve it. Therefore, scoring should currently be regarded as the biggest unmet challenge in molecular docking.
Journal Article
General Prediction of Peptide-MHC Binding Modes Using Incremental Docking: A Proof of Concept
2018
The class I major histocompatibility complex (MHC) is capable of binding peptides derived from intracellular proteins and displaying them at the cell surface. The recognition of these peptide-MHC (pMHC) complexes by T-cells is the cornerstone of cellular immunity, enabling the elimination of infected or tumoral cells. T-cell-based immunotherapies against cancer, which leverage this mechanism, can greatly benefit from structural analyses of pMHC complexes. Several attempts have been made to use molecular docking for such analyses, but pMHC structure remains too challenging for even state-of-the-art docking tools. To overcome these limitations, we describe the use of an incremental meta-docking approach for structural prediction of pMHC complexes. Previous methods applied in this context used specific constraints to reduce the complexity of this prediction problem, at the expense of generality. Our strategy makes no assumption and can potentially be used to predict binding modes for any pMHC complex. Our method has been tested in a re-docking experiment, reproducing the binding modes of 25 pMHC complexes whose crystal structures are available. This study is a proof of concept that incremental docking strategies can lead to general geometry prediction of pMHC complexes, with potential applications for immunotherapy against cancer or infectious diseases.
Journal Article
1084 Large-scale modeling of pHLA complexes for structure-based immunogenicity prediction
by
Antunes Dinler
,
Choi Sae, Hee E
,
Freitas Martiela
in
Artificial intelligence
,
Cancer
,
Datasets
2025
BackgroundCancer immunotherapies have become an effective tool by manipulating cellular immune response to fight cancer. A key step of cellular immunity involves peptides binding to the Human Leukocyte Antigen (HLA) molecules and forming a stable peptide-HLA (pHLA) complex. pHLAs are transported to the cell surface where they can be ‘inspected’ by T-cells, through highly specific T-cell receptors (TCRs). Understanding such interactions is central to designing peptide-based vaccines and T-cell-based immunotherapies. Key limitations to widespread use of cancer immunotherapies includes lack of immunogenicity of tumor-associated peptide-antigens, and risk of off-target causing immune-related adverse events. Different approaches are being explored to address and minimize this issue. Therefore, accurate prediction of pHLA immunogenicity is a critical area for advancing the design of cancer immunotherapies. However, current immunogenicity prediction tools are limited to identifying peptide motifs, neglecting critical structural features and TCR-specific recognition. We developed a structure-based machine learning tool that utilizes models of HLA-peptide-TCR complexes to extract features predictive of immunogenicity.MethodsA labelled dataset involving over 15 million pHLA sequences with experimentally-determined immunogenicity results was selected from a previously developed sequence-based immunogenicity prediction tool called BigMHC.1 APE-Gen 2.02 and Boltz-23 were used to generate structural models for each pHLA, to be utilized for the extraction of structural features. APE-Gen 2.0 was selected for its tailored, scalable pHLA modeling and docking-based scoring of conformational ensembles.2 Boltz-2, a novel AI-based tool, for outperforming Alphafold2 by incorporating improved biophysical refinement.3 In addition, it also predicts complex binding affinity using a new AI-based approach.ResultsWe modeled a pilot dataset of 77 complexes with binding affinity and immunogenicity labels using Ape-Gen 2.0 and Boltz2. Top scored conformation from Ape-Gen 2.0 ensembles are being analyzed to determine which structural features contribute to immunogenicity. Different properties and featurization approaches will be explored. We are also evaluating accuracy of affinity predictions on Boltz-2, in comparison to Ape-Gen 2.0 and Rosetta. Large-scale modeling of the entire dataset of 15 million complexes is ongoing.ConclusionsBy integrating structural features of the entire pHLA complex we aim to overcome limitations of current sequence-based approaches and enable more accurate screening of therapeutic peptides, improving the safety of next-generation immunotherapies. Once modeled with both independent approaches (~30 million pHLA complexes), our dataset will be the largest available for machine-learning training. We will then explore how to further improve immunogenicity prediction by leveraging different structural sources, featurization methods and AI models.ReferencesAlbert BA, et al. Deep neural networks predict class I major histocompatibility complex epitope presentation and transfer learn neoepitope immunogenicity. Nat. Mach. Intell. 2023;5:861–872.Fasoulis R, Rigo MM, Lizée G, Antunes DA, Kavraki LE. APE-Gen2.0: expanding rapid class I peptide-major histocompatibility complex modeling to post-translational modifications and noncanonical peptide geometries. J. Chem. Inf. Model. 2024;64:1730–1750.Passaro S, et al. Boltz-2: towards accurate and efficient binding affinity prediction. 2025.06.14.659707 Preprint at https://doi.org/10.1101/2025.06.14.659707 (2025).
Journal Article
Computational analysis of complement inhibitor compstatin using molecular dynamics
by
Devaurs, Didier
,
Antunes, Dinler A.
,
Kavraki, Lydia E.
in
Analogs
,
Antiviral Agents - chemistry
,
Antiviral Agents - metabolism
2020
The complement system plays a major role in human immunity, but its abnormal activation can have severe pathological impacts. By mimicking a natural mechanism of complement regulation, the small peptide
compstatin
has proven to be a very promising complement inhibitor. Over the years, several compstatin analogs have been created, with improved inhibitory potency. A recent analog is being developed as a candidate drug against several pathological conditions, including COVID-19. However, the reasons behind its higher potency and increased binding affinity to complement proteins are not fully clear. This computational study highlights the mechanistic properties of several compstatin analogs, thus complementing previous experimental studies. We perform molecular dynamics simulations involving six analogs alone in solution and two complexes with compstatin bound to complement component 3. These simulations reveal that all the analogs we consider, except the original compstatin, naturally adopt a pre-bound conformation in solution. Interestingly, this set of analogs adopting a pre-bound conformation includes analogs that were not known to benefit from this behavior. We also show that the most recent compstatin analog (among those we consider) forms a stronger hydrogen bond network with its complement receptor than an earlier analog.
Journal Article
DINC-COVID: A webserver for ensemble docking with flexible SARS-CoV-2 proteins
by
Rigo, Mauricio M.
,
Hall-Swan, Sarah
,
Zanatta, Geancarlo
in
Algorithms
,
Binding sites
,
Computer applications
2021
An unprecedented research effort has been undertaken in response to the ongoing COVID-19 pandemic. This has included the determination of hundreds of crystallographic structures of SARS-CoV-2 proteins, and numerous virtual screening projects searching large compound libraries for potential drug inhibitors. Unfortunately, these initiatives have had very limited success in producing effective inhibitors against SARS-CoV-2 proteins. A reason might be an often overlooked factor in these computational efforts: receptor flexibility. To address this issue we have implemented a computational tool for ensemble docking with SARS-CoV-2 proteins. We have extracted representative ensembles of protein conformations from the Protein Data Bank and from in silico molecular dynamics simulations. Twelve pre-computed ensembles of SARS-CoV-2 protein conformations have now been made available for ensemble docking via a user-friendly webserver called DINC-COVID (dinc-covid.kavrakilab.org). We have validated DINC-COVID using data on tested inhibitors of two SARS-CoV-2 proteins, obtaining good correlations between docking-derived binding energies and experimentally-determined binding affinities. Some of the best results have been obtained on a dataset of large ligands resolved via room temperature crystallography, and therefore capturing alternative receptor conformations. In addition, we have shown that the ensembles available in DINC-COVID capture different ranges of receptor flexibility, and that this diversity is useful in finding alternative binding modes of ligands. Overall, our work highlights the importance of accounting for receptor flexibility in docking studies, and provides a platform for the identification of new inhibitors against SARS-CoV-2 proteins.
[Display omitted]
•DINC-COVID is a user-friendly webserver for ensemble docking to SARS-CoV-2 proteins.•DINC-COVID involves several receptor ensembles and different ranges of flexibility.•DINC-COVID predicts binding modes and energies in agreement with experimental data.
Journal Article
Revealing Unknown Protein Structures Using Computational Conformational Sampling Guided by Experimental Hydrogen-Exchange Data
by
Devaurs, Didier
,
Antunes, Dinler A.
,
Kavraki, Lydia E.
in
Algorithms
,
Crystallography
,
Cytokines
2018
Both experimental and computational methods are available to gather information about a protein’s conformational space and interpret changes in protein structure. However, experimentally observing and computationally modeling large proteins remain critical challenges for structural biology. Our work aims at addressing these challenges by combining computational and experimental techniques relying on each other to overcome their respective limitations. Indeed, despite its advantages, an experimental technique such as hydrogen-exchange monitoring cannot produce structural models because of its low resolution. Additionally, the computational methods that can generate such models suffer from the curse of dimensionality when applied to large proteins. Adopting a common solution to this issue, we have recently proposed a framework in which our computational method for protein conformational sampling is biased by experimental hydrogen-exchange data. In this paper, we present our latest application of this computational framework: generating an atomic-resolution structural model for an unknown protein state. For that, starting from an available protein structure, we explore the conformational space of this protein, using hydrogen-exchange data on this unknown state as a guide. We have successfully used our computational framework to generate models for three proteins of increasing size, the biggest one undergoing large-scale conformational changes.
Journal Article
Using parallelized incremental meta-docking can solve the conformational sampling issue when docking large ligands to proteins
by
Hall-Swan, Sarah
,
Kavraki, Lydia E
,
Devaurs, Didier
in
Computational biology
,
Observations
,
Peptides
2019
Docking large ligands, and especially peptides, to protein receptors is still considered a challenge in computational structural biology. Besides the issue of accurately scoring the binding modes of a protein-ligand complex produced by a molecular docking tool, the conformational sampling of a large ligand is also often considered a challenge because of its underlying combinatorial complexity. In this study, we evaluate the impact of using parallelized and incremental paradigms on the accuracy and performance of conformational sampling when docking large ligands. We use five datasets of protein-ligand complexes involving ligands that could not be accurately docked by classical protein-ligand docking tools in previous similar studies. Our computational evaluation shows that simply increasing the amount of conformational sampling performed by a protein-ligand docking tool, such as Vina, by running it for longer is rarely beneficial. Instead, it is more efficient and advantageous to run several short instances of this docking tool in parallel and group their results together, in a straightforward parallelized docking protocol. Even greater accuracy and efficiency are achieved by our parallelized incremental meta-docking tool, DINC, showing the additional benefits of its incremental paradigm. Using DINC, we could accurately reproduce the vast majority of the protein-ligand complexes we considered. Our study suggests that, even when trying to dock large ligands to proteins, the conformational sampling of the ligand should no longer be considered an issue, as simple docking protocols using existing tools can solve it. Therefore, scoring should currently be regarded as the biggest unmet challenge in molecular docking.
Journal Article
EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
Proteins are dynamic macromolecules that perform vital functions in cells. A protein structure determines its function, but this structure is not static, as proteins change their conformation to achieve various functions. Understanding the conformational landscapes of proteins is essential to understand their mechanism of action. Sets of carefully chosen conformations can summarize such complex landscapes and provide better insights into protein function than single conformations. We refer to these sets as representative conformational ensembles. Recent advances in computational methods have led to an increase in number of available structural datasets spanning conformational landscapes. However, extracting representative conformational ensembles from such datasets is not an easy task and many methods have been developed to tackle it. Our new approach, EnGens (short for ensemble generation), collects these methods into a unified framework for generating and analyzing protein conformational ensembles. In this work we: (1) provide an overview of existing methods and tools for protein structural ensemble generation and analysis; (2) unify existing approaches in an open-source Python package, and a portable Docker image, providing interactive visualizations within a Jupyter Notebook pipeline; (3) test our pipeline on a few canonical examples found in the literature. Representative ensembles produced by EnGens can be used for many downstream tasks such as protein-ligand ensemble docking, Markov state modeling of protein dynamics and analysis of the effect of single-point mutations.
Journal Article
DINC-COVID: A webserver for ensemble docking with flexible SARS-CoV-2 proteins
by
Rigo, Mauricio M
,
Antunes, Dinler A
,
Hall-Swan, Sarah
in
Bioinformatics
,
Computer applications
,
DNA-directed RNA polymerase
2021
Abstract Motivation Recent efforts to computationally identify inhibitors for SARS-CoV-2 proteins have largely ignored the issue of receptor flexibility. We have implemented a computational tool for ensemble docking with the SARS-CoV-2 proteins, including the main protease (Mpro), papain-like protease (PLpro) and RNA-dependent RNA polymerase (RdRp). Results Ensembles of other SARS-CoV-2 proteins are being prepared and made available through a user-friendly docking interface. Plausible binding modes between conformations of a selected ensemble and an uploaded ligand are generated by DINC, our parallelized meta-docking tool. Binding modes are scored with three scoring functions, and account for the flexibility of both the ligand and receptor. Additional details on our methods are provided in the supplementary material. Availability dinc-covid.kavrakilab.org Supplementary information Details on methods for ensemble generation and docking are provided as supplementary data online. Contact geancarlo.zanatta{at}ufc.br, kavraki{at}rice.edu Competing Interest Statement The authors have declared no competing interest.