Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Hybrid natural language processing tool for semantic annotation of medical texts in Spanish
by
Capllonch-Carrión, Adrián
, Valverde-Mateos, Ana
, Campillos-Llanos, Leonardo
in
Algorithms
/ Analysis
/ Annotations
/ Automation
/ Bioinformatics
/ Biomedical and Life Sciences
/ Case reports
/ Clinical trials
/ Computational Biology/Bioinformatics
/ Computational linguistics
/ Computer Appl. in Life Sciences
/ Data analysis
/ Data mining
/ Data Mining - methods
/ Deep Learning
/ Deep learning in healthcare
/ Dictionaries
/ Disease
/ Electronic Health Records
/ Error analysis
/ Humans
/ Information processing
/ Information retrieval
/ Language
/ Language processing
/ Life Sciences
/ Machine learning
/ Medical literature
/ Medical natural language processing
/ Medical personnel
/ Medical text mining tool
/ Methods
/ Microarrays
/ Named entity recognition
/ Natural language interfaces
/ Natural Language Processing
/ Product reviews
/ Python
/ Semantics
/ Software
/ Spain
/ Spanish language
/ Spanish medical NLP
/ Task complexity
/ Texts
/ Unstructured data
2025
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Hybrid natural language processing tool for semantic annotation of medical texts in Spanish
by
Capllonch-Carrión, Adrián
, Valverde-Mateos, Ana
, Campillos-Llanos, Leonardo
in
Algorithms
/ Analysis
/ Annotations
/ Automation
/ Bioinformatics
/ Biomedical and Life Sciences
/ Case reports
/ Clinical trials
/ Computational Biology/Bioinformatics
/ Computational linguistics
/ Computer Appl. in Life Sciences
/ Data analysis
/ Data mining
/ Data Mining - methods
/ Deep Learning
/ Deep learning in healthcare
/ Dictionaries
/ Disease
/ Electronic Health Records
/ Error analysis
/ Humans
/ Information processing
/ Information retrieval
/ Language
/ Language processing
/ Life Sciences
/ Machine learning
/ Medical literature
/ Medical natural language processing
/ Medical personnel
/ Medical text mining tool
/ Methods
/ Microarrays
/ Named entity recognition
/ Natural language interfaces
/ Natural Language Processing
/ Product reviews
/ Python
/ Semantics
/ Software
/ Spain
/ Spanish language
/ Spanish medical NLP
/ Task complexity
/ Texts
/ Unstructured data
2025
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Hybrid natural language processing tool for semantic annotation of medical texts in Spanish
by
Capllonch-Carrión, Adrián
, Valverde-Mateos, Ana
, Campillos-Llanos, Leonardo
in
Algorithms
/ Analysis
/ Annotations
/ Automation
/ Bioinformatics
/ Biomedical and Life Sciences
/ Case reports
/ Clinical trials
/ Computational Biology/Bioinformatics
/ Computational linguistics
/ Computer Appl. in Life Sciences
/ Data analysis
/ Data mining
/ Data Mining - methods
/ Deep Learning
/ Deep learning in healthcare
/ Dictionaries
/ Disease
/ Electronic Health Records
/ Error analysis
/ Humans
/ Information processing
/ Information retrieval
/ Language
/ Language processing
/ Life Sciences
/ Machine learning
/ Medical literature
/ Medical natural language processing
/ Medical personnel
/ Medical text mining tool
/ Methods
/ Microarrays
/ Named entity recognition
/ Natural language interfaces
/ Natural Language Processing
/ Product reviews
/ Python
/ Semantics
/ Software
/ Spain
/ Spanish language
/ Spanish medical NLP
/ Task complexity
/ Texts
/ Unstructured data
2025
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Hybrid natural language processing tool for semantic annotation of medical texts in Spanish
Journal Article
Hybrid natural language processing tool for semantic annotation of medical texts in Spanish
2025
Request Book From Autostore
and Choose the Collection Method
Overview
Background
Natural language processing (NLP) enables the extraction of information embedded within unstructured texts, such as clinical case reports and trial eligibility criteria. By identifying relevant medical concepts, NLP facilitates the generation of structured and actionable data, supporting complex tasks like cohort identification and the analysis of clinical records. To accomplish those tasks, we introduce a deep learning-based and lexicon-based named entity recognition (NER) tool for texts in Spanish. It performs medical NER and normalization, medication information extraction and detection of temporal entities, negation and speculation, and temporality or experiencer attributes (Age, Contraindicated, Negated, Speculated, Hypothetical, Future, Family_member, Patient and Other). We built the tool with a dedicated lexicon and rules adapted from NegEx and HeidelTime. Using these resources, we annotated a corpus of 1200 texts, with high inter-annotator agreement (average F1 = 0.841% ± 0.045 for entities, and average F1 = 0.881% ± 0.032 for attributes). We used this corpus to train Transformer-based models (RoBERTa-based models, mBERT and mDeBERTa). We integrated them with the dictionary-based system in a hybrid tool, and distribute the models via the Hugging Face hub. For an internal validation, we used a held-out test set and conducted an error analysis. For an external validation, eight medical professionals evaluated the system by revising the annotation of 200 new texts not used in development.
Results
In the internal validation, the models yielded F1 values up to 0.915. In the external validation with 100 clinical trials, the tool achieved an average F1 score of 0.858 (± 0.032); and in 100 anonymized clinical cases, it achieved an average F1 score of 0.910 (± 0.019).
Conclusions
The tool is available at
https://claramed.csic.es/medspaner
. We also release the code (
https://github.com/lcampillos/medspaner
) and the annotated corpus to train the models.
Publisher
BioMed Central,BioMed Central Ltd,Springer Nature B.V,BMC
This website uses cookies to ensure you get the best experience on our website.