Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
A compressed large language model embedding dataset of ICD 10 CM descriptions
by
Esserman, Denise
, Greene, Erich J.
, Latham, Nancy K.
, King, Casey
, Kane, Michael J.
, Ganz, David A.
in
Algorithms
/ Analysis
/ Artificial intelligence
/ Autoencoder
/ Bioinformatics
/ Biomedical and Life Sciences
/ Computational Biology/Bioinformatics
/ Computational linguistics
/ Computer Appl. in Life Sciences
/ Datasets
/ EHR
/ Electronic health records
/ Embedding
/ ICD-10-CM
/ Language
/ Language processing
/ Large language model
/ Large language models
/ Life Sciences
/ Machine learning
/ Medical screening
/ Methods
/ Microarrays
/ Natural language interfaces
/ NLP
/ Quality standards
/ Reduction
/ Reimbursement
/ Representations
/ Semantics
2023
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
A compressed large language model embedding dataset of ICD 10 CM descriptions
by
Esserman, Denise
, Greene, Erich J.
, Latham, Nancy K.
, King, Casey
, Kane, Michael J.
, Ganz, David A.
in
Algorithms
/ Analysis
/ Artificial intelligence
/ Autoencoder
/ Bioinformatics
/ Biomedical and Life Sciences
/ Computational Biology/Bioinformatics
/ Computational linguistics
/ Computer Appl. in Life Sciences
/ Datasets
/ EHR
/ Electronic health records
/ Embedding
/ ICD-10-CM
/ Language
/ Language processing
/ Large language model
/ Large language models
/ Life Sciences
/ Machine learning
/ Medical screening
/ Methods
/ Microarrays
/ Natural language interfaces
/ NLP
/ Quality standards
/ Reduction
/ Reimbursement
/ Representations
/ Semantics
2023
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
A compressed large language model embedding dataset of ICD 10 CM descriptions
by
Esserman, Denise
, Greene, Erich J.
, Latham, Nancy K.
, King, Casey
, Kane, Michael J.
, Ganz, David A.
in
Algorithms
/ Analysis
/ Artificial intelligence
/ Autoencoder
/ Bioinformatics
/ Biomedical and Life Sciences
/ Computational Biology/Bioinformatics
/ Computational linguistics
/ Computer Appl. in Life Sciences
/ Datasets
/ EHR
/ Electronic health records
/ Embedding
/ ICD-10-CM
/ Language
/ Language processing
/ Large language model
/ Large language models
/ Life Sciences
/ Machine learning
/ Medical screening
/ Methods
/ Microarrays
/ Natural language interfaces
/ NLP
/ Quality standards
/ Reduction
/ Reimbursement
/ Representations
/ Semantics
2023
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
A compressed large language model embedding dataset of ICD 10 CM descriptions
Journal Article
A compressed large language model embedding dataset of ICD 10 CM descriptions
2023
Request Book From Autostore
and Choose the Collection Method
Overview
This paper presents novel datasets providing numerical representations of ICD-10-CM codes by generating description embeddings using a large language model followed by a dimension reduction via autoencoder. The embeddings serve as informative input features for machine learning models by capturing relationships among categories and preserving inherent context information. The model generating the data was validated in two ways. First, the dimension reduction was validated using an autoencoder, and secondly, a supervised model was created to estimate the ICD-10-CM hierarchical categories. Results show that the dimension of the data can be reduced to as few as 10 dimensions while maintaining the ability to reproduce the original embeddings, with the fidelity decreasing as the reduced-dimension representation decreases. Multiple compression levels are provided, allowing users to choose as per their requirements, download and use without any other setup. The readily available datasets of ICD-10-CM codes are anticipated to be highly valuable for researchers in biomedical informatics, enabling more advanced analyses in the field. This approach has the potential to significantly improve the utility of ICD-10-CM codes in the biomedical domain.
Publisher
BioMed Central,BioMed Central Ltd,Springer Nature B.V,BMC
Subject
This website uses cookies to ensure you get the best experience on our website.