Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Predicting sample size required for classification performance
by
Ngo, Long H
, Kandula, Sasikiran
, Zeng-Treitler, Qing
, Figueroa, Rosa L
in
Algorithms
/ Classification
/ Confidence intervals
/ Data Interpretation, Statistical
/ Data processing
/ Decision making
/ Diagnosis, Computer-Assisted
/ Experiments
/ Health Informatics
/ Humans
/ Informatics
/ Information Systems and Communication Service
/ Learning
/ Learning algorithms
/ Learning Curve
/ Machine learning
/ Management
/ Management of Computing and Information Systems
/ Medical records
/ Medical research
/ Medicine
/ Medicine & Public Health
/ Methods
/ Models, Statistical
/ Nonlinear Dynamics
/ Pattern Recognition, Automated
/ Predictive Value of Tests
/ Probability Learning
/ Problem-Based Learning - methods
/ Reproducibility of Results
/ Research Article
/ Sample Size
/ Sampling
/ Stochastic Processes
/ Studies
2012
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Predicting sample size required for classification performance
by
Ngo, Long H
, Kandula, Sasikiran
, Zeng-Treitler, Qing
, Figueroa, Rosa L
in
Algorithms
/ Classification
/ Confidence intervals
/ Data Interpretation, Statistical
/ Data processing
/ Decision making
/ Diagnosis, Computer-Assisted
/ Experiments
/ Health Informatics
/ Humans
/ Informatics
/ Information Systems and Communication Service
/ Learning
/ Learning algorithms
/ Learning Curve
/ Machine learning
/ Management
/ Management of Computing and Information Systems
/ Medical records
/ Medical research
/ Medicine
/ Medicine & Public Health
/ Methods
/ Models, Statistical
/ Nonlinear Dynamics
/ Pattern Recognition, Automated
/ Predictive Value of Tests
/ Probability Learning
/ Problem-Based Learning - methods
/ Reproducibility of Results
/ Research Article
/ Sample Size
/ Sampling
/ Stochastic Processes
/ Studies
2012
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Predicting sample size required for classification performance
by
Ngo, Long H
, Kandula, Sasikiran
, Zeng-Treitler, Qing
, Figueroa, Rosa L
in
Algorithms
/ Classification
/ Confidence intervals
/ Data Interpretation, Statistical
/ Data processing
/ Decision making
/ Diagnosis, Computer-Assisted
/ Experiments
/ Health Informatics
/ Humans
/ Informatics
/ Information Systems and Communication Service
/ Learning
/ Learning algorithms
/ Learning Curve
/ Machine learning
/ Management
/ Management of Computing and Information Systems
/ Medical records
/ Medical research
/ Medicine
/ Medicine & Public Health
/ Methods
/ Models, Statistical
/ Nonlinear Dynamics
/ Pattern Recognition, Automated
/ Predictive Value of Tests
/ Probability Learning
/ Problem-Based Learning - methods
/ Reproducibility of Results
/ Research Article
/ Sample Size
/ Sampling
/ Stochastic Processes
/ Studies
2012
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Predicting sample size required for classification performance
Journal Article
Predicting sample size required for classification performance
2012
Request Book From Autostore
and Choose the Collection Method
Overview
Background
Supervised learning methods need annotated data in order to generate efficient models. Annotated data, however, is a relatively scarce resource and can be expensive to obtain. For both passive and active learning methods, there is a need to estimate the size of the annotated sample required to reach a performance target.
Methods
We designed and implemented a method that fits an inverse power law model to points of a given learning curve created using a small annotated training set. Fitting is carried out using nonlinear weighted least squares optimization. The fitted model is then used to predict the classifier's performance and confidence interval for larger sample sizes. For evaluation, the nonlinear weighted curve fitting method was applied to a set of learning curves generated using clinical text and waveform classification tasks with active and passive sampling methods, and predictions were validated using standard goodness of fit measures. As control we used an un-weighted fitting method.
Results
A total of 568 models were fitted and the model predictions were compared with the observed performances. Depending on the data set and sampling method, it took between 80 to 560 annotated samples to achieve mean average and root mean squared error below 0.01. Results also show that our weighted fitting method outperformed the baseline un-weighted method (p < 0.05).
Conclusions
This paper describes a simple and effective sample size prediction algorithm that conducts weighted fitting of learning curves. The algorithm outperformed an un-weighted algorithm described in previous literature. It can help researchers determine annotation sample size for supervised machine learning.
Publisher
BioMed Central,BioMed Central Ltd,Springer Nature B.V,BMC
Subject
MBRLCatalogueRelatedBooks
Related Items
Related Items
We currently cannot retrieve any items related to this title. Kindly check back at a later time.
This website uses cookies to ensure you get the best experience on our website.