Catalogue Search | MBRL

Imputation for Lipidomics and Metabolomics

by McDowell, Graeme S.V. , Surendra, Anuradha , Ou, Huiting in Machine learning , Methods

2025

Motivation: Missing values are prevalent in high-throughput measurements due to various experimental or analytical reasons. Imputation, the process of replacing missing values in a dataset with estimated values, plays an important role in multivariate and machine learning analyses. The three missingness patterns, including missing completely at random, missing at random, and missing not at random, describe unique dependencies between the missing and observed data. The optimal imputation method for each dataset depends on the type of data, the cause of the missingness, and the nature of relationships between the missing and observed data. The challenge is to identify the optimal imputation solution for a given dataset. Availability and implementation: ImpLiMet is freely available at https://complimet.ca/shiny/implimet/ and https://github.com/complimet/ImpLiMet.

Journal Article

Share this book

Add to My Shelf

Imp utation for Li pidomics and Met abolomics (ImpLiMet): a web-based application for optimization and method selection for missing data imputation

by Čuperlović-Culf, Miroslava , Surendra, Anuradha , Ou, Huiting

2024

Journal Article

Share this book

Add to My Shelf

Imputation for lipidomics and metabolomics (ImpLiMet): a web-based application for optimization and method selection for missing data imputation

by Čuperlović-Culf, Miroslava , Surendra, Anuradha , Ou, Huiting in Application Note , Machine learning , Methods

2025

NRC publication: Yes

Journal Article

Share this book

Add to My Shelf

METAbolomics data Balancing with Over-sampling Al-gorithms (META-BOA): an online resource for addressing class imbalance

by Anuradha Surendra , Lavallee-Adam, Mathieu , Cuperlovic-Culf, Miroslava in Algorithms , Classification , Computer applications

2022

Motivation: Class imbalance, or unequal sample sizes between classes, is an increasing concern in machine learn-ing for metabolomic and lipidomic data mining, which can result in overfitting for the over-represented class. Numerous methods have been developed for handling class imbalance, but they are not readily accessible to users with limited computational experience. Moreover, there is no resource that enables users to easily evaluate the effect of different over-sampling algorithms. Results: METAbolomics data Balancing with Over-sampling Algorithms (META-BOA) is a web-based application that enables users to select between four different methods for class balancing, followed by data visualization and classification of the sample to observe the augmentation effects. META-BOA outputs a newly balanced dataset, generating additional samples in the minority class, according to the users choice of Synthetic Minority Over-sampling Technique (SMOTE), Borderline-SMOTE (BSMOTE), Adaptive Synthetic (ADASYN), or Random Over-Sampling Examples (ROSE). META-BOA further displays both principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE) visualization of data pre- and post-over-sampling. Random forest classification is utilized to compare sample classification in both the original and balanced datasets, enabling users to select the most appro-priate method for their analyses. Availability and implementation: META-BOA is available at https://complimet.ca/meta-boa. Competing Interest Statement The authors have declared no competing interest.

Paper

Share this book

Add to My Shelf

METAbolomics data Balancing with Over-sampling Algorithms (META-BOA): an online resource for addressing class imbalance

by Lavallée-Adam, Mathieu , Čuperlović-Culf, Miroslava , Surendra, Anuradha in Bioinformatics

2022

Class imbalance, or unequal sample sizes between classes, is an increasing concern in machine learning for metabolomic and lipidomic data mining, which can result in overfitting for the over-represented class. Numerous methods have been developed for handling class imbalance, but they are not readily accessible to users with limited computational experience. Moreover, there is no resource that enables users to easily evaluate the effect of different over-sampling algorithms. METAbolomics data Balancing with Over-sampling Algorithms (META-BOA) is a web-based application that enables users to select between four different methods for class balancing, followed by data visualization and classification of the sample to observe the augmentation effects. META-BOA outputs a newly balanced dataset, generating additional samples in the minority class, according to the user’s choice of Synthetic Minority Over-sampling Technique (SMOTE), Borderline-SMOTE (BSMOTE), Adaptive Synthetic (ADASYN), or Random Over-Sampling Examples (ROSE). META-BOA further displays both principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE) visualization of data pre- and post-over-sampling. Random forest classification is utilized to compare sample classification in both the original and balanced datasets, enabling users to select the most appropriate method for their analyses. META-BOA is available at https://complimet.ca/meta-boa. Supplementary material is available at Bioinformatics online.

Paper

Share this book

Add to My Shelf

Imputation for Lipidomics and Metabolomics (ImpLiMet): Online application for optimization and method selection for missing data imputation

by Anuradha Surendra , Ou, Huiting , Mcdowell, Graeme Sv in Bioinformatics , Datasets , Metabolomics

2024

Motivation: Missing values are often unavoidable in modern high-throughput measurements due to various experimental or analytical reasons. Imputation, the process of replacing missing values in a dataset with estimated values, plays an important role in multivariate and machine learning analyses. Three missingness patterns have been conceptualized: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). Each describes unique dependencies between the missing and observed data. As a result, the optimal imputation method for each dataset depends on the type of data, the cause of the missing data, and the nature of relationships between the missing and observed data. The challenge is to identify the optimal imputation solution for a given dataset. Results: ImpLiMet: is a user-friendly UI-platform that enables users to impute missing data using eight different methods. For the users dataset, ImpLiMet can suggest the optimal imputation solution through a grid search-based investigation of the error rate for imputation across three missingness data simulations. The effect of imputation can be visually assessed by histogram, kurtosis and skewness analyses, as well as principal component analysis (PCA) comparing the impact of the chosen imputation method on the distribution and overall behaviour of the data. Availability and implementation: ImpLiMet is freely available at https://complimet.ca/shiny/implimet/ with software accessible at https://github.com/complimet/ImpLiMet Contact: steffanyann.bennett@uottawa.ca and miroslava.cuperlovic-culf@nrc-cnrc.gca.Competing Interest StatementThe authors have declared no competing interest.Footnotes* Version of the manuscript has been updated to explain new functionalities of the application and include additional validation of presented imputation method optimization. Manuscript is also changed to include additional references and clarifications of approaches.* https://complimet.ca/shiny/implimet/

Paper

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter