Catalogue Search | MBRL

Predictive modelling to support sensitivity analysis for robust design in aerospace engineering

by Dasari, Siva Krishna , Cheddad, Abbas , Andersson, Petter in Aerospace engineering , Aerospace engines , Aircraft

2020

The design of aircraft engines involves computationally expensive engineering simulations. One way to solve this problem is the use of response surface models to approximate the high-fidelity time-consuming simulations while reducing computational time. For a robust design, sensitivity analysis based on these models allows for the efficient study of uncertain variables’ effect on system performance. The aim of this study is to support sensitivity analysis for a robust design in aerospace engineering. For this, an approach is presented in which random forests (RF) and multivariate adaptive regression splines (MARS) are explored to handle linear and non-linear response types for response surface modelling. Quantitative experiments are conducted to evaluate the predictive performance of these methods with Turbine Rear Structure (a component of aircraft) case study datasets for response surface modelling. Furthermore, to test these models’ applicability to perform sensitivity analysis, experiments are conducted using mathematical test problems (linear and non-linear functions) and their results are presented. From the experimental investigations, it appears that RF fits better on non-linear functions compared with MARS, whereas MARS fits well on linear functions.

Journal Article

Share this book

Add to My Shelf

Vowel segmentation impact on machine learning classification for chronic obstructive pulmonary disease

by Anderberg, Peter , Cheddad, Abbas , Idrisoglu, Alper in 692/308/53/2421 , 692/308/53/2422 , 692/308/53/2423

2025

Vowel-based voice analysis is gaining attention as a potential non-invasive tool for COPD classification, offering insights into phonatory function. The growing need for voice data has necessitated the adoption of various techniques, including segmentation, to augment existing datasets for training comprehensive Machine Learning (ML) modelsThis study aims to investigate the possible effects of segmentation of the utterance of vowel \"a\" on the performance of ML classifiers CatBoost (CB), Random Forest (RF), and Support Vector Machine (SVM). This research involves training individual ML models using three distinct dataset constructions: full-sequence, segment-wise, and group-wise, derived from the utterance of the vowel \"a\" which consists of 1058 recordings belonging to 48 participants. This approach comprehensively analyzes how each data categorization impacts the model's performance and results. A nested cross-validation (nCV) approach was implemented with grid search for hyperparameter optimization. This rigorous methodology was employed to minimize overfitting risks and maximize model performance. Compared to the full-sequence dataset, the findings indicate that the second segment yielded higher results within the four-segment category. Specifically, the CB model achieved superior accuracy, attaining 97.8% and 84.6% on the validation and test sets, respectively. The same category for the CB model also demonstrated the best balance regarding true positive rate (TPR) and true negative rate (TNR), making it the most clinically effective choice. These findings suggest that time-sensitive properties in vowel production are important for COPD classification and that segmentation can aid in capturing these properties. Despite these promising results, the dataset size and demographic homogeneity limit generalizability, highlighting areas for future research. Trial registration The study is registered on clinicaltrials.gov with ID: NCT06160674.

Journal Article

Share this book

Add to My Shelf

SHIBR—The Swedish Historical Birth Records: a semi-annotated dataset

by Sundin, Lena , Hall, Johan , Hilmkil, Agrin in Algorithms , Archives & records , Artificial Intelligence

2021

This paper presents a digital image dataset of historical handwritten birth records stored in the archives of several parishes across Sweden, together with the corresponding metadata that supports the evaluation of document analysis algorithms’ performance. The dataset is called SHIBR (the Swedish Historical Birth Records). The contribution of this paper is twofold. First, we believe it is the first and the largest Swedish dataset of its kind provided as open access (15,000 high-resolution colour images of the era between 1800 and 1840). We also perform some data mining of the dataset to uncover some statistics and facts that might be of interest and use to genealogists. Second, we provide a comprehensive survey of contemporary datasets in the field that are open to the public along with a compact review of word spotting techniques. The word transcription file contains 17 columns of information pertaining to each image (e.g., child’s first name, birth date, date of baptism, father's first/last name, mother’s first/last name, death records, town, job title of the father/mother, etc.). Moreover, we evaluate some deep learning models, pre-trained on two other renowned datasets, for word spotting in SHIBR. However, our dataset proved challenging due to the unique handwriting style. Therefore, the dataset could also be used for competitions dedicated to a large set of document analysis problems, including word spotting.

Journal Article

Share this book

Add to My Shelf

Explainable Multimedia Feature Fusion for Medical Applications

by Hemmje, Matthias , Cheddad, Abbas , Mc Kevitt, Paul in Algorithms , Artificial intelligence , Automation

2022

Due to the exponential growth of medical information in the form of, e.g., text, images, Electrocardiograms (ECGs), X-rays, and multimedia, the management of a patient’s data has become a huge challenge. In particular, the extraction of features from various different formats and their representation in a homogeneous way are areas of interest in medical applications. Multimedia Information Retrieval (MMIR) frameworks, like the Generic Multimedia Analysis Framework (GMAF), can contribute to solving this problem, when adapted to special requirements and modalities of medical applications. In this paper, we demonstrate how typical multimedia processing techniques can be extended and adapted to medical applications and how these applications benefit from employing a Multimedia Feature Graph (MMFG) and specialized, efficient indexing structures in the form of Graph Codes. These Graph Codes are transformed to feature relevant Graph Codes by employing a modified Term Frequency Inverse Document Frequency (TFIDF) algorithm, which further supports value ranges and Boolean operations required in the medical context. On this basis, various metrics for the calculation of similarity, recommendations, and automated inferencing and reasoning can be applied supporting the field of diagnostics. Finally, the presentation of these new facilities in the form of explainability is introduced and demonstrated. Thus, in this paper, we show how Graph Codes contribute new querying options for diagnosis and how Explainable Graph Codes can help to readily understand medical multimedia formats.

Journal Article

Share this book

Add to My Shelf

Area and Volumetric Density Estimation in Processed Full-Field Digital Mammograms for Risk Assessment of Breast Cancer

by Humphreys, Keith , Cheddad, Abbas , Hall, Per in Adult , Aged , Algorithms

2014

Mammographic density, the white radiolucent part of a mammogram, is a marker of breast cancer risk and mammographic sensitivity. There are several means of measuring mammographic density, among which are area-based and volumetric-based approaches. Current volumetric methods use only unprocessed, raw mammograms, which is a problematic restriction since such raw mammograms are normally not stored. We describe fully automated methods for measuring both area and volumetric mammographic density from processed images. The data set used in this study comprises raw and processed images of the same view from 1462 women. We developed two algorithms for processed images, an automated area-based approach (CASAM-Area) and a volumetric-based approach (CASAM-Vol). The latter method was based on training a random forest prediction model with image statistical features as predictors, against a volumetric measure, Volpara, for corresponding raw images. We contrast the three methods, CASAM-Area, CASAM-Vol and Volpara directly and in terms of association with breast cancer risk and a known genetic variant for mammographic density and breast cancer, rs10995190 in the gene ZNF365. Associations with breast cancer risk were evaluated using images from 47 breast cancer cases and 1011 control subjects. The genetic association analysis was based on 1011 control subjects. All three measures of mammographic density were associated with breast cancer risk and rs10995190 (p<0.025 for breast cancer risk and p<1 × 10(-6) for rs10995190). After adjusting for one of the measures there remained little or no evidence of residual association with the remaining density measures (p>0.10 for risk, p>0.03 for rs10995190). Our results show that it is possible to obtain reliable automated measures of volumetric and area mammographic density from processed digital images. Area and volumetric measures of density on processed digital images performed similar in terms of risk and genetic association.

Journal Article

Share this book

Add to My Shelf

Quantitative 3D OPT and LSFM datasets of pancreata from mice with streptozotocin-induced diabetes

by Berggren, Per-Olof , Ahlgren, Ulf , Ilegems, Erwin in 631/154/51 , 631/1647/245/2226 , 692/308/2778

2022

Mouse models for streptozotocin (STZ) induced diabetes probably represent the most widely used systems for preclinical diabetes research, owing to the compound’s toxic effect on pancreatic β-cells. However, a comprehensive view of pancreatic β-cell mass distribution subject to STZ administration is lacking. Previous assessments have largely relied on the extrapolation of stereological sections, which provide limited 3D-spatial and quantitative information. This data descriptor presents multiple ex vivo tomographic optical image datasets of the full β-cell mass distribution in mice subject to single high and multiple low doses of STZ administration, and in glycaemia recovered mice. The data further include information about structural features, such as individual islet β-cell volumes, spatial coordinates, and shape as well as signal intensities for both insulin and GLUT2. Together, they provide the most comprehensive anatomical record of the effects of STZ administration on the islet of Langerhans in mice. As such, this data descriptor may serve as reference material to facilitate the planning, use and (re)interpretation of this widely used disease model. Measurement(s) Fluorescent antibody staining of Insulin and GLUT2 in whole mouse pancreata Technology Type(s) Optical Projection Tomography • Light Sheet Fluorescence Microscopy (Ultramicroscope) Factor Type(s) mouse genotype • Streptozotocin dosage Sample Characteristic - Organism Mus musculus

Journal Article

Share this book

Add to My Shelf

Digital Image Processing and Development of Machine Learning Models for the Discrimination of Corneal Pathology: An Experimental Model

by Cheddad, Abbas , Jimenez-Perez, Julio Cesar , Bustamante-Arias, Andres in Algorithms , Artificial intelligence , Artificial neural networks

2021

Machine learning (ML) has an impressive capacity to learn and analyze a large volume of data. This study aimed to train different algorithms to discriminate between healthy and pathologic corneal images by evaluating digitally processed spectral-domain optical coherence tomography (SD-OCT) corneal images. A set of 22 SD-OCT images belonging to a random set of corneal pathologies was compared to 71 healthy corneas (control group). A binary classification method was applied where three approaches of ML were explored. Once all images were analyzed, representative areas from every digital image were also extracted, processed and analyzed for a statistical feature comparison between healthy and pathologic corneas. The best performance was obtained from transfer learning—support vector machine (TL-SVM) (AUC = 0.94, SPE 88%, SEN 100%) and transfer learning—random forest (TL- RF) method (AUC = 0.92, SPE 84%, SEN 100%), followed by convolutional neural network (CNN) (AUC = 0.84, SPE 77%, SEN 91%) and random forest (AUC = 0.77, SPE 60%, SEN 95%). The highest diagnostic accuracy in classifying corneal images was achieved with the TL-SVM and the TL-RF models. In image classification, CNN was a strong predictor. This pilot experimental study developed a systematic mechanized system to discern pathologic from healthy corneas using a small sample.

Journal Article

Share this book

Add to My Shelf

Clustering-based adaptive data augmentation for class-imbalance in machine learning (CADA): additive manufacturing use case

by Dasari, Siva Krishna , Cheddad, Abbas , Lundberg, Lars in Additive Manufacturing , Aerospace application , Aerospace industry

2025

Large amount of data are generated from in-situ monitoring of additive manufacturing (AM) processes which is later used in prediction modelling for defect classification to speed up quality inspection of products. A high volume of this process data is defect-free (majority class) and a lower volume of this data has defects (minority class) which result in the class-imbalance issue. Using imbalanced datasets, classifiers often provide sub-optimal classification results, i.e. better performance on the majority class than the minority class. However, it is important for process engineers that models classify defects more accurately than the class with no defects since this is crucial for quality inspection. Hence, we address the class-imbalance issue in manufacturing process data to support in-situ quality control of additive manufactured components. For this, we propose cluster-based adaptive data augmentation (CADA) for oversampling to address the class-imbalance problem. Quantitative experiments are conducted to evaluate the performance of the proposed method and to compare with other selected oversampling methods using AM datasets from an aerospace industry and a publicly available casting manufacturing dataset. The results show that CADA outperformed random oversampling and the SMOTE method and is similar to random data augmentation and cluster-based oversampling. Furthermore, the results of the statistical significance test show that there is a significant difference between the studied methods. As such, the CADA method can be considered as an alternative method for oversampling to improve the performance of models on the minority class.

Journal Article

Share this book

Add to My Shelf

ARDIS: a Swedish historical handwritten digit dataset

by Cheddad, Abbas , Hall, Johan , Yavariabdi, Amir in ARDIS dataset , Artificial Intelligence , Benchmark

2020

This paper introduces a new image-based handwritten historical digit dataset named Arkiv Digital Sweden (ARDIS). The images in ARDIS dataset are extracted from 15,000 Swedish church records which were written by different priests with various handwriting styles in the nineteenth and twentieth centuries. The constructed dataset consists of three single-digit datasets and one-digit string dataset. The digit string dataset includes 10,000 samples in red–green–blue color space, whereas the other datasets contain 7600 single-digit images in different color spaces. An extensive analysis of machine learning methods on several digit datasets is carried out. Additionally, correlation between ARDIS and existing digit datasets Modified National Institute of Standards and Technology (MNIST) and US Postal Service (USPS) is investigated. Experimental results show that machine learning algorithms, including deep learning methods, provide low recognition accuracy as they face difficulties when trained on existing datasets and tested on ARDIS dataset. Accordingly, convolutional neural network trained on MNIST and USPS and tested on ARDIS provide the highest accuracies 58.80 % and 35.44 % , respectively. Consequently, the results reveal that machine learning methods trained on existing datasets can have difficulties to recognize digits effectively on our dataset which proves that ARDIS dataset has unique characteristics. This dataset is publicly available for the research community to further advance handwritten digit recognition algorithms.

Journal Article

Share this book

Add to My Shelf

Magnetic Resonance Imaging Based Radiomic Models of Prostate Cancer: A Narrative Review

by Clarke, Sharon E. , Chaddad, Ahmad , Zhang, Mingli in Artificial intelligence , Biomarkers , Biopsy

2021

The management of prostate cancer (PCa) is dependent on biomarkers of biological aggression. This includes an invasive biopsy to facilitate a histopathological assessment of the tumor’s grade. This review explores the technical processes of applying magnetic resonance imaging based radiomic models to the evaluation of PCa. By exploring how a deep radiomics approach further optimizes the prediction of a PCa’s grade group, it will be clear how this integration of artificial intelligence mitigates existing major technological challenges faced by a traditional radiomic model: image acquisition, small data sets, image processing, labeling/segmentation, informative features, predicting molecular features and incorporating predictive models. Other potential impacts of artificial intelligence on the personalized treatment of PCa will also be discussed. The role of deep radiomics analysis-a deep texture analysis, which extracts features from convolutional neural networks layers, will be highlighted. Existing clinical work and upcoming clinical trials will be reviewed, directing investigators to pertinent future directions in the field. For future progress to result in clinical translation, the field will likely require multi-institutional collaboration in producing prospectively populated and expertly labeled imaging libraries.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter