Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
495
result(s) for
"LightGBM"
Sort by:
Prediction of Gas Concentration Based on LSTM-LightGBM Variable Weight Combination Model
2022
Gas accidents threaten the safety of underground coal mining, which are always accompanied by abnormal gas concentration trend. The purpose of this paper is to improve the prediction accuracy of gas concentration so as to prevent gas accidents and improve the level of coal mine safety management. Combining the LSTM model with the LightGBM model, the LSTM-LightGBM model is proposed with variable weight combination method based on residual assignment, which considers not only the time subsequence feature of data, but also the nonlinear characteristics of data. During the data preprocessing, the optimal parameters of gas concentration prediction are determined through the analysis of the Pearson correlation coefficients of different sensor data. The experimental results demonstrate that the mean absolute errors of LSTM-LighGBM, LSTM and LightGBM are 1.94%, 2.19% and 2.77%, respectively. The accuracy of LSTM-LightGBM variable weight combination model is better than that of the two above models, respectively. In this way, this study provides a novel idea and method for gas accident prevention based on gas concentration prediction.
Journal Article
A comparative analysis of gradient boosting algorithms
2021
The family of gradient boosting algorithms has been recently extended with several interesting proposals (i.e. XGBoost, LightGBM and CatBoost) that focus on both speed and accuracy. XGBoost is a scalable ensemble technique that has demonstrated to be a reliable and efficient machine learning challenge solver. LightGBM is an accurate model focused on providing extremely fast training performance using selective sampling of high gradient instances. CatBoost modifies the computation of gradients to avoid the prediction shift in order to improve the accuracy of the model. This work proposes a practical analysis of how these novel variants of gradient boosting work in terms of training speed, generalization performance and hyper-parameter setup. In addition, a comprehensive comparison between XGBoost, LightGBM, CatBoost, random forests and gradient boosting has been performed using carefully tuned models as well as using their default settings. The results of this comparison indicate that CatBoost obtains the best results in generalization accuracy and AUC in the studied datasets although the differences are small. LightGBM is the fastest of all methods but not the most accurate. Finally, XGBoost places second both in accuracy and in training speed. Finally an extensive analysis of the effect of hyper-parameter tuning in XGBoost, LightGBM and CatBoost is carried out using two novel proposed tools.
Journal Article
An Improved LightGBM Algorithm for Online Fault Detection of Wind Turbine Gearboxes
by
Zhao, Qi
,
Li, Linlin
,
Ding, Steven X.
in
bayesian hyper-parameter optimization
,
fault diagnosis
,
gradient boosting algorithm
2020
It is widely accepted that conventional boost algorithms are of low efficiency and accuracy in dealing with big data collected from wind turbine operations. To address this issue, this paper is devoted to the application of an adaptive LightGBM method for wind turbine fault detections. To this end, the realization of feature selection for fault detection is firstly achieved by utilizing the maximum information coefficient to analyze the correlation among features in supervisory control and data acquisition (SCADA) of wind turbines. After that, a performance evaluation criterion is proposed for the improved LightGBM model to support fault detections. In this scheme, by embedding the confusion matrix as a performance indicator, an improved LightGBM fault detection approach is then developed. Based on the adaptive LightGBM fault detection model, a fault detection strategy for wind turbine gearboxes is investigated. To demonstrate the applications of the proposed algorithms and methods, a case study with a three-year SCADA dataset obtained from a wind farm sited in Southern China is conducted. Results indicate that the proposed approaches established a fault detection framework of wind turbine systems with either lower false alarm rate or lower missing detection rate.
Journal Article
LightGBM: accelerated genomically designed crop breeding through ensemble learning
by
Cheng, Qian
,
Yan, Jianbing
,
Xu, Yuetong
in
Algorithms
,
Animal Genetics and Genomics
,
Bioinformatics
2021
LightGBM is an ensemble model of decision trees for classification and regression prediction. We demonstrate its utility in genomic selection-assisted breeding with a large dataset of inbred and hybrid maize lines. LightGBM exhibits superior performance in terms of prediction precision, model stability, and computing efficiency through a series of benchmark tests. We also assess the factors that are essential to ensure the best performance of genomic selection prediction by taking complex scenarios in crop hybrid breeding into account. LightGBM has been implemented as a toolbox, CropGBM, encompassing multiple novel functions and analytical modules to facilitate genomically designed breeding in crops.
Journal Article
A Human Activity Recognition Algorithm Based on Stacking Denoising Autoencoder and LightGBM
by
Gao, Xile
,
Zhang, Yuexia
,
Zhao, Fang
in
deep learning
,
human activity recognition
,
indoor positioning
2019
Recently, the demand for human activity recognition has become more and more urgent. It is widely used in indoor positioning, medical monitoring, safe driving, etc. Existing activity recognition approaches require either the location information of the sensors or the specific domain knowledge, which are expensive, intrusive, and inconvenient for pervasive implementation. In this paper, a human activity recognition algorithm based on SDAE (Stacking Denoising Autoencoder) and LightGBM (LGB) is proposed. The SDAE is adopted to sanitize the noise in raw sensor data and extract the most effective characteristic expression with unsupervised learning. The LGB reveals the inherent feature dependencies among categories for accurate human activity recognition. Extensive experiments are conducted on four datasets of distinct sensor combinations collected by different devices in three typical application scenarios, which are human moving modes, current static, and dynamic behaviors of users. The experimental results demonstrate that our proposed algorithm achieves an average accuracy of 95.99%, outperforming other comparative algorithms using XGBoost, CNN (Convolutional Neural Network), CNN + Statistical features, or single SDAE.
Journal Article
Predicting Hard Rock Pillar Stability Using GBDT, XGBoost, and LightGBM Algorithms
by
Zhao, Guoyan
,
Wu, Hao
,
Liang, Weizhang
in
Algorithms
,
Decision trees
,
Discrete element method
2020
Predicting pillar stability is a vital task in hard rock mines as pillar instability can cause large-scale collapse hazards. However, it is challenging because the pillar stability is affected by many factors. With the accumulation of pillar stability cases, machine learning (ML) has shown great potential to predict pillar stability. This study aims to predict hard rock pillar stability using gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) algorithms. First, 236 cases with five indicators were collected from seven hard rock mines. Afterwards, the hyperparameters of each model were tuned using a five-fold cross validation (CV) approach. Based on the optimal hyperparameters configuration, prediction models were constructed using training set (70% of the data). Finally, the test set (30% of the data) was adopted to evaluate the performance of each model. The precision, recall, and F1 indexes were utilized to analyze prediction results of each level, and the accuracy and their macro average values were used to assess the overall prediction performance. Based on the sensitivity analysis of indicators, the relative importance of each indicator was obtained. In addition, the safety factor approach and other ML algorithms were adopted as comparisons. The results showed that GBDT, XGBoost, and LightGBM algorithms achieved a better comprehensive performance, and their prediction accuracies were 0.8310, 0.8310, and 0.8169, respectively. The average pillar stress and ratio of pillar width to pillar height had the most important influences on prediction results. The proposed methodology can provide a reliable reference for pillar design and stability risk management.
Journal Article
Natural language processing was effective in assisting rapid title and abstract screening when updating systematic reviews
2021
To examine whether the use of natural language processing (NLP) technology is effective in assisting rapid title and abstract screening when updating a systematic review.
Using the searched literature from a published systematic review, we trained and tested an NLP model that enables rapid title and abstract screening when updating a systematic review. The model was a light gradient boosting machine (LightGBM), an ensemble learning classifier which integrates four pretrained Bidirectional Encoder Representations from Transformers (BERT) models. We divided the searched citations into two sets (ie, training and test sets). The model was trained using the training set and assessed for screening performance using the test set. The searched citations, whose eligibility was determined by two independent reviewers, were treated as the reference standard.
The test set included 947 citations; our model included 340 citations, excluded 607 citations, and achieved 96% sensitivity, and 78% specificity. If the classifier assessment in the case study was accepted, reviewers would lose 8 of 180 eligible citations (4%), none of which were ultimately included in the systematic review after full-text consideration, while decreasing the workload by 64.1%.
NLP technology using the ensemble learning method may effectively assist in rapid literature screening when updating systematic reviews.
Journal Article
Machine learning-driven sedation-analgesia optimization in mechanically ventilated sepsis patients: a retrospective MIMIC-IV analysis
by
Gao, Xiaolan
,
Lei, Xianying
,
Hu, Lirong
in
lightgbm
,
mechanical ventilation
,
mechine learning
2026
BackgroundIn the intensive care unit (ICU), septic patients frequently require endotracheal intubation followed by invasive mechanical ventilation. Nonetheless, the optimal sedation-analgesia regimen for these critically ill patients remains undetermined.MethodsThis retrospective observational study analyzed data from the Medical Information Mart for Intensive Care IV (MIMIC-IV version 3.0) database to examine septic patients who underwent endotracheal intubation and subsequent invasive mechanical ventilation in the intensive care unit. Initially, Kaplan–Meier survival analysis and Cox proportional hazards models were employed to evaluate the prognostic impact of different sedation-analgesia regimens. Subsequently, the least absolute shrinkage and selection operator (LASSO) regression was utilized to identify key prognostic factors. Multiple machine learning algorithms were then implemented to develop predictive models, and the SHapley Additive exPlanations (SHAP) method was used to interpret the model outputs and determine the most influential predictors.ResultsFollowing the initial screening process, seven distinct sedation-analgesia regimens with sample sizes greater than 100 were incorporated into the final analysis. Utilizing Kaplan–Meier estimates and Cox regression models, the combination of fentanyl and midazolam was identified as the most advantageous regimen. This association remained statistically significant after adjusting for confounding variables, demonstrating a reduction in the length of stay in the intensive care unit (length of stay in ICU, HR [95% CI]: 0.66 [0.52–0.85]) and a decrease in ICU mortality (OR [95% CI]: 0.62 [0.46–0.85]). Subsequently, LASSO regression analysis identified seven key prognostic factors associated with outcomes in this patient subgroup. Among the machine learning models developed for outcome prediction, the LightGBM model exhibited superior performance (AUC = 0.838). SHAP analysis indicated that the top three predictors of 28-day mortality were the Acute Physiology Score III (APS III), patient age, and the presence of acute renal failure.ConclusionThe concurrent administration of fentanyl and midazolam was associated with lower ICU mortality and shorter length of ICU stay among septic patients necessitating endotracheal intubation and invasive mechanical ventilation, suggesting potential clinical benefit. Furthermore, the LightGBM algorithm exhibited superior predictive accuracy for ICU mortality within this cohort, suggesting its potential utility as a tool for supporting data-driven clinical decision-making.
Journal Article
Age Classification of Rice Seeds in Japan Using Gradient-Boosting and ANFIS Algorithms
2023
The rapidly changing climate affects an extensive spectrum of human-centered environments. The food industry is one of the affected industries due to rapid climate change. Rice is a staple food and an important cultural key point for Japanese people. As Japan is a country in which natural disasters continuously occur, using aged seeds for cultivation has become a regular practice. It is a well-known truth that seed quality and age highly impact germination rate and successful cultivation. However, a considerable research gap exists in the identification of seeds according to age. Hence, this study aims to implement a machine-learning model to identify Japanese rice seeds according to their age. Since agewise datasets are unavailable in the literature, this research implements a novel rice seed dataset with six rice varieties and three age variations. The rice seed dataset was created using a combination of RGB images. Image features were extracted using six feature descriptors. The proposed algorithm used in this study is called Cascaded-ANFIS. A novel structure for this algorithm is proposed in this work, combining several gradient-boosting algorithms such as XGBoost, CatBoost, and LightGBM. The classification was conducted in two steps. First, the seed variety was identified. Then, the age was predicted. As a result, seven classification models were implemented. The performance of the proposed algorithm was evaluated against 13 state-of-the-art algorithms. Overall, the proposed algorithm has a higher accuracy, precision, recall, and F1-score than the others. For the classification of variety, the proposed algorithm scored 0.7697, 0.7949, 0.7707, and 0.7862, respectively. The results of this study confirm that the proposed algorithm can be employed in the successful age classification of seeds.
Journal Article
Diagnosis of Diabetes Mellitus Using Gradient Boosting Machine (LightGBM)
by
Rufo, Derara Duba
,
Negera, Worku Gachena
,
Debelee, Taye Girma
in
Accuracy
,
Algorithms
,
Artificial intelligence
2021
Diabetes mellitus (DM) is a severe chronic disease that affects human health and has a high prevalence worldwide. Research has shown that half of the diabetic people throughout the world are unaware that they have DM and its complications are increasing, which presents new research challenges and opportunities. In this paper, we propose a preemptive diagnosis method for diabetes mellitus (DM) to assist or complement the early recognition of the disease in countries with low medical expert densities. Diabetes data are collected from the Zewditu Memorial Hospital (ZMHDD) in Addis Ababa, Ethiopia. Light Gradient Boosting Machine (LightGBM) is one of the most recent successful research findings for the gradient boosting framework that uses tree-based learning algorithms. It has low computational complexity and, therefore, is suited for applications in limited capacity regions such as Ethiopia. Thus, in this study, we apply the principle of LightGBM to develop an accurate model for the diagnosis of diabetes. The experimental results show that the prepared diabetes dataset is informative to predict the condition of diabetes mellitus. With accuracy, AUC, sensitivity, and specificity of 98.1%, 98.1%, 99.9%, and 96.3%, respectively, the LightGBM model outperformed KNN, SVM, NB, Bagging, RF, and XGBoost in the case of the ZMHDD dataset.
Journal Article