Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
1,049
result(s) for
"Gao, Xiaohong"
Sort by:
A nonlinear prediction model for Chinese speech signal based on RBF neural network
by
Gao, Xiaohong
in
1193 - Intelligent Processing of Multimedia Signals
,
Algorithms
,
Back propagation networks
2022
A novel method for Chinese speech time series prediction model is proposed. In order to reconstruct the phase space of Chinese speech signal, the delay time and embedding dimension are calculated by C–C method and false nearest neighbor algorithm. The maximum lyapunov exponent and correlation dimension of Chinese speech phoneme are calculated by wolf algorithm and genetic programming algorithm. The numerical results show that there exists nonlinear characteristics in Chinese speech signal. Based on the analysis method of RBF neural network and the nonlinear characteristic parameters such as the delay time and embedding dimension, a nonlinear prediction model is designed. In order to further verify the prediction performance of the designed prediction model, waveform comparison and four evaluation indexes are used. It is shown that compared with the linear prediction model and back propagation neural network nonlinear prediction model, prediction error of the RBF neural network nonlinear prediction model is significantly reduced, and the model has higher prediction accuracy and prediction performance.
Journal Article
Monitoring and early warning of a metal mine tailings pond based on a deep learning bidirectional recurrent long and short memory network
2022
The effective monitoring and early warning capability of metal mine tailings ponds can improve the associated safety risk management level. The infiltration line is an important core index of tailings pond stability. In this paper, a tailings pond monitoring and early warning system, which provides technical support for the design and daily management of tailings reservoir early warning systems, is constructed. Based on a deep learning bidirectional recurrent long and short memory network, an infiltration line prediction model with univariate input and an infiltration line prediction model with multivariate input are proposed. The data adopted are those from four monitoring points of the same cross-section at different positions and data from one adjacent internal lateral displacement and internal vertical displacement monitoring point. Using the adaptive moment estimation (Adam) optimization algorithm and the root mean square error (RMSE) model evaluation metric, the multilayer perceptron model, univariate input model, and multivariate input model are compared. This work shows that their RMSEs are 0.10611, 0.09966, and 0.11955, respectively.
Journal Article
Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data
by
Zhang, Hao
,
Shi, Feifei
,
Li, Runxiang
in
Accuracy
,
Analysis
,
Artificial satellites in remote sensing
2023
Land cover data are important basic data for earth system science and other fields. Multi-source remote sensing images have become the main data source for land cover classification. There are still many uncertainties in the scale effect of image spatial resolution on land cover classification. Since it is difficult to obtain multiple spatial resolution remote sensing images of the same area at the same time, the main current method to study the scale effect of land cover classification is to use the same image resampled to different resolutions, however errors in the resampling process lead to uncertainty in the accuracy of land cover classification. To study the land cover classification scale effect of different spatial resolutions of multi-source remote sensing data, we selected 1 m and 4 m of GF-2, 6 m of SPOT-6, 10 m of Sentinel-2, and 30 m of Landsat-8 multi-sensor data, and explored the scale effect of image spatial resolution on land cover classification from two aspects of mixed image element decomposition and spatial heterogeneity. For the study area, we compared the classification obtained from GF-2, SPOT-6, Sentinel-2, and Landsat-8 images at different spatial resolutions based on GBDT and RF. The results show that (1) GF-2 and SPOT-6 had the best classification results, and the optimal scale based on this classification accuracy was 4–6 m; (2) the optimal scale based on linear decomposition depended on the study area; (3) the optimal scale of land cover was related to spatial heterogeneity, i.e., the more fragmented and complex was the space, the smaller the scale needed; and (4) the resampled images were not sensitive to scale and increased the uncertainty of the classification. These findings have implications for land cover classification and optimal scale selection, scale effects, and landscape ecology uncertainty studies.
Journal Article
An objective comparison of detection and segmentation algorithms for artefacts in clinical endoscopy
by
Ning, Qingtian
,
Schnabel, Julia A.
,
Rittscher, Jens
in
692/308/575
,
692/700/1421/164/2223
,
Algorithms
2020
We present a comprehensive analysis of the submissions to the first edition of the Endoscopy Artefact Detection challenge (EAD). Using crowd-sourcing, this initiative is a step towards understanding the limitations of existing state-of-the-art computer vision methods applied to endoscopy and promoting the development of new approaches suitable for clinical translation. Endoscopy is a routine imaging technique for the detection, diagnosis and treatment of diseases in hollow-organs; the esophagus, stomach, colon, uterus and the bladder. However the nature of these organs prevent imaged tissues to be free of imaging artefacts such as bubbles, pixel saturation, organ specularity and debris, all of which pose substantial challenges for any quantitative analysis. Consequently, the potential for improved clinical outcomes through quantitative assessment of abnormal mucosal surface observed in endoscopy videos is presently not realized accurately. The EAD challenge promotes awareness of and addresses this key bottleneck problem by investigating methods that can accurately classify, localize and segment artefacts in endoscopy frames as critical prerequisite tasks. Using a diverse curated multi-institutional, multi-modality, multi-organ dataset of video frames, the accuracy and performance of 23 algorithms were objectively ranked for artefact detection and segmentation. The ability of methods to generalize to unseen datasets was also evaluated. The best performing methods (top 15%) propose deep learning strategies to reconcile variabilities in artefact appearance with respect to size, modality, occurrence and organ type. However, no single method outperformed across all tasks. Detailed analyses reveal the shortcomings of current training strategies and highlight the need for developing new optimal metrics to accurately quantify the clinical applicability of methods.
Journal Article
Multi- class classification of breast cancer abnormalities using Deep Convolutional Neural Network (CNN)
by
Boodoo-Jahangeer, Nazmeen
,
Nathire, Shaista
,
Dullull, Wasiimah
in
Abnormalities
,
Accuracy
,
Algorithms
2021
The real cause of breast cancer is very challenging to determine and therefore early detection of the disease is necessary for reducing the death rate due to risks of breast cancer. Early detection of cancer boosts increasing the survival chance up to 8%. Primarily, breast images emanating from mammograms, X-Rays or MRI are analyzed by radiologists to detect abnormalities. However, even experienced radiologists face problems in identifying features like micro-calcifications, lumps and masses, leading to high false positive and high false negative. Recent advancement in image processing and deep learning create some hopes in devising more enhanced applications that can be used for the early detection of breast cancer. In this work, we have developed a Deep Convolutional Neural Network (CNN) to segment and classify the various types of breast abnormalities, such as calcifications, masses, asymmetry and carcinomas, unlike existing research work, which mainly classified the cancer into benign and malignant, leading to improved disease management. Firstly, a transfer learning was carried out on our dataset using the pre-trained model ResNet50. Along similar lines, we have developed an enhanced deep learning model, in which learning rate is considered as one of the most important attributes while training the neural network. The learning rate is set adaptively in our proposed model based on changes in error curves during the learning process involved. The proposed deep learning model has achieved a performance of 88% in the classification of these four types of breast cancer abnormalities such as, masses, calcifications, carcinomas and asymmetry mammograms.
Journal Article
Ensemble Learning for the Land Cover Classification of Loess Hills in the Eastern Qinghai–Tibet Plateau Using GF-7 Multitemporal Imagery
2024
The unique geographic environment, diverse ecosystems, and complex landforms of the Qinghai–Tibet Plateau make accurate land cover classification a significant challenge in plateau earth sciences. Given advancements in machine learning and satellite remote sensing technology, this study investigates whether emerging ensemble learning classifiers and submeter-level stereoscopic images can significantly improve land cover classification accuracy in the complex terrain of the Qinghai–Tibet Plateau. This study utilizes multitemporal submeter-level GF-7 stereoscopic images to evaluate the accuracy of 11 typical ensemble learning classifiers (representing bagging, boosting, stacking, and voting strategies) and 3 classification datasets (single-temporal, multitemporal, and feature-optimized datasets) for land cover classification in the loess hilly area of the Eastern Qinghai–Tibet Plateau. The results indicate that compared to traditional single strong classifiers (such as CART, SVM, and MLPC), ensemble learning classifiers can improve land cover classification accuracy by 5% to 9%. The classification accuracy differences among the 11 ensemble learning classifiers are generally within 1% to 3%, with HistGBoost, LightGBM, and AdaBoost-DT achieving a classification accuracy comparable to CNNs, with the highest overall classification accuracy (OA) exceeding 93.3%. All ensemble learning classifiers achieved better classification accuracy using multitemporal datasets, with the classification accuracy differences among the three classification datasets generally within 1% to 3%. Feature selection and feature importance evaluation show that spectral bands (e.g., the summer near-infrared (NIR-S) band), topographic factors (e.g., the digital elevation model (DEM)), and spectral indices (e.g., the summer resident ratio index (RRI-S)) significantly contribute to the accuracy of each ensemble learning classifier. Using feature-optimized datasets, ensemble classifiers can improve classification efficiency. This study preliminarily confirms that GF-7 images are suitable for land cover classification in complex terrains and that using ensemble learning classifiers and multitemporal datasets can improve classification accuracy.
Journal Article
A Framework for Subregion Ensemble Learning Mapping of Land Use/Land Cover at the Watershed Scale
2024
Land use/land cover (LULC) data are essential for Earth science research. Due to the high fragmentation and heterogeneity of landscapes, machine learning-based LULC classification frequently emphasizes results such as classification accuracy, efficiency, and variable importance analysis. However, this approach often overlooks the intermediate processes, and LULC mapping that relies on a single classifier typically does not yield satisfactory results. In this paper, to obtain refined LULC classification products at the watershed scale and improve the accuracy and efficiency of watershed-scale mapping, we propose a subregion ensemble learning classification framework. The Huangshui River watershed, located in the transition belts between the Qinghai-Tibet Plateau and Loess Plateau, is chosen as the case study area, and Sentinel-2A/B multi-temporal data are selected for ensemble learning classification. Using the proposed method, the block classification scale is analyzed and illustrated at the watershed, and the classification accuracy and efficiency of the new method are compared and analyzed against three ensemble learning methods using several variables. The proposed watershed-scale ensemble learning framework has better accuracy and efficiency for LULC mapping and has certain advantages over the other methods. The method proposed in this study provides new ideas for watershed-scale LULC mapping technology.
Journal Article
Estimation and mapping of soil texture content based on unmanned aerial vehicle hyperspectral imaging
2023
Soil texture is one of the important physical and natural properties of soil. Much of the current research focuses on soil texture monitoring using non-imaging geophysical spectrometers. However there are fewer studies utilizing unmanned aerial vehicle (UAV) hyperspectral data for soil texture monitoring. UAV mounted hyperspectral cameras can be used for quickly and accurately obtaining high-resolution spatial information of soil texture. A foundation has been laid for the realization of rapid soil texture surveys using unmanned airborne hyperspectral data without field sampling. This study selected three typical farmland areas in Huangshui Basin of Qinghai as the study area, and a total of 296 soil samples were collected. Data calibration of UAV spectra using laboratory spectra and field in situ spectra to explore the feasibility of applying laboratory soil texture models directly to field conditions. This results show that UAV hyperspectral imagery combined with machine learning can obtain a set of ideal processing methods. The pre-processing of the spectral data can obtain high accuracy of soil texture estimation and good mapping effect. The results of this study can provide effective technical support and decision-making assistance for future agricultural land planning on the Tibetan Plateau. The main innovation of this study is to establish a set of processing procedures and methods applicable to UAV hyperspectral imagery to provide data reference for monitoring soil texture in agricultural fields on the Tibetan Plateau.
Journal Article
Fine-Scale Grassland Classification Using UAV-Based Multi-Sensor Image Fusion and Deep Learning
2025
Grassland classification via remote sensing is essential for ecosystem monitoring and precision management, yet conventional satellite-based approaches are fundamentally constrained by coarse spatial resolution. To overcome this limitation, we harness high-resolution UAV multi-sensor data, integrating multi-scale image fusion with deep learning to achieve fine-scale grassland classification that satellites cannot provide. First, four categories of UAV data, including RGB, multispectral, thermal infrared, and LiDAR point cloud, were collected, and a fused image tensor consisting of 10 channels (NDVI, VCI, CHM, etc.) was constructed through orthorectification and resampling. For feature-level fusion, four deep fusion networks were designed. Among them, the MultiScale Pyramid Fusion Network, utilizing a pyramid pooling module, effectively integrated spectral and structural features, achieving optimal performance in all six image fusion evaluation metrics, including information entropy (6.84), spatial frequency (15.56), and mean gradient (12.54). Subsequently, training and validation datasets were constructed by integrating visual interpretation samples. Four backbone networks, including UNet++, DeepLabV3+, PSPNet, and FPN, were employed, and attention modules (SE, ECA, and CBAM) were introduced separately to form 12 model combinations. Results indicated that the UNet++ network combined with the SE attention module achieved the best segmentation performance on the validation set, with a mean Intersection over Union (mIoU) of 77.68%, overall accuracy (OA) of 86.98%, F1-score of 81.48%, and Kappa coefficient of 0.82. In the categories of Leymus chinensis and Puccinellia distans, producer’s accuracy (PA)/user’s accuracy (UA) reached 86.46%/82.30% and 82.40%/77.68%, respectively. Whole-image prediction validated the model’s coherent identification capability for patch boundaries. In conclusion, this study provides a systematic approach for integrating multi-source UAV remote sensing data and intelligent grassland interpretation, offering technical support for grassland ecological monitoring and resource assessment.
Journal Article
Development of an ensemble CNN model with explainable AI for the classification of gastrointestinal cancer
by
Gooda Sahib, Nuzhah
,
Auzine, Muhammad Muzzammil
,
Baichoo, Sunilduth
in
Accuracy
,
Artificial Intelligence
,
Artificial neural networks
2024
The implementation of AI assisted cancer detection systems in clinical environments has faced numerous hurdles, mainly because of the restricted explainability of their elemental mechanisms, even though such detection systems have proven to be highly effective. Medical practitioners are skeptical about adopting AI assisted diagnoses as due to the latter’s inability to be transparent about decision making processes. In this respect, explainable artificial intelligence (XAI) has emerged to provide explanations for model predictions, thereby overcoming the computational black box problem associated with AI systems. In this particular research, the focal point has been the exploration of the Shapley additive explanations (SHAP) and local interpretable model-agnostic explanations (LIME) approaches which enable model prediction explanations. This study used an ensemble model consisting of three convolutional neural networks(CNN): InceptionV3, InceptionResNetV2 and VGG16, which was based on averaging techniques and by combining their respective predictions. These models were trained on the Kvasir dataset, which consists of pathological findings related to gastrointestinal cancer. An accuracy of 96.89% and F1-scores of 96.877% were attained by our ensemble model. Following the training of the ensemble model, we employed SHAP and LIME to analyze images from the three classes, aiming to provide explanations regarding the deterministic features influencing the model’s predictions. The results obtained from this analysis demonstrated a positive and encouraging advancement in the exploration of XAI approaches, specifically in the context of gastrointestinal cancer detection within the healthcare domain.
Journal Article