Catalogue Search | MBRL

Artificial Intelligence-Based Drone System for Multiclass Plant Disease Detection Using an Improved Efficient Convolutional Neural Network

by Javed, Ali , Albattah, Waleed , Nawaz, Marriam in Accuracy , Agricultural development , Agricultural production

2022

The role of agricultural development is very important in the economy of a country. However, the occurrence of several plant diseases is a major hindrance to the growth rate and quality of crops. The exact determination and categorization of crop leaf diseases is a complex and time-required activity due to the occurrence of low contrast information in the input samples. Moreover, the alterations in the size, location, structure of crop diseased portion, and existence of noise and blurriness effect in the input images further complicate the classification task. To solve the problems of existing techniques, a robust drone-based deep learning approach is proposed. More specifically, we have introduced an improved EfficientNetV2-B4 with additional added dense layers at the end of the architecture. The customized EfficientNetV2-B4 calculates the deep key points and classifies them in their related classes by utilizing an end-to-end training architecture. For performance evaluation, a standard dataset, namely, the PlantVillage Kaggle along with the samples captured using a drone is used which is complicated in the aspect of varying image samples with diverse image capturing conditions. We attained the average precision, recall, and accuracy values of 99.63, 99.93, and 99.99%, respectively. The obtained results confirm the robustness of our approach in comparison to other recent techniques and also show less time complexity.

Journal Article

Share this book

Add to My Shelf

SkinEHDLF a hybrid deep learning approach for accurate skin cancer classification in complex systems

by Alsafyani, Majed , Baqasah, Abdullah M. , Lilhore, Umesh Kumar in 631/67 , 639/166 , 639/705

2025

Skin cancer represents a significant global public health issue, and prompt and precise detection is essential for effective treatment. This study introduces SkinEHDLF, an innovative deep-learning model that enhances skin cancer classification. SkinEHDLF utilizes the advantages of several advanced models, i.e., ConvNeXt, EfficientNetV2, and Swin Transformer, while integrating an adaptive attention-based feature fusion mechanism to enhance the synthesis of acquired features. This hybrid methodology combines ConvNeXt’s proficient feature extraction capabilities, EfficientNetV2’s scalability, and Swin Transformer’s long-range attention mechanisms, resulting in a highly accurate and dependable model. The adaptive attention mechanism dynamically optimizes feature fusion, enabling the model to focus on the most relevant information, enhancing accuracy and reducing false positives. We trained and evaluated SkinEHDLF using the ISIC 2024 dataset, which comprises 401,059 skin lesion images extracted from 3D total-body photography. The dataset is divided into three categories: melanoma, benign lesions, and noncancerous skin anomalies. The findings indicate the superiority of SkinEHDLF compared to current models. In binary skin cancer classification, SkinEHDLF surpassed baseline models, achieving an AUROC of 99.8% and an accuracy of 98.76%. The model attained 98.6% accuracy, 97.9% precision, 97.3% recall, and 99.7% AUROC across all lesion categories in multi-class classification. SkinEHDLF demonstrates a 7.9% enhancement in accuracy and a 28% decrease in false positives, outperforming leading models including ResNet-50, EfficientNet-B3, ViT-B16, and hybrid methodologies such as ResNet-50 + EfficientNet and ViT + CNN, thereby positioning itself as a more precise and reliable solution for automated skin cancer detection. These findings underscore SkinEHDLF’s capacity to transform dermatological diagnostics by providing a scalable and accurate method for classifying skin cancer.

Journal Article

Share this book

Add to My Shelf

Enhancing EfficientNetv2 with global and efficient channel attention mechanisms for accurate MRI-Based brain tumor classification

by Pacal, Ishak , Bayram, Bilal , Cunha, Antonio in Accuracy , Algorithms , Architecture

2024

The early and accurate diagnosis of brain tumors is critical for effective treatment planning, with Magnetic Resonance Imaging (MRI) serving as a key tool in the non-invasive examination of such conditions. Despite the advancements in Computer-Aided Diagnosis (CADx) systems powered by deep learning, the challenge of accurately classifying brain tumors from MRI scans persists due to the high variability of tumor appearances and the subtlety of early-stage manifestations. This work introduces a novel adaptation of the EfficientNetv2 architecture, enhanced with Global Attention Mechanism (GAM) and Efficient Channel Attention (ECA), aimed at overcoming these hurdles. This enhancement not only amplifies the model’s ability to focus on salient features within complex MRI images but also significantly improves the classification accuracy of brain tumors. Our approach distinguishes itself by meticulously integrating attention mechanisms that systematically enhance feature extraction, thereby achieving superior performance in detecting a broad spectrum of brain tumors. Demonstrated through extensive experiments on a large public dataset, our model achieves an exceptional high-test accuracy of 99.76%, setting a new benchmark in MRI-based brain tumor classification. Moreover, the incorporation of Grad-CAM visualization techniques sheds light on the model’s decision-making process, offering transparent and interpretable insights that are invaluable for clinical assessment. By addressing the limitations inherent in previous models, this study not only advances the field of medical imaging analysis but also highlights the pivotal role of attention mechanisms in enhancing the interpretability and accuracy of deep learning models for brain tumor diagnosis. This research sets the stage for advanced CADx systems, enhancing patient care and treatment outcomes.

Journal Article

Share this book

Add to My Shelf

Graph Neural Network-Based Method of Spatiotemporal Land Cover Mapping Using Satellite Imagery

by Mongus, Domen , Kavran, Domen , Lukač, Niko in Analysis , Classification , Computer software industry

2023

Multispectral satellite imagery offers a new perspective for spatial modelling, change detection and land cover classification. The increased demand for accurate classification of geographically diverse regions led to advances in object-based methods. A novel spatiotemporal method is presented for object-based land cover classification of satellite imagery using a Graph Neural Network. This paper introduces innovative representation of sequential satellite images as a directed graph by connecting segmented land region through time. The method’s novel modular node classification pipeline utilises the Convolutional Neural Network as a multispectral image feature extraction network, and the Graph Neural Network as a node classification model. To evaluate the performance of the proposed method, we utilised EfficientNetV2-S for feature extraction and the GraphSAGE algorithm with Long Short-Term Memory aggregation for node classification. This innovative application on Sentinel-2 L2A imagery produced complete 4-year intermonthly land cover classification maps for two regions: Graz in Austria, and the region of Portorož, Izola and Koper in Slovenia. The regions were classified with Corine Land Cover classes. In the level 2 classification of the Graz region, the method outperformed the state-of-the-art UNet model, achieving an average F1-score of 0.841 and an accuracy of 0.831, as opposed to UNet’s 0.824 and 0.818, respectively. Similarly, the method demonstrated superior performance over UNet in both regions under the level 1 classification, which contains fewer classes. Individual classes have been classified with accuracies up to 99.17%.

Journal Article

Share this book

Add to My Shelf

A two stage blood cell detection and classification algorithm based on improved YOLOv7 and EfficientNetv2

by Hu, ZhiGang , Wang, XinZheng , Ge, AoRu in 631/114/1564 , 639/166/985 , Algorithms

2025

Current diagnoses of leukemia are typically performed manually by physicians on the basis of blood cell morphology, leading to challenges such as excessive workload, limited efficiency, and subjective outcomes. To solve the above problems, a two-stage detection method was developed for the automatic detection and identification of blood cells. First, for the blood cell detection task, an improved YOLOv7 blood cell detection model was proposed that integrates multihead attention and the SCYLLA-IoU (SIoU) loss function to accurately locate and classify white blood cells (WBCs), red blood cells (RBCs), and platelets in a full-field image of blood cells. For the white blood cell identification task of detecting network positioning, an improved EfficientNetv2 classification model was subsequently developed, which integrates the atrous spatial pyramid pooling (ASPP) module to increase classification accuracy and employs the balanced cross-entropy (BCE) function to address sample number imbalance. The experiments utilized four publicly accessible datasets: BCCD, LDWBC, LISC, and Raabin. The proposed detection model achieved an average accuracy of 94.7% in detecting and identifying blood cells in the BCCD dataset. With an IoU equal to 0.5, the model attained a mean average precision (mAP) of 97.17%. In the white blood cell classification task, an average precision (AP) of 95.12% and an average recall (AR) of 97% were achieved on the LDWBC, LISC, and Raabin datasets. The experimental results demonstrate that the proposed two-stage detection method detects and identifies blood cells accurately, thereby facilitating automatic detection, classification, and quantification of blood cell images, which can aid doctors in preliminary leukemia diagnosis.

Journal Article

Share this book

Add to My Shelf

Lightweight tea bud detection method based on improved YOLOv5

by Chen, Shuangchen , Liu, Yuyang , Cui, Jingying in 631/449 , 639/166 , Accuracy

2024

Tea bud detection technology is of great significance in realizing automated and intelligent plucking of tea buds. This study proposes a lightweight tea bud identification model based on modified Yolov5 to increase the picking accuracy and labor efficiency of intelligent tea bud picking while lowering the deployment pressure of mobile terminals. The following methods are used to make improvements: the backbone network CSPDarknet-53 of YOLOv5 is replaced with the EfficientNetV2 feature extraction network to reduce the number of parameters and floating-point operations of the model; the neck network of YOLOv5, the Ghost module is introduced to construct the ghost convolution and C3ghost module to further reduce the number of parameters and floating-point operations of the model; replacing the upsampling module of the neck network with the CARAFE upsampling module can aggregate the contextual tea bud feature information within a larger sensory field and improve the mean average precision of the model in detecting tea buds. The results show that the improved tea bud detection model has a mean average precision of 85.79%, only 4.14 M parameters, and only 5.02G of floating-point operations. The number of parameters and floating-point operations is reduced by 40.94% and 68.15%, respectively, when compared to the original Yolov5 model, but the mean average precision is raised by 1.67% points. The advantages of this paper’s algorithm in tea shot detection can be noticed by comparing it to other YOLO series detection algorithms. The improved YOLOv5 algorithm in this paper can effectively detect tea buds based on lightweight, and provide corresponding theoretical research for intelligent tea-picking robots.

Journal Article

Share this book

Add to My Shelf

STCD-EffV2T Unet: Semi Transfer Learning EfficientNetV2 T-Unet Network for Urban/Land Cover Change Detection Using Sentinel-2 Satellite Images

by Reinartz, Peter , Hasanlou, Mahdi , Gomroki, Masoomeh in Analysis , Change detection , Coders

2023

Change detection in urban areas can be helpful for urban resource management and smart city planning. The effects of human activities on the environment and ground have gained momentum over the past decades, causing remote sensing data sources analysis (such as satellite images) to become an option for swift change detection in the environment and urban areas. We proposed a semi-transfer learning method of EfficientNetV2 T-Unet (EffV2 T-Unet) that combines the effectiveness of composite scaled EfficientNetV2 T as the first path or encoder for feature extraction and convolutional layers of Unet as the second path or decoder for reconstructing the binary change map. In the encoder path, we use EfficientNetV2 T, which was trained by the ImageNet dataset. In this research, we employ two datasets to evaluate the performance of our proposed method for binary change detection. The first dataset is Sentinel-2 satellite images which were captured in 2017 and 2021 in urban areas of northern Iran. The second one is the Onera Satellite Change Detection dataset (OSCD). The performance of the proposed method is compared with YoloX-Unet families, ResNest-Unet families, and other well-known methods. The results demonstrated our proposed method’s effectiveness compared to other methods. The final change map reached an overall accuracy of 97.66%.

Journal Article

Share this book

Add to My Shelf

Novelty Classification Model Use in Reinforcement Learning for Cervical Cancer

by Baltayev, Jushkin , Cho, Young-Im , Shoraimov, Khusanboy in Accuracy , Artificial intelligence , Automation

2024

Purpose: Cervical cancer significantly impacts global health, where early detection is piv- otal for improving patient outcomes. This study aims to enhance the accuracy of cervical cancer diagnosis by addressing class imbalance through a novel hybrid deep learning model. Methods: The proposed model, RL-CancerNet, integrates EfficientNetV2 and Vision Transformers (ViTs) within a Reinforcement Learning (RL) framework. EfficientNetV2 extracts local features from cervical cytology images to capture fine-grained details, while ViTs analyze these features to recognize global dependencies across image patches. To address class imbalance, an RL agent dynamically adjusts the focus towards minority classes, thus reducing the common bias towards majority classes in medical image classification. Additionally, a Supporter Module incorporating Conv3D and BiLSTM layers with an attention mechanism enhances contextual learning. Results: RL-CancerNet was evaluated on the benchmark cervical cytology datasets Herlev and SipaKMeD, achieving an exceptional accuracy of 99.7%. This performance surpasses several state-of-the-art models, demonstrating the model’s effectiveness in identifying subtle diagnostic features in complex backgrounds. Conclusions: The integration of CNNs, ViTs, and RL into RL-CancerNet significantly improves the diagnostic accuracy of cervical cancer screenings. This model not only advances the field of automated medical screening but also provides a scalable framework adaptable to other medical imaging tasks, potentially enhancing diagnostic processes across various medical domains.

Journal Article

Share this book

Add to My Shelf

A hybrid deep learning model EfficientNet with GRU for breast cancer detection from histopathology images

by Sharmila, B. , Pradeepa, M. , Nirmala, M. in 639/166 , 639/166/985 , 639/166/987

2025

Breast cancer remains a significant global health challenge among women, with histopathological image analysis playing a critical role in early detection. However, existing diagnostic models often struggle to extract complex patterns from high-resolution tissue images, limiting their diagnostic accuracy and generalization. This study aims to develop a high-performance deep learning framework for accurate classification of breast cancer in histopathology images, addressing limitations in feature extraction and spatial dependency modeling. A hybrid deep learning model is proposed, integrating EfficientNetV2 for multi-scale feature extraction with a Gated Recurrent Unit (GRU) enhanced by an attention mechanism to model sequential dependencies. The model is trained and evaluated using the BreakHisand Camelyon17 dataset at 200× magnification. Evaluation metrics include precision, recall, F1-score, specificity, Intersection over Union (IoU), and accuracy. The proposed model achieved superior performance compared to existing architectures such as AlexNet, DenseNet, MobileNetV3, and EfficientNet. It attained a precision of 98.15%, recall of 95.68%, F1-score of 96.82%, specificity of 96%, IoU of 93.99%, and accuracy of 95.72% on the test set. The integration of EfficientNetV2 with GRU and attention mechanisms enables effective learning of spatial and contextual features, enhancing the accuracy and interpretability of breast cancer classification from histopathology images. This framework shows strong potential for aiding pathologists in real-time diagnostic workflows.

Journal Article

Share this book

Add to My Shelf

Multi-resolution transfer learning for tampered image classification using SE-enhanced fused-MBConv and optimized CNN heads

by Raj, Rayappa David Amar , Korsipati, Jithin Reddy , Prakasha, K. Krishna in 639/166 , 639/166/987 , Accuracy

2025

The widespread use of digital image tampering has created a strong need for accurate and generalizable detection systems, especially in domains like forensics, journalism, and cybersecurity. Traditional handcrafted methods often fail to capture subtle manipulation artifacts, and many deep learning approaches lack generalization across diverse image sources and manipulation techniques. To address these limitations, we propose a tampered image classification model based on transfer learning using EfficientNetV2B0. This backbone is combined with a lightweight, regularized CNN classification head and optimized using Focal Loss to address class imbalance. The architecture integrates compound scaling, fused MBConv layers, and squeeze-and-excitation (SE) attention to improve feature representation and robustness. We evaluate the model on four benchmark datasets-CASIA v1, Columbia, MICC-F2000, and Defacto (Splicing)-and achieve exceptional performance, with AUC scores up to 1.0000 and F1-scores up to 0.9997. Comparisons with 42 state-of-the-art models, including IML-ViT, MVSS-Net++, ConvNeXtFF, and DRRU-Net, show our method consistently outperforms existing approaches in accuracy, precision, recall, and generalization, particularly on high-resolution and compressed images. These results demonstrate the practical effectiveness and forensic reliability of the proposed system.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter