Catalogue Search | MBRL

Intelligent breast cancer diagnosis with two-stage using mammogram images

by Yaqub, Muhammad , Mehmood, Atif , Jinchao, Feng in 639/705/117 , 692/699/67/1347 , 692/700/1421

2024

Breast cancer (BC) significantly contributes to cancer-related mortality in women, underscoring the criticality of early detection for optimal patient outcomes. Mammography is a key tool for identifying and diagnosing breast abnormalities; however, accurately distinguishing malignant mass lesions remains challenging. To address this issue, we propose a novel deep learning approach for BC screening utilizing mammography images. Our proposed model comprises three distinct stages: data collection from established benchmark sources, image segmentation employing an Atrous Convolution-based Attentive and Adaptive Trans-Res-UNet (ACA-ATRUNet) architecture, and BC identification via an Atrous Convolution-based Attentive and Adaptive Multi-scale DenseNet (ACA-AMDN) model. The hyperparameters within the ACA-ATRUNet and ACA-AMDN models are optimized using the Modified Mussel Length-based Eurasian Oystercatcher Optimization (MML-EOO) algorithm. The performance is evaluated using a variety of metrics, and a comparative analysis against conventional methods is presented. Our experimental results reveal that the proposed BC detection framework attains superior precision rates in early disease detection, demonstrating its potential to enhance mammography-based screening methodologies.

Journal Article

Share this book

Add to My Shelf

Towards Diverse Binary Segmentation via a Simple yet General Gated Network

by Zhao, Xiaoqi , Zhang, Lihe , Lu, Huchuan in Computer vision , Convolution , Encoders-Decoders

2024

In many binary segmentation tasks, most CNNs-based methods use a U-shape encoder-decoder network as their basic structure. They ignore two key problems when the encoder exchanges information with the decoder: one is the lack of interference control mechanism between them, the other is without considering the disparity of the contributions from different encoder levels. In this work, we propose a simple yet general gated network (GateNet) to tackle them all at once. With the help of multi-level gate units, the valuable context information from the encoder can be selectively transmitted to the decoder. In addition, we design a gated dual branch structure to build the cooperation among the features of different levels and improve the discrimination ability of the network. Furthermore, we introduce a “Fold” operation to improve the atrous convolution and form a novel folded atrous convolution, which can be flexibly embedded in ASPP or DenseASPP to accurately localize foreground objects of various scales. GateNet can be easily generalized to many binary segmentation tasks, including general and specific object segmentation and multi-modal segmentation. Without bells and whistles, our network consistently performs favorably against the state-of-the-art methods under 10 metrics on 33 datasets of 10 binary segmentation tasks.

Journal Article

Share this book

Add to My Shelf

An efficient colorectal cancer detection network using atrous convolution with coordinate attention transformer and histopathological images

by Deivasigamani, Sugitha , Khalid, Majdi , V, Sathiya in 639/705/117 , 639/705/258 , 639/705/794

2024

The second most common type of malignant tumor worldwide is colorectal cancer. Histopathology image analysis offers crucial data for the clinical diagnosis of colorectal cancer. Currently, deep learning techniques are applied to enhance cancer classification and tumor localization in histopathological image analysis. Moreover, traditional deep learning techniques might loss integrated information in the image while evaluating thousands of patches recovered from whole slide images (WSIs). This research proposes a novel colorectal cancer detection network (CCDNet) that combines coordinate attention transformer with atrous convolution. CCDNet first denoises the input histopathological image using a Wiener based Midpoint weighted non-local means filter (WMW-NLM) for guaranteeing precise diagnoses and maintain image features. Also, a novel atrous convolution with coordinate attention transformer (AConvCAT) is introduced, which successfully combines the advantages of two networks to classify colorectal tissue at various scales by capturing local and global information. Further, coordinate attention model is integrated with a Cross-shaped window (CrSWin) transformer for capturing tiny changes in colorectal tissue from multiple angles. The proposed CCDNet achieved accuracy rates of 98.61% and 98.96%, on the colorectal histological image and NCT-CRC-HE-100 K datasets correspondingly. The comparison analysis demonstrates that the suggested framework performed better than the most advanced methods already in use. In hospitals, clinicians can use the proposed CCDNet to verify the diagnosis.

Journal Article

Share this book

Add to My Shelf

Mask-guided SSD for small-object detection

by Ai Yibo , Zhang, Weidong , Chang, Sun in Convolution , Feature maps , Object recognition

2021

Detecting small objects is a challenging job for the single-shot multibox detector (SSD) model due to the limited information contained in features and complex background interference. Here, we increased the performance of the SSD for detecting target objects with small size by enhancing detection features with contextual information and introducing a segmentation mask to eliminate background regions. The proposed model is referred to as a “guided SSD” (Mask-SSD) and includes two branches: a detection branch and a segmentation branch. We created a feature-fusion module to allow the detection branch to exploit contextual information for feature maps with large resolution, with the segmentation branch primarily built with atrous convolution to provide additional contextual information to the detection branch. The input of the segmentation branch was also the output of the detection branch, and output segmentation features were fused with detection features in order to classify and locate target objects. Additionally, segmentation features were applied to generate the mask, which was utilized to guide the detection branch to find objects in potential foreground regions. Evaluation of Mask-SSD on the Tsinghua-Tencent 100K and Caltech pedestrian datasets demonstrated its effectiveness at detecting small objects and comparable performance relative to other state-of-the-art methods.

Journal Article

Share this book

Add to My Shelf

Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation

by Artacho, Bruno , Savakis, Andreas in Accuracy , Neural networks , Semantics

2019

We propose a new efficient architecture for semantic segmentation, based on a “Waterfall” Atrous Spatial Pooling architecture, that achieves a considerable accuracy increase while decreasing the number of network parameters and memory footprint. The proposed Waterfall architecture leverages the efficiency of progressive filtering in the cascade architecture while maintaining multiscale fields-of-view comparable to spatial pyramid configurations. Additionally, our method does not rely on a postprocessing stage with Conditional Random Fields, which further reduces complexity and required training time. We demonstrate that the Waterfall approach with a ResNet backbone is a robust and efficient architecture for semantic segmentation obtaining state-of-the-art results with significant reduction in the number of parameters for the Pascal VOC dataset and the Cityscapes dataset.

Journal Article

Share this book

Add to My Shelf

Bridge Crack Semantic Segmentation Based on Improved Deeplabv3

by Wang, Yuchao , Fu, Huixuan , Meng, Dan in Accuracy , Algorithms , atrous convolution

2021

Cracks are the main goal of bridge maintenance and accurate detection of cracks will help ensure their safe use. Aiming at the problem that traditional image processing methods are difficult to accurately detect cracks, deep learning technology was introduced and a crack detection method based on an improved DeepLabv3+ semantic segmentation algorithm was proposed. In the network structure, the densely connected atrous spatial pyramid pooling module was introduced into the DeepLabv3+ network, which enabled the network to obtain denser pixel sampling, thus enhancing the ability of the network to extract detail features. While obtaining a larger receptive field, the number of network parameters was consistent with the original algorithm. The images of bridge cracks under different environmental conditions were collected, and then a concrete bridge crack segmentation data set was established, and the segmentation model was obtained through end-to-end training of the network. The experimental results showed that the improved DeepLabv3+ algorithm had higher crack segmentation accuracy than the original DeepLabv3+ algorithm, with an average intersection ratio reaching 82.37%, and the segmentation of crack details was more accurate, which proved the effectiveness of the proposed algorithm.

Journal Article

Share this book

Add to My Shelf

Detecting Large-Scale Urban Land Cover Changes from Very High Resolution Remote Sensing Images Using CNN-Based Classification

by Lu, Meng , Zhang, Chi , Ji, Shunping in aerial photography , Algorithms , Artificial neural networks

2019

The study investigates land use/cover classification and change detection of urban areas from very high resolution (VHR) remote sensing images using deep learning-based methods. Firstly, we introduce a fully Atrous convolutional neural network (FACNN) to learn the land cover classification. In the FACNN an encoder, consisting of full Atrous convolution layers, is proposed for extracting scale robust features from VHR images. Then, a pixel-based change map is produced based on the classification map of current images and an outdated land cover geographical information system (GIS) map. Both polygon-based and object-based change detection accuracy is investigated, where a polygon is the unit of the GIS map and an object consists of those adjacent changed pixels on the pixel-based change map. The test data covers a rapidly developing city of Wuhan (8000 km2), China, consisting of 0.5 m ground resolution aerial images acquired in 2014, and 1 m ground resolution Beijing-2 satellite images in 2017, and their land cover GIS maps. Testing results showed that our FACNN greatly exceeded several recent convolutional neural networks in land cover classification. Second, the object-based change detection could achieve much better results than a pixel-based method, and provide accurate change maps to facilitate manual urban land cover updating.

Journal Article

Share this book

Add to My Shelf

Weakly-supervised learning method for the recognition of potato leaf diseases

in Attention , Crop yield , Datasets

2023

As a crucial food crop, potatoes are highly consumed worldwide, while they are also susceptible to being infected by diverse diseases. Early detection and diagnosis can prevent the epidemic of plant diseases and raise crop yields. To this end, this study proposed a weakly-supervised learning approach for the identification of potato plant diseases. The foundation network was applied with the lightweight MobileNet V2, and to enhance the learning ability for minute lesion features, we modified the existing MobileNet-V2 architecture using the fine-tuning approach conducted by transfer learning. Then, the atrous convolution along with the SPP module was embedded into the pre-trained networks, which was followed by a hybrid attention mechanism containing channel attention and spatial attention submodules to efficiently extract high-dimensional features of plant disease images. The proposed approach outperformed other compared methods and achieved a superior performance gain. It realized an average recall rate of 91.99% for recognizing potato disease types on the publicly accessible dataset. In practical field scenarios, the proposed approach separately attained an average accuracy and specificity of 97.33% and 98.39% on the locally collected image dataset. Experimental results present a competitive performance and demonstrate the validity and feasibility of the proposed approach.

Journal Article

Share this book

Add to My Shelf

Deep atrous context convolution generative adversarial network with corner key point extracted feature for nuts classification

by Elakkiya, E. , Priya, S. , Devi, M. Shyamala in 631/114 , 639/166 , 639/705

2026

Deep learning-based nut classification has emerged as a viable way to automate the detection and categorization of different nut varieties in the food processing and agriculture sectors. Conventional techniques for classifying nuts mostly rely on manually created characteristics like texture, color, shape, or edges. These characteristics frequently fall short of capturing the image’s complete complexity, particularly when nuts show tiny visual variances. This research proposes Deep Atrous Context Convolution Generative Adversarial Network (DAC-GAN) model that categorize the 8 classes of nuts like brazil nuts, cashew, peanut, pecan nut, pistachio, chest nut, macadamia and Walnut. This research uses Common Nut KAGGLE dataset with 4,000 nuts images of 8 nuts classes. The DAC-GAN approach overcomes the difficulties of having limited labelled data for nut classification tasks by employing DCGANs’ ability to produce high-quality, synthetic nut images to supplement the dataset. The DCGAN comprises of a discriminator and a generator block. The discriminator block develops the ability to differentiate between synthetic and real images, while the generator block generates realistic nut images from random noise. The real images along with the DCGAN generated images are processed with feature filtering methods to extract the Corner Key Points Featured (CKPF) nuts images. To further enhance the feature selection, the CKPF edges are extracted from the image that provides unique, geometrically distinctive critical corners to further process for representative learning. To proceed with the effective feature extraction and model learning, the CKPF nuts images are processed with atrous convolution that capture the intricate details by expanding the receptive field without losing resolution. The novelty of this work exists by appending the filtration and atrous convolution that acquire the spatial data features from the nut’s images at various resolutions. Atrous convolution was refined by appending the pre-context and post-context block that add the image level information to the features. The effectiveness of the DAC-GAN model was validated with the traditional augmented dataset with all existing filtering images and CNN models. Implementation outcome shows that DAC-GAN found to exhibit high accuracy of 99.83% towards the nuts type classification. The superiority of the DAC-GAN method over traditional approaches is demonstrated by extensive experiments on augmented and DCGAN generated datasets, which achieve higher classification accuracy and generalization across a variety of nut type categorization. The outcome demonstrates that the DCGAN together with atrous convolution have the potential to be an effective tool for automating nut sorting in food industry.

Journal Article

Share this book

Add to My Shelf

MADR-Net: multi-level attention dilated residual neural network for segmentation of medical images

by Bhargava, Rohit , Mittal, Shachi , Rathore, Anurag S. in 692/4019 , 692/700/1421 , Algorithms

2024

Medical image segmentation has made a significant contribution towards delivering affordable healthcare by facilitating the automatic identification of anatomical structures and other regions of interest. Although convolution neural networks have become prominent in the field of medical image segmentation, they suffer from certain limitations. In this study, we present a reliable framework for producing performant outcomes for the segmentation of pathological structures of 2D medical images. Our framework consists of a novel deep learning architecture, called deep multi-level attention dilated residual neural network (MADR-Net), designed to improve the performance of medical image segmentation. MADR-Net uses a U-Net encoder/decoder backbone in combination with multi-level residual blocks and atrous pyramid scene parsing pooling. To improve the segmentation results, channel-spatial attention blocks were added in the skip connection to capture both the global and local features and superseded the bottleneck layer with an ASPP block. Furthermore, we introduce a hybrid loss function that has an excellent convergence property and enhances the performance of the medical image segmentation task. We extensively validated the proposed MADR-Net on four typical yet challenging medical image segmentation tasks: (1) Left ventricle, left atrium, and myocardial wall segmentation from Echocardiogram images in the CAMUS dataset, (2) Skin cancer segmentation from dermoscopy images in ISIC 2017 dataset, (3) Electron microscopy in FIB-SEM dataset, and (4) Fluid attenuated inversion recovery abnormality from MR images in LGG segmentation dataset. The proposed algorithm yielded significant results when compared to state-of-the-art architectures such as U-Net, Residual U-Net, and Attention U-Net. The proposed MADR-Net consistently outperformed the classical U-Net by 5.43%, 3.43%, and 3.92% relative improvement in terms of dice coefficient, respectively, for electron microscopy, dermoscopy, and MRI. The experimental results demonstrate superior performance on single and multi-class datasets and that the proposed MADR-Net can be utilized as a baseline for the assessment of cross-dataset and segmentation tasks.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter