Catalogue Search | MBRL

C-UNet: Complement UNet for Remote Sensing Road Extraction

by Hou, Yuewu , Liu, Zhaoying , Zhang, Ting in complementary UNet , dilated convolution , fixed threshold

2021

Roads are important mode of transportation, which are very convenient for people’s daily work and life. However, it is challenging to accuratly extract road information from a high-resolution remote sensing image. This paper presents a road extraction method for remote sensing images with a complement UNet (C-UNet). C-UNet contains four modules. Firstly, the standard UNet is used to roughly extract road information from remote sensing images, getting the first segmentation result; secondly, a fixed threshold is utilized to erase partial extracted information; thirdly, a multi-scale dense dilated convolution UNet (MD-UNet) is introduced to discover the complement road areas in the erased masks, obtaining the second segmentation result; and, finally, we fuse the extraction results of the first and the third modules, getting the final segmentation results. Experimental results on the Massachusetts Road dataset indicate that our C-UNet gets the higher results than the state-of-the-art methods, demonstrating its effectiveness.

Journal Article

Share this book

Add to My Shelf

AGs-Unet: Building Extraction Model for High Resolution Remote Sensing Images Based on Attention Gates U Network

by Yu, Mingyang , Chen, Xiaoxian , Liu, Yaohui in Accuracy , AGs-Unet model , building extraction

2022

Building contour extraction from high-resolution remote sensing images is a basic task for the reasonable planning of regional construction. Recently, building segmentation methods based on the U-Net network have become popular as they largely improve the segmentation accuracy by applying ‘skip connection’ to combine high-level and low-level feature information more effectively. Meanwhile, researchers have demonstrated that introducing an attention mechanism into U-Net can enhance local feature expression and improve the performance of building extraction in remote sensing images. In this paper, we intend to explore the effectiveness of the primeval attention gate module and propose the novel Attention Gate Module (AG) based on adjusting the position of ‘Resampler’ in an attention gate to Sigmoid function for a building extraction task, and a novel Attention Gates U network (AGs-Unet) is further proposed based on AG, which can automatically learn different forms of building structures in high-resolution remote sensing images and realize efficient extraction of building contour. AGs-Unet integrates attention gates with a single U-Net network, in which a series of attention gate modules are added into the ‘skip connection’ for suppressing the irrelevant and noisy feature responses in the input image to highlight the dominant features of the buildings in the image. AGs-Unet improves the feature selection of the attention map to enhance the ability of feature learning, as well as paying attention to the feature information of small-scale buildings. We conducted the experiments on the WHU building dataset and the INRIA Aerial Image Labeling dataset, in which the proposed AGs-Unet model is compared with several classic models (such as FCN8s, SegNet, U-Net, and DANet) and two state-of-the-art models (such as PISANet, and ARC-Net). The extraction accuracy of each model is evaluated by using three evaluation indexes, namely, overall accuracy, precision, and intersection over union. Experimental results show that the proposed AGs-Unet model can improve the quality of building extraction from high-resolution remote sensing images effectively in terms of prediction performance and result accuracy.

Journal Article

Share this book

Add to My Shelf

Modified UNet Model for Brain Stroke Lesion Segmentation on Computed Tomography Images

by Postolache, Octavian , Gamry, Khaled , Shiyapov, Kadrzhan

2022

Journal Article

Share this book

Add to My Shelf

Clinically Applicable Segmentation of Head and Neck Anatomy for Radiotherapy: Deep Learning Algorithm Development and Validation Study

by Nikolov, Stanislav , Blackwell, Sam , Mendes, Ruheena in Algorithms , Anatomy , Artificial intelligence

2021

Background: Over half a million individuals are diagnosed with head and neck cancer each year globally. Radiotherapy is an important curative treatment for this disease, but it requires manual time to delineate radiosensitive organs at risk. This planning process can delay treatment while also introducing interoperator variability, resulting in downstream radiation dose differences. Although auto-segmentation algorithms offer a potentially time-saving solution, the challenges in defining, quantifying, and achieving expert performance remain. Objective: Adopting a deep learning approach, we aim to demonstrate a 3D U-Net architecture that achieves expert-level performance in delineating 21 distinct head and neck organs at risk commonly segmented in clinical practice. Methods: The model was trained on a data set of 663 deidentified computed tomography scans acquired in routine clinical practice and with both segmentations taken from clinical practice and segmentations created by experienced radiographers as part of this research, all in accordance with consensus organ at risk definitions. Results: We demonstrated the model’s clinical applicability by assessing its performance on a test set of 21 computed tomography scans from clinical practice, each with 21 organs at risk segmented by 2 independent experts. We also introduced surface Dice similarity coefficient, a new metric for the comparison of organ delineation, to quantify the deviation between organ at risk surface contours rather than volumes, better reflecting the clinical task of correcting errors in automated organ segmentations. The model’s generalizability was then demonstrated on 2 distinct open-source data sets, reflecting different centers and countries to model training. Conclusions: Deep learning is an effective and clinically applicable technique for the segmentation of the head and neck anatomy for radiotherapy. With appropriate validation studies and regulatory approvals, this system could improve the efficiency, consistency, and safety of radiotherapy pathways.

Journal Article

Share this book

Add to My Shelf

Multitemporal Water Extraction of Dongting Lake and Poyang Lake Based on an Automatic Water Extraction and Dynamic Monitoring Framework

by Zhang, Hong , Wu, Fan , Wang, Chao in data collection , dynamic monitoring , flood control

2021

Timely and accurate large-scale water body mapping and dynamic monitoring are of great significance for water resource planning, flood control, and disaster reduction applications. Synthetic aperture radar (SAR) systems have the characteristics of strong operability, wide coverage, and all-weather data availability, and play a key role in large-scale water monitoring applications. However, there are still some challenges in the application of highly efficient, high-precision water extraction and dynamic monitoring methods. In this paper, a framework for the automatic extraction and long-term change monitoring of water bodies is proposed. First, a multitemporal water sample dataset is produced based on the bimodal threshold segmentation method. Second, attention block and pyramid module are introduced into the UNet (encoder-decoder) model to construct a robust water extraction network (PA-UNet). Then, GIS modeling is used for the automatic postprocessing of the water extraction results. Finally, the results are mapped and statistically analyzed. The whole process realizes end-to-end input and output. Sentinel-1 data covering Dongting Lake and Poyang Lake are selected for water extraction and dynamic monitoring analysis from 2017 to 2020, and Sentinel-2 images from a similar time frame are selected for verification. The results show that the proposed framework can realize high-precision (the extraction accuracy is higher than 95%), highly efficient automatic water extraction. Multitemporal monitoring results show that Dongting Lake and Poyang Lake fluctuate most in April, July, and November in 2017, 2019, and 2020, and the change trends of the two lakes are the same.

Journal Article

Share this book

Add to My Shelf

Automated segmentation of the breech and firing pin faces of fired cartridge case images

by Puch-Solis, Roberto , Farhan, Santo , Mookiah, Muthu Rama Krishnan in Algorithms , Ammunition , Artificial neural networks

2025

Firearm identification plays a crucial role in criminal justice globally. The capability to link firearms to specific crimes is invaluable for investigations and court cases. Each firearm leaves distinctive markings on bullets and cartridge cases, creating a “mechanical fingerprint” that can be used for the comparison of bullets and cartridge cases and underpins this area of forensic science. Cartridge cases fired from the same firearm exhibit similar markings on their bases. These traces can be used for investigation purposes as a means to potentially provide a link between more than one scene where cartridge cases have been recovered, or to provide a potential evidential link between a firearm and a cartridge case. These applications involve comparing the markings on the base of two or more cartridge cases, consisting of the headstamp, breech face and firing pin areas. The headstamp area usually contains information about the manufacturer and the calibre. Once this is considered, the remaining task is to compare the breech and firing pin areas of the two cartridges. Currently, some automated methods exist for this comparison, all of which involve the removal of the headstamp area to minimize bias. Some semi-automated methods for headstamp removal are available, and recently, an automated deep learning method that can be applied to 256 × 256 pixel resolution images has been introduced. In this article, we also propose a deep learning method addressing a more computationally demanding task of removing the head stamp area in higher-resolution images, 512 × 512 and 2592 × 1944 pixels, which will permit the automated extraction of finer features at a higher resolution. We also (a) introduce a post-processing method that improves the performance of our method, (b) provide the labelled data that we have produced so it can be used, together with the NIST database of cartridge case images, as a benchmark for future research, and (c) provide the estimated weights and models of the convolutional neural networks that can either be used directly or as initial values for further research. This article contributes to the emerging body of research on deep learning applications in forensic science. •The only article addressing segmentation of cartridge cases at 512 × 512 resolution.•Three CNNs tested for cartridge-case segmentation: UNet, Dense UNet and DeepLabv3+.•Deeplabv3+ performs well for segmenting cartridge case images.•Upsampling predicted masks from DeepLabv3+ to 2592 × 1944 maintains accuracy.•Postprocessing was used to remove small artefacts after segmentation.

Journal Article

Share this book

Add to My Shelf

Improved UNet with Attention for Medical Image Segmentation

by AL Qurri, Ahmed , Almekkawy, Mohamed in attention , Computational linguistics , CT imaging

2023

Medical image segmentation is crucial for medical image processing and the development of computer-aided diagnostics. In recent years, deep Convolutional Neural Networks (CNNs) have been widely adopted for medical image segmentation and have achieved significant success. UNet, which is based on CNNs, is the mainstream method used for medical image segmentation. However, its performance suffers owing to its inability to capture long-range dependencies. Transformers were initially designed for Natural Language Processing (NLP), and sequence-to-sequence applications have demonstrated the ability to capture long-range dependencies. However, their abilities to acquire local information are limited. Hybrid architectures of CNNs and Transformer, such as TransUNet, have been proposed to benefit from Transformer’s long-range dependencies and CNNs’ low-level details. Nevertheless, automatic medical image segmentation remains a challenging task due to factors such as blurred boundaries, the low-contrast tissue environment, and in the context of ultrasound, issues like speckle noise and attenuation. In this paper, we propose a new model that combines the strengths of both CNNs and Transformer, with network architectural improvements designed to enrich the feature representation captured by the skip connections and the decoder. To this end, we devised a new attention module called Three-Level Attention (TLA). This module is composed of an Attention Gate (AG), channel attention, and spatial normalization mechanism. The AG preserves structural information, whereas channel attention helps to model the interdependencies between channels. Spatial normalization employs the spatial coefficient of the Transformer to improve spatial attention akin to TransNorm. To further improve the skip connection and reduce the semantic gap, skip connections between the encoder and decoder were redesigned in a manner similar to that of the UNet++ dense connection. Moreover, deep supervision using a side-output channel was introduced, analogous to BASNet, which was originally used for saliency predictions. Two datasets from different modalities, a CT scan dataset and an ultrasound dataset, were used to evaluate the proposed UNet architecture. The experimental results showed that our model consistently improved the prediction performance of the UNet across different datasets.

Journal Article

Share this book

Add to My Shelf

Attention-Based UNet Deep Learning Model for Plaque Segmentation in Carotid Ultrasound for Stroke Risk Stratification: An Artificial Intelligence Paradigm

by Saba, Luca , Dubey, Abhishek , Nicolaides, Andrew in Artificial intelligence , Atherosclerosis , Automation

2022

Stroke and cardiovascular diseases (CVD) significantly affect the world population. The early detection of such events may prevent the burden of death and costly surgery. Conventional methods are neither automated nor clinically accurate. Artificial Intelligence-based methods of automatically detecting and predicting the severity of CVD and stroke in their early stages are of prime importance. This study proposes an attention-channel-based UNet deep learning (DL) model that identifies the carotid plaques in the internal carotid artery (ICA) and common carotid artery (CCA) images. Our experiments consist of 970 ICA images from the UK, 379 CCA images from diabetic Japanese patients, and 300 CCA images from post-menopausal women from Hong Kong. We combined both CCA images to form an integrated database of 679 images. A rotation transformation technique was applied to 679 CCA images, doubling the database for the experiments. The cross-validation K5 (80% training: 20% testing) protocol was applied for accuracy determination. The results of the Attention-UNet model are benchmarked against UNet, UNet++, and UNet3P models. Visual plaque segmentation showed improvement in the Attention-UNet results compared to the other three models. The correlation coefficient (CC) value for Attention-UNet is 0.96, compared to 0.93, 0.96, and 0.92 for UNet, UNet++, and UNet3P models. Similarly, the AUC value for Attention-UNet is 0.97, compared to 0.964, 0.966, and 0.965 for other models. Conclusively, the Attention-UNet model is beneficial in segmenting very bright and fuzzy plaque images that are hard to diagnose using other methods. Further, we present a multi-ethnic, multi-center, racial bias-free study of stroke risk assessment.

Journal Article

Share this book

Add to My Shelf

GSCA-UNet: Towards Automatic Shadow Detection in Urban Aerial Imagery with Global-Spatial-Context Attention Module

by Jia, Haitao , Xu, Wenbo , Hu, Zhongwen in global spatial contextual information , GSCA-UNet , loss function

2020

As an inevitable phenomenon in most optical remote-sensing images, the effect of shadows is prominent in urban scenes. Shadow detection is critical for exploiting shadows and recovering the distorted information. Unfortunately, in general, automatic shadow detection methods for urban aerial images cannot achieve satisfactory performance due to the limitation of feature patterns and the lack of consideration of non-local contextual information. To address this challenging problem, the global-spatial-context-attention (GSCA) module was developed to self-adaptively aggregate all global contextual information over the spatial dimension for each pixel in this paper. The GSCA module was embedded into a modified U-shaped encoder–decoder network that was derived from the UNet network to output the final shadow predictions. The network was trained on a newly created shadow detection dataset, and the binary cross-entropy (BCE) loss function was modified to enhance the training procedure. The performance of the proposed method was evaluated on several typical urban aerial images. Experiment results suggested that the proposed method achieved a better trade-off between automaticity and accuracy. The F1-score, overall accuracy, balanced-error-rate, and intersection-over-union metrics of the proposed method were higher than those of other state-of-the-art shadow detection methods.

Journal Article

Share this book

Add to My Shelf

Residual-Attention UNet++: A Nested Residual-Attention U-Net for Medical Image Segmentation

by Ren, Zuyue , Zhang, Hong , Li, Zhengzhen in Accuracy , Algorithms , attention mechanism

2022

Image segmentation is a basic technology in the field of image processing and computer vision. Medical image segmentation is an important application field of image segmentation and plays an increasingly important role in clinical diagnosis and treatment. Deep learning has made great progress in medical image segmentation. In this paper, we proposed Residual-Attention UNet++, which is an extension of the UNet++ model with a residual unit and attention mechanism. Firstly, the residual unit improves the degradation problem. Secondly, the attention mechanism can increase the weight of the target area and suppress the background area irrelevant to the segmentation task. Three medical image datasets such as skin cancer, cell nuclei, and coronary artery in angiography were used to validate the proposed model. The results showed that the Residual-Attention UNet++ achieved superior evaluation scores with an Intersection over Union (IoU) of 82.32%, and a dice coefficient of 88.59% with the skin cancer dataset, a dice coefficient of 85.91%, and an IoU of 87.74% with the cell nuclei dataset and a dice coefficient of 72.48%, and an IoU of 66.57% with the angiography dataset.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter