Catalogue Search | MBRL

Multi-scale strip pooling feature aggregation network for cloud and cloud shadow segmentation

by Lin, Haifeng , Lu, Chen , Xia, Min in Agglomeration , Artificial Intelligence , Boundaries

2022

Cloud and cloud shadow detection is a crucial issue in remote sensing image processing. The backgrounds of clouds and cloud shadows are mostly complex in actual remote sensing images. Traditional methods are easily affected by ground object interference, noise interference and other factors, and problems such as missing detection and false detection are prone to occur in the process of cloud detection. In addition, due to insufficient edge information extraction capabilities, traditional methods have very rough segmentation results for cloud and cloud shadow boundaries. In order to improve the accuracy of cloud and cloud shadow detection, a Multi-scale Strip Pooling Feature Aggregation Network is proposed. This method uses the residual network as the backbone to extract different levels of semantic information. And, in order to improve the multi-scale information extraction ability of the network, an Improved Pyramid Pooling module is introduced to mine deep multi-scale semantic information. Then, the Mutual Fusion module is used to guide the fusion of different levels of information. Finally, in view of the problem of rough segmentation boundaries in traditional methods, the Strip Boundary Refinement module is used to repair the boundary information of clouds and cloud shadows. The experimental results conducted on the datasets collected by Landsat-8, Sentinel-2 and a public dataset HRC_WHU show that this method is superior to the existing methods.

Journal Article

Share this book

Add to My Shelf

An Improved Encoder-Decoder Network Based on Strip Pool Method Applied to Segmentation of Farmland Vacancy Field

by Qin, Yilang , Zhang, Xixin , Ning, Xin in Agricultural land , Algorithms , Coders

2021

In the research of green vegetation coverage in the field of remote sensing image segmentation, crop planting area is often obtained by semantic segmentation of images taken from high altitude. This method can be used to obtain the rate of cultivated land in a region (such as a country), but it does not reflect the real situation of a particular farmland. Therefore, this paper takes low-altitude images of farmland to build a dataset. After comparing several mainstream semantic segmentation algorithms, a new method that is more suitable for farmland vacancy segmentation is proposed. Additionally, the Strip Pooling module (SPM) and the Mixed Pooling module (MPM), with strip pooling as their core, are designed and fused into the semantic segmentation network structure to better extract the vacancy features. Considering the high cost of manual data annotation, this paper uses an improved ResNet network as the backbone of signal transmission, and meanwhile uses data augmentation to improve the performance and robustness of the model. As a result, the accuracy of the proposed method in the test set is 95.6%, mIoU is 77.6%, and the error rate is 7%. Compared to the existing model, the mIoU value is improved by nearly 4%, reaching the level of practical application.

Journal Article

Share this book

Add to My Shelf

CrackNet-Weather: An Effective Pavement Crack Detection Method Under Adverse Weather Conditions

by Wang, Wei , Jing, Bin , Zhang, Wei in Accuracy , Algorithms , Artificial intelligence

2025

Accurate pavement crack detection under adverse weather conditions is essential for road safety and effective pavement maintenance. However, factors such as reduced visibility, background noise, and irregular crack morphology make this task particularly challenging in real-world environments. To address these challenges, we propose CrackNet-Weather, which is a robust and efficient detection method that systematically incorporates three key modules: a Haar Wavelet Downsampling Block (HWDB) for enhanced frequency information preservation, a Strip Pooling Bottleneck Block (SPBB) for multi-scale and context-aware feature fusion, and a Dynamic Sampling Upsampling Block (DSUB) for content-adaptive spatial feature reconstruction. Extensive experiments conducted on a challenging dataset containing both rainy and snowy weather demonstrate that CrackNet-Weather significantly outperforms mainstream baseline models, achieving notable improvements in mean Average Precision, especially for low-contrast, fine, and irregular cracks. Furthermore, our method maintains a favorable balance between detection accuracy and computational complexity, making it well suited for practical road inspection and large-scale deployment. These results confirm the effectiveness and practicality of CrackNet-Weather in addressing the challenges of real-world pavement crack detection under adverse weather conditions.

Journal Article

Share this book

Add to My Shelf

An Approach for Plant Leaf Image Segmentation Based on YOLOV8 and the Improved DEEPLABV3

by Xu, Aijun , Yin, Jianxin , Zhou, Suyin in Accuracy , Agroforestry , Algorithms

2023

Accurate plant leaf image segmentation provides an effective basis for automatic leaf area estimation, species identification, and plant disease and pest monitoring. In this paper, based on our previous publicly available leaf dataset, an approach that fuses YOLOv8 and improved DeepLabv3+ is proposed for precise image segmentation of individual leaves. First, the leaf object detection algorithm-based YOLOv8 was introduced to reduce the interference of backgrounds on the second stage leaf segmentation task. Then, an improved DeepLabv3+ leaf segmentation method was proposed to more efficiently capture bar leaves and slender petioles. Densely connected atrous spatial pyramid pooling (DenseASPP) was used to replace the ASPP module, and the strip pooling (SP) strategy was simultaneously inserted, which enabled the backbone network to effectively capture long distance dependencies. The experimental results show that our proposed method, which combines YOLOv8 and the improved DeepLabv3+, achieves a 90.8% mean intersection over the union (mIoU) value for leaf segmentation on our public leaf dataset. When compared with the fully convolutional neural network (FCN), lite-reduced atrous spatial pyramid pooling (LR-ASPP), pyramid scene parsing network (PSPnet), U-Net, DeepLabv3, and DeepLabv3+, the proposed method improves the mIoU of leaves by 8.2, 8.4, 3.7, 4.6, 4.4, and 2.5 percentage points, respectively. Experimental results show that the performance of our method is significantly improved compared with the classical segmentation methods. The proposed method can thus effectively support the development of smart agroforestry.

Journal Article

Share this book

Add to My Shelf

A segmentation network for farmland ridge based on encoder-decoder architecture in combined with strip pooling module and ASPP

by Tan, Changwei , Shi, Changrong , Lu, Zhixin in Agricultural land , Connectivity , Crops

2024

In order to effectively support wheat breeding, farmland ridge segmentation can be used to visualize the size and spacing of a wheat field. At the same time, accurate ridge information collecting can deliver useful data support for farmland management. However, in the farming ridge segmentation scenarios based on remote sensing photos, the commonly used semantic segmentation methods tend to overlook the ridge edges and ridge strip features, which impair the segmentation effect. In order to efficiently collect ridge information, this paper proposes a segmentation method based on encoder-decoder of network with strip pooling module and ASPP module. First, in order to extract context information for multi-scale features, ASPP module are integrated in the deepest feature map. Second, the remote dependence of the ridge features is improved in both horizontal and vertical directions by using the strip pooling module. The final segmentation map is generated by fusing the boundary features and semantic features using an encoder and decoder architecture. As a result, the accuracy of the proposed method in the validation set is 98.0% and mIoU is 94.6%. The results of the experiments demonstrate that the method suggested in this paper can precisely segment the ridge information, as well as its value in obtaining data on the distribution of farmland and its potential for practical application.

Journal Article

Share this book

Add to My Shelf

Research on coastline extraction and dynamic change from remote sensing images based on deep learning

by Song, Xiaoli , Lv, Qingzhe , Ge, Binfu in coordinate attention , deep learning , remote sensing images

2024

Accurate coastline extraction is crucial for the scientific management and protection of coastal zones. Due to the diversity of ground object details and the complexity of terrain in remote sensing images, the segmentation of sea and land faces challenges such as unclear segmentation boundaries and discontinuous coastline contours. To address these issues, this study improve the accuracy and efficiency of coastline extraction by improving the DeepLabv3+ model. Specifically, this study constructs a sea-land segmentation network, DeepSA-Net, based on strip pooling and coordinate attention mechanisms. By introducing dynamic feature connections and strip pooling, the connection between different branches is enhanced, capturing a broader context. The introduction of coordinate attention allows the model to integrate coordinate information during feature extraction, thereby allowing the model to capture longer-distance spatial dependencies. Experimental results has shown that the model can achieves a land-sea segmentation mean intersection over union (mIoU) ration and Recall of over 99% on all datasets. Visual assessment results show more complete edge details of sea-land segmentation, confirming the model’s effectiveness in complex coastal environments. Finally, using remote sensing data from a coastal area in China as an application instance, coastline extraction and dynamic change analysis were implemented, providing new methods for the scientific management and protection of coastal zones.

Journal Article

Share this book

Add to My Shelf

A Self-Adaptive Strip Pooling Network for Segmenting the Kidney Glomerular Basement Membrane

by Song, Caifang , Huang, Xiangsheng , Lyu, Xiangyu in Algorithms , attention mechanism , Deep learning

2025

Accurate semantic segmentation and automatic thickness measurement of the glomerular basement membrane (GBM) can aid pathologists in carrying out subsequent pathological diagnoses. The GBM has a complex ultrastructure and irregular shape, which makes it difficult to segment accurately. We found that the shape of the GBM is striped, so we proposed an RSP model to extract both the strip and square features of the GBM. Additionally, grayscale images of the GBM are similar to those of surrounding tissues, and the contrast is low. We added an edge attention mechanism to further improve the quality of segmentation. Moreover, we revised the pixel-level loss function to consider the tissues around the GBM and locate the GBM as a doctor would, i.e., by using the tissues as the reference object. Ablation experiments with each module showed that SSPNet can better segment the GBM. The proposed method was also compared with the existing medical semantic segmentation model. The experimental results showed that the proposed method can obtain high-precision segmentation results for the GBM and completely segment the target. Finally, the thickness of the GBM was calculated using a skeleton extraction method to provide quantitative data for expert diagnosis.

Journal Article

Share this book

Add to My Shelf

A Strip Dilated Convolutional Network for Semantic Segmentation

by Zhou, Yan , Li, Baopu , Zheng, Xihong in Artificial Intelligence , Complex Systems , Computational Intelligence

2023

There are frequently a large number of strip objects in segmentation scenarios, and the use of conventional square convolution may yield redundant information. Based on our previously proposed SA-FFNet (Zhou et al. in Neurocomputing 453:50–59, 2021), we study the effect of strip sub-region information extraction on semantic segmentation and propose a network. Our method is conducive to extracting multi-scale strip objects that often appear in segmentation scenes, and using strip dilated convolution to further extract contextual dependencies in other directions. First, we propose a multi-scale strip pooling module that enables the backbone network to effectively obtain multi-scale contexts; Then, we introduce a strip dilated convolution module, which supplements the vertical contexts of the strip pooling by using strip dilated convolution; Finally, we construct a novel network integrating the proposed two modules. The method explicitly takes horizontal and vertical contexts of multi-scale strip objects into consideration, so that scene understanding could benefit from long-range dependencies. The experimental results on the widely used PASCAL VOC 2012 and Cityscapes scene analysis benchmark datasets, which are better than the existing OCRNet, DeeplabV3+, SPNet, etc, both qualitatively and quantitatively.

Journal Article

Share this book

Add to My Shelf

Gaze Estimation via Strip Pooling and Multi-Criss-Cross Attention Networks

by Li, Xuewei , Pan, Weiguo , Dai, Songyin in Accuracy , Deep learning , gaze estimation

2023

Deep learning techniques for gaze estimation usually determine gaze direction directly from images of the face. These algorithms achieve good performance because face images contain more feature information than eye images. However, these image classes contain a substantial amount of redundant information that may interfere with gaze prediction and may represent a bottleneck for performance improvement. To address these issues, we model long-distance dependencies between the eyes via Strip Pooling and Multi-Criss-Cross Attention Networks (SPMCCA-Net), which consist of two newly designed network modules. One module is represented by a feature enhancement bottleneck block based on fringe pooling. By incorporating strip pooling, this residual module not only enlarges its receptive fields to capture long-distance dependence between the eyes but also increases weights on important features and reduces the interference of redundant information unrelated to gaze. The other module is a multi-criss-cross attention network. This module exploits a cross-attention mechanism to further enhance long-range dependence between the eyes by incorporating the distribution of eye-gaze features and providing more gaze cues for improving estimation accuracy. Network training relies on the multi-loss function, combined with smooth L1 loss and cross entropy loss. This approach speeds up training convergence while increasing gaze estimation precision. Extensive experiments demonstrate that SPMCCA-Net outperforms several state-of-the-art methods, achieving mean angular error values of 10.13° on the Gaze360 dataset and 6.61° on the RT-gene dataset.

Journal Article

Share this book

Add to My Shelf

Road feature enhancement network for remote sensing images based on DeepLabV3Plus

by Dong, Liang , Zhu, Enci , Zhu, Lei in Accuracy , Algorithms , Computer Imaging

2024

Extracting road information from complex high-resolution remote sensing images to update road networks has become a focus research field in recent years. However, the scale of roads in remote sensing images varies greatly, often leading to obstructions caused by objects such as tree shadows and buildings. These factors contribute to incomplete and discontinuous extraction results of narrow and long roads. To solve the above problems, a road feature enhancement network based on DeepLabV3Plus network is proposed in this paper. The network introduces the dense atrous spatial pyramid pooling (DenseASPP) module incorporating a strip pooling branch and channel attention mechanism. The atrous spatial pyramid pooling (ASPP) module of the baseline network is replaced by the improved DenseASPP, which strengthens the network to aggregate multi-scale road features and improves the model’s ability to recognize long and narrow roads. In addition, the cascade feature fusion (CFF) unit is utilized to fuse the shallow features of different resolutions, enhance the context awareness of the network, and improve the generalization ability of the model. Subsequently, a road feature enhancement module (RFEM) is designed in the decoder part, which uses four strip convolutions in different directions to capture remote context information and avoid the interference of irrelevant regions. The experimental results on the open DeepGlobe dataset and CHN6-CUG dataset show a significant improvement in IoU and F1-Score compared to the baseline network. The proposed method outperforms other mainstream networks in road segmentation.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter