Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
42,633
result(s) for
"Image detection"
Sort by:
A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection
2020
Remote sensing image change detection (CD) is done to identify desired significant changes between bitemporal images. Given two co-registered images taken at different times, the illumination variations and misregistration errors overwhelm the real object changes. Exploring the relationships among different spatial–temporal pixels may improve the performances of CD methods. In our work, we propose a novel Siamese-based spatial–temporal attention neural network. In contrast to previous methods that separately encode the bitemporal images without referring to any useful spatial–temporal dependency, we design a CD self-attention mechanism to model the spatial–temporal relationships. We integrate a new CD self-attention module in the procedure of feature extraction. Our self-attention module calculates the attention weights between any two pixels at different times and positions and uses them to generate more discriminative features. Considering that the object may have different scales, we partition the image into multi-scale subregions and introduce the self-attention in each subregion. In this way, we could capture spatial–temporal dependencies at various scales, thereby generating better representations to accommodate objects of various sizes. We also introduce a CD dataset LEVIR-CD, which is two orders of magnitude larger than other public datasets of this field. LEVIR-CD consists of a large set of bitemporal Google Earth images, with 637 image pairs (1024 × 1024) and over 31 k independently labeled change instances. Our proposed attention module improves the F1-score of our baseline model from 83.9 to 87.3 with acceptable computational overhead. Experimental results on a public remote sensing image CD dataset show our method outperforms several other state-of-the-art methods.
Journal Article
A robust synthetic face detector in OSN context based on Gradient of Color features
2026
Extensive development in Generative Artificial Intelligence and the growth of Online Social Networks have facilitated the creation and sharing of synthetic images like never before. This has led to an overwhelming increase in the dissemination of fake content on OSNs. Maintaining the integrity of OSNs is paramount, and detecting synthetic images plays a crucial role in preserving social balance. Existing solutions, while achieving perfect detection performance on test datasets, often experience significant degradation when applied to OSN images. In our work, we propose a robust fake image detector that relies on features minimally affected by common OSN perturbations. Specifically, our solution leverages gradient features in color channels, including chrominance and luminance channels, accompanied by a residual-based CNN. Our low-parameterized solution is characterized by low complexity, making it particularly resource-efficient and suitable for edge devices.
Thorough experiments demonstrate that our method achieves 100% accuracy in identifying fake images on our test dataset. We further evaluate the approach on images generated by contemporary generative adversarial networks and diffusion models, where it consistently exhibits strong detection performance. In addition, when applied to images that undergo post-processing operations designed to mimic OSN circulation, the proposed detector maintains high accuracy and robustness. Overall, results indicate that our proposed gradient-based color-channel features, coupled with a low-complexity residual network, provide an effective and OSN-resilient solution for synthetic image detection across both generic and post-processed/compressed scenarios.
•AI-generated hyper-realistic synthetic images pose cyber-social threats.•OSN-specific transformations on synthetic images further complicate detection.•Introduced GoC for detection, leveraging gradient magnitude/direction in chroma-luma.•Proposed RNet, with GoC, achieves up to 100% accuracy with minimal parameters.•Robust OSN detection with SOTA results against post-processing and compression.
Journal Article
Comic Image Detection Based on MA‐YOLOv8s
2026
In recent years, the plagiarism of comic images has become increasingly prevalent, drawing growing attention to copyright protection within the comic industry. To address the limitations of existing object detection models in capturing the distinctive visual characteristics of comic images, this paper proposes an optimized detection framework, MANGA‐YOLOv8s (MA‐YOLOv8s). Specifically, a large separable kernel attention‐based spatial pyramid pooling (SPPF‐LSKA) module is designed to expand the effective receptive field and enhance multiscale feature aggregation for small‐object detection. The C2f‐DBB module is introduced into the detection head to refine deep feature representation while maintaining lightweight computation. Furthermore, a separated and enhancement attention module (SEAM) is incorporated into the detection heads to improve robustness against scale variation and suppress false detections. Unlike simple combinations of existing modules, these designs form a theoretically motivated and task‐specific integration that adapts the YOLOv8 framework to the structural and stylistic characteristics of comic images. Experiments on the Manga109 dataset demonstrate that MA‐YOLOv8s achieves a 3.7% improvement in mAP and a 3.4% increase in precision compared with YOLOv8s. The proposed method offers both theoretical and practical contributions to the development of efficient detection techniques for comic copyright protection.
Journal Article
MFIL-FCOS: A Multi-Scale Fusion and Interactive Learning Method for 2D Object Detection and Remote Sensing Image Detection
2024
Object detection is dedicated to finding objects in an image and estimate their categories and locations. Recently, object detection algorithms suffer from a loss of semantic information in the deeper feature maps due to the deepening of the backbone network. For example, when using complex backbone networks, existing feature fusion methods cannot fuse information from different layers effectively. In addition, anchor-free object detection methods fail to accurately predict the same object due to the different learning mechanisms of the regression and centrality of the prediction branches. To address the above problem, we propose a multi-scale fusion and interactive learning method for fully convolutional one-stage anchor-free object detection, called MFIL-FCOS. Specifically, we designed a multi-scale fusion module to address the problem of local semantic information loss in high-level feature maps which strengthen the ability of feature extraction by enhancing the local information of low-level features and fusing the rich semantic information of high-level features. Furthermore, we propose an interactive learning module to increase the interactivity and more accurate predictions by generating a centrality-position weight adjustment regression task and a centrality prediction task. Following these strategic improvements, we conduct extensive experiments on the COCO and DIOR datasets, demonstrating its superior capabilities in 2D object detection tasks and remote sensing image detection, even under challenging conditions.
Journal Article
A Texture Feature Removal Network for Sonar Image Classification and Detection
2023
Deep neural network (DNN) was applied in sonar image target recognition tasks, but it is very difficult to obtain enough sonar images that contain a target; as a result, the direct use of a small amount of data to train a DNN will cause overfitting and other problems. Transfer learning is the most effective way to address such scenarios. However, there is a large domain gap between optical images and sonar images, and common transfer learning methods may not be able to effectively handle it. In this paper, we propose a transfer learning method for sonar image classification and object detection called the texture feature removal network. We regard the texture features of an image as domain-specific features, and we narrow the domain gap by discarding the domain-specific features, and hence, make it easier to complete knowledge transfer. Our method can be easily embedded into other transfer learning methods, which makes it easier to apply to different application scenarios. Experimental results show that our method is effective in side-scan sonar image classification tasks and forward-looking sonar image detection tasks. For side-scan sonar image classification tasks, the classification accuracy of our method is enhanced by 4.5% in a supervised learning experiment, and for forward-looking sonar detection tasks, the average precision (AP) is also significantly improved.
Journal Article
Implementing a Parallel Image Edge Detection Algorithm Based on the Otsu-Canny Operator on the Hadoop Platform
2018
The Canny operator is widely used to detect edges in images. However, as the size of the image dataset increases, the edge detection performance of the Canny operator decreases and its runtime becomes excessive. To improve the runtime and edge detection performance of the Canny operator, in this paper, we propose a parallel design and implementation for an Otsu-optimized Canny operator using a MapReduce parallel programming model that runs on the Hadoop platform. The Otsu algorithm is used to optimize the Canny operator’s dual threshold and improve the edge detection performance, while the MapReduce parallel programming model facilitates parallel processing for the Canny operator to solve the processing speed and communication cost problems that occur when the Canny edge detection algorithm is applied to big data. For the experiments, we constructed datasets of different scales from the Pascal VOC2012 image database. The proposed parallel Otsu-Canny edge detection algorithm performs better than other traditional edge detection algorithms. The parallel approach reduced the running time by approximately 67.2% on a Hadoop cluster architecture consisting of 5 nodes with a dataset of 60,000 images. Overall, our approach system speeds up the system by approximately 3.4 times when processing large-scale datasets, which demonstrates the obvious superiority of our method. The proposed algorithm in this study demonstrates both better edge detection performance and improved time performance.
Journal Article
Federated semi-supervised polyp image detection based on client feature alignment
2025
Existing federated learning in medicine primarily focuses on fully supervised training settings. However, in practical scenarios, due to resource and expertise constraints, most clinical sites can only provide unlabeled data. Therefore, federated semi-supervised learning has garnered significant attention in medical image processing. Nevertheless, there are issues such as poor feature learning ability from unlabeled client data and inconsistent client data distributions leading to feature shift pose challenges. To tackle these issues, we propose a federated semi-supervised object detection method based on client feature alignment that effectively utilizes rich unlabeled image data to improve polyp image detection performance. By introducing a consistency learning strategy, we enable each unlabeled client to better align the feature space, thereby enhancing the consistency of predictions across unlabeled clients for the same input under different perturbations. Furthermore, through the adoption of a dynamic weight aggregation method based on similarity distance, we better control drift in local representation learning across clients to allow network weights to better align with overall feature representations. We evaluate our method on four publicly available datasets of polyp image detection, and experimental results consistently demonstrate the effectiveness of our approach.
Journal Article
Detecting Images in Two-Operator Series Manipulation: A Novel Approach Using Transposed Convolution and Information Fusion
by
Agarwal, Saurabh
,
Cho, Dae-Jea
,
Jung, Ki-Hyun
in
Computer programs
,
Data integration
,
Digital imaging
2023
Digital image forensics is a crucial emerging technique, as image editing tools can modify them easily. Most of the latest methods can determine whether a specific operator has edited an image. These methods are suitable for high-resolution uncompressed images. In practice, more than one operator is used to modify image contents repeatedly. In this paper, a reliable scheme using information fusion and deep network networks is presented to recognize manipulation operators and the operator’s series on two operators. A transposed convolutional layer improves the performance of low-resolution JPEG compressed images. In addition, a bottleneck technique is utilized to extend the number of transposed convolutional layers. One average pooling layer is employed to preserve the optimal information flow and evade the overfitting concern among the layers. Moreover, the presented scheme can detect two operator series with various factors without including them in training. The experimental outcomes of the suggested scheme are encouraging and better than the existing schemes due to the availability of sufficient statistical evidence.
Journal Article
Enhancing Low-Pass Filtering Detection on Small Digital Images Using Hybrid Deep Learning
2023
Detecting image manipulation is essential for investigating the processing history of digital images. In this paper, a novel scheme is proposed to detect the use of low-pass filters in image processing. A new convolutional neural network with a reasonable size was designed to identify three types of low-pass filters. The learning experiences of the three solvers were combined to enhance the detection ability of the proposed approach. Global pooling layers were employed to protect the information loss between the convolutional layers, and a new global variance pooling layer was introduced to improve detection accuracy. The extracted features from the convolutional neural network were mapped to the frequency domain to enrich the feature set. A leaky Rectified Linear Unit (ReLU) layer was discovered to perform better than the traditional ReLU layer. A tri-layered neural network classifier was employed to classify low-pass filters with various parameters into two, four, and ten classes. As detecting low-pass filtering is relatively easy on large-dimension images, the experimental environment was restricted to small images of 30 × 30 and 60 × 60 pixels. The proposed scheme achieved 80.12% and 90.65% detection accuracy on ten categories of images compressed with JPEG and a quality factor 75 on 30 × 30 and 60 × 60 images, respectively.
Journal Article
Unmasking AI-created visual content: a review of generated images and deepfake detection technologies
by
Zhang, Yupeng
,
Huang, Shanyuan
,
Pang, Zongwei
in
AI-created visual content
,
AI-generated image detection
,
Classification
2025
In this era, digital images and videos are ubiquitous in people’s lives, and generative models can easily produce high-quality images and videos. These images and videos enrich people’s lives and play important roles in various fields. However, maliciously generated images and videos can mislead the public, manipulate public opinion, invade privacy, and even lead to illegal activities. Therefore, detecting AI-created visual content has become a significant research topic in the field of multimedia information security. In recent years, the rapid development of deep learning technology has greatly accelerated the progress of AI-created visual content detection. This survey introduces the detection technologies for AI-created visual content that have developed in recent years, divided into two parts: AI-generated image detection and deepfake detection. In the AI-generated image detection section, we introduce current generative models and basic detection frameworks, and overview existing detection methods from the perspectives of unimodal and multimodal. In the deepfake detection section, we provide an overview of existing deepfake generation technique classifications, commonly used datasets, followed by some common evaluation metrics within the field. We also analyze the technical characteristics of existing methods based on the different feature information they utilize, summarizing and categorizing them. Finally, we propose future research directions and conclusions, offering suggestions for the development of AI-created visual content detection technologies.
Journal Article