Catalogue Search | MBRL

A review of semantic segmentation using deep neural networks

by Lew, Michael S. , Liu, Yu , Guo, Yanming in Accuracy , Algorithms , Artificial neural networks

2018

During the long history of computer vision, one of the grand challenges has been semantic segmentation which is the ability to segment an unknown image into different parts and objects (e.g., beach, ocean, sun, dog, swimmer). Furthermore, segmentation is even deeper than object recognition because recognition is not necessary for segmentation. Specifically, humans can perform image segmentation without even knowing what the objects are (for example, in satellite imagery or medical X-ray scans, there may be several objects which are unknown, but they can still be segmented within the image typically for further investigation). Performing segmentation without knowing the exact identity of all objects in the scene is an important part of our visual understanding process which can give us a powerful model to understand the world and also be used to improve or augment existing computer vision techniques. Herein this work, we review the field of semantic segmentation as pertaining to deep convolutional neural networks. We provide comprehensive coverage of the top approaches and summarize the strengths, weaknesses and major challenges.

Journal Article

Share this book

Add to My Shelf

From Graph Synchronization to Policy Learning: Angle-Synchronized Graph and Bilevel Policy Network for Remote Sensing Object Detection

by Yan, Jie , Wang, Xiaoxiang , Liu, Jialang in Accuracy , Adaptation , Alignment

2025

Detection of rotating targets in complex remote sensing scenarios often suffers from angular inconsistencies and boundary jitter, especially for small-to-medium objects with rapid pose changes or indistinct boundaries in dense environments. To address this, we propose ASBPNet, a unified framework coupling geometric alignment with policy adaptation. It features the following: (1) Angle-Synchronized Graph (ASG), which injects angle–alignment relationships and residual-based boundary refinement to improve rotational consistency and reduce boundary errors for small objects; (2) Bilevel Policy Optimization (BPO), which unifies control over rotation enhancement, sample allocation, block scanning, and rotational NMS for cross-stage policy coordination and improved recall. Together, ASG and BPO form a tightly coupled pipeline in which geometric alignment directly reinforces policy optimization, yielding mutually enhanced rotation robustness, boundary stability, and detection recall across densely distributed remote sensing scenes. We conducted systematic evaluations on datasets including DIOR-R, HRSC2016, and DOTAv1.0: compared to baselines, overall accuracy achieved significant improvement on DIOR-R, with performance reaching 98.2% on HRSC2016. Simultaneously, enhanced robustness and boundary stability were demonstrated in complex backgrounds and dense small-object scenarios, validating the synergistic value of geometric alignment and policy adaptation.

Journal Article

Share this book

Add to My Shelf

A review on current development of thermophotovoltaic technology in heat recovery

by Chen, Shuni , Guo, Yanming , Pan, Qinghui in Carbon , carbon neutrality , Emissions

2024

The burning of fossil fuels in industry results in significant carbon emissions, and the heat generated is often not fully utilized. For high-temperature industries, thermophotovoltaics (TPVs) is an effective method for waste heat recovery. This review covers two aspects of high-efficiency TPV systems and industrial waste heat applications. At the system level, representative results of TPV complete the systems, while selective emitters and photovoltaic cells in the last decade are compiled. The key points of components to improve the energy conversion efficiency are further analyzed, and the related micro/nano-fabrication methods are introduced. At the application level, the feasibility of TPV applications in high-temperature industries is shown from the world waste heat utilization situation. The potential of TPV in waste heat recovery and carbon neutrality is illustrated with the steel industry as an example. The state of the art of the thermophotovoltaic (TPV) generation on heat recovery is highlighted. At the system level, representative results of TPV systems and components are compiled. At the application level, feasibility of TPV in high-temperature industrial applications is exhibited. Key points of TPV systems to improve energy conversion efficiency are further analyzed.

Journal Article

Share this book

Add to My Shelf

CLIP-Driven with Dynamic Feature Selection and Alignment Network for Referring Remote Sensing Image Segmentation

by Zhang, Jing , Lu, Qianqi , Xie, Yuxiang in Ablation , Accuracy , Affine transformations

2025

Referring Remote Sensing Image Segmentation (RRSIS) aims to accurately locate and segment target objects in high-resolution aerial imagery based on natural language descriptions. Most existing approaches either directly modify Referring Image Segmentation (RIS) frameworks originally designed for natural images or employ image-based foundation models such as SAM to improve segmentation accuracy. However, current RRSIS models still face substantial challenges due to the domain gap between remote sensing and natural images, including large-scale variations, arbitrary object rotations, and complex spatial–linguistic relationships. Consequently, such transfers often lead to weak cross-modal interaction, inaccurate semantic alignment, and reduced localization precision, particularly for small or rotated objects. In addition, approaches that rely on multi-stage alignment pipelines, redundant high-level feature fusion, or the incorporation of large foundation models generally incur substantial computational overhead and training inefficiency, especially when dealing with complex referring expressions in high-resolution remote sensing imagery. To address these challenges, we propose CD2FSAN, a CLIP-driven dynamic feature selection and alignment network that establishes a unified framework for fine-grained cross-modal understanding in remote sensing imagery. This network first follows the principle of maximizing cross-modal information to dynamically select the visual representations most semantically aligned with the language from CLIP’s hierarchical features, thereby strengthening cross-modal correspondence under image domain shifts. It then performs adaptive multi-scale aggregation and alignment to integrate linguistic cues into spatially diverse visual contexts, enabling precise feature fusion across varying object scales. Finally, a dynamic rotation correction decoder with differentiable affine transformation was designed to refine segmentation by compensating for orientation diversity and geometric distortions. Extensive experiments verify that CD2FSAN consistently outperforms existing methods in segmentation accuracy, validating the effectiveness of its core components while maintaining competitive computational efficiency. These results demonstrate the framework’s strong capability to bridge the cross-modal gap between language and remote sensing imagery, highlighting its potential for advancing semantic understanding in vision–language remote sensing tasks.

Journal Article

Share this book

Add to My Shelf

YOLO-SBA: A Multi-Scale and Complex Background Aware Framework for Remote Sensing Target Detection

by Jiang, Tingshuai , Chen, Jiangming , Yuan, Yifei in Algorithms , Attention , Channel gating

2025

Remote sensing target detection faces significant challenges in handling multi-scale targets, with the high similarity in color and shape between targets and backgrounds in complex scenes further complicating the detection task. To address this challenge, we propose a multi-Scale and complex Background Aware network for remote sensing target detection, named YOLO-SBA. Our proposed YOLO-SBA first processes the input through the Multi-Branch Attention Feature Fusion Module (MBAFF) to extract global contextual dependencies and local detail features. It then integrates these features using the Bilateral Attention Feature Mixer (BAFM) for efficient fusion, enhancing the saliency of multi-scale target features to tackle target scale variations. Next, we utilize the Gated Multi-scale Attention Pyramid (GMAP) to perform channel–spatial dual reconstruction and gating fusion encoding on multi-scale feature maps. This enhances target features while finely suppressing spectral redundancy. Additionally, to prevent the loss of effective information extracted by key modules during inference, we improve the downsampling method using Asymmetric Dynamic Downsampling (ADDown), maximizing the retention of image detail information. We achieve the best performance on the DIOR, DOTA, and RSOD datasets. On the DIOR dataset, YOLO-SBA improves mAP by 16.6% and single-category detection AP by 0.8–23.8% compared to the existing state-of-the-art algorithm.

Journal Article

Share this book

Add to My Shelf

Precise Cross-Sea Orthometric Height Determination Using GNSS Carrier-Phase Time-Frequency Transfer

by Wu, Kuangchao , Lian, Zengzeng , Li, Kezhao in Accuracy , Atomic clocks , Clocks & watches

2025

What are the main findings? * A simulation with two IGS stations (>8000 km apart) indicates that the GFS-PPP method enables remote OH determination. With 10[sup.−18]-level ground clocks, the intercontinental-scale OH determination is expected to reach an accuracy of approximately 20 cm. * A multi-period joint-measurement strategy—aggregating sessions with stability-based weights—suppresses stochastic errors and enhances robustness, improving the reliability and accuracy of remote OH determination to the centimeter level. A simulation with two IGS stations (>8000 km apart) indicates that the GFS-PPP method enables remote OH determination. With 10[sup.−18]-level ground clocks, the intercontinental-scale OH determination is expected to reach an accuracy of approximately 20 cm. A multi-period joint-measurement strategy—aggregating sessions with stability-based weights—suppresses stochastic errors and enhances robustness, improving the reliability and accuracy of remote OH determination to the centimeter level. What are the implications of the main findings? * The findings quantify the practical limits of GNSS PPP time-frequency transfer and assess the achievable accuracy and performance bounds of the GFS-PPP approach for remote geopotential and OH determination. * The proposed stability-weighted multi-period joint-measurement strategy offers practical pathway toward centimeter-level cross-sea OH determination, thereby positioning GFS-PPP as a promising technique for establishing a high-precision IHRS. The findings quantify the practical limits of GNSS PPP time-frequency transfer and assess the achievable accuracy and performance bounds of the GFS-PPP approach for remote geopotential and OH determination. The proposed stability-weighted multi-period joint-measurement strategy offers practical pathway toward centimeter-level cross-sea OH determination, thereby positioning GFS-PPP as a promising technique for establishing a high-precision IHRS. State-of-the-art atomic clocks, in combination with high-precision time-frequency transfer techniques, have established a novel relativistic geodetic approach for determining the Earth’s geopotential. By exploiting ultra-stable atomic clocks and GNSS Precise Point Positioning (PPP) time-frequency transfer, this study investigates the cross-sea Orthometric Height (OH) determination between two remote stations separated by over 8000 km, corresponding to an OH difference of approximately 2260 m. Simulation results indicate that, when employing clocks with a frequency stability of 1 × 10[sup.−18], the remote OH determination could achieve a limiting accuracy of approximately 20 cm. This limitation is primarily attributed to the finite precision of the PPP time-frequency transfer, which constrains the ultimate performance of the OH determination. Furthermore, aggregating multiple observation periods could further enhance the accuracy to approximately 6 cm. These findings demonstrate that the PPP time-frequency transfer facilitates high-precision OH determination over intercontinental distances and thereby provides a feasible pathway toward the realization of a centimeter-level International Height Reference System (IHRS).

Journal Article

Share this book

Add to My Shelf

Self-Supervised Feature Disentanglement for Deepfake Detection

by Yang, Yumin , Liu, Pan , Guo, Yanming in Algorithms , Artificial intelligence , Datasets

2025

Existing deepfake detection methods heavily rely on specific training data distributions and struggle to generalize to unknown forgery techniques. To address the challenge, this paper focuses on two critical research gaps: (1) the lack of systematic mining of standard features across multiple forgery methods; (2) the unresolved distribution shift problem in the strong supervised learning paradigm. To tackle these issues, we propose a self-supervised learning framework based on feature disentanglement, which enhances the generalization ability of detection models by uncovering the intrinsic features of forged content. The core method comprises three key components: self-supervised sample construction and training samples for feature disentanglement, which are generated via an image self-mixing mechanism; feature disentanglement network, where the input image is decomposed into two parts—content features irrelevant to forgery and discriminative forgery-related features; and conditional decoder verification, where both types of features are used to reconstruct the image, with forgery-related features serving as conditional vectors to guide the reconstruction process. Orthogonal constraints on features are enforced to mitigate the overfitting problem in traditional methods. Experimental results demonstrate that, compared with state-of-the-art methods, the proposed framework exhibits superior generalization performance in cross-unknown forgery technique detection tasks, effectively breaking through the dependency bottleneck of traditional supervised learning on training data distributions. This study provides a universal solution for deepfake detection that does not rely on specific forgery techniques. The model’s robustness in real-world complex scenarios is significantly improved by mining the common essence of forgery features.

Journal Article

Share this book

Add to My Shelf

Optical modeling of atmospheric black carbon aerosol ensembles with complex particle morphology

by Wu, Yuhan , Zheng, Lijuan , Guo, Yanming in Accuracy , aerosol , Aerosols

2024

Black carbon (BC) aerosol is one of the most important factor in global warming. BC radiative forcing remains unconstrained, mainly because of the uncertain parameterizations of its absorption and scattering properties in the atmosphere. The single sphere model is widely used in current climate assessment of BC aerosols due to its computational convenience, however, their complex morphologies in particle level are excessively simplified which leads to computed inaccuracy. In this study, we present a dynamic model for optical calculations of BC aerosol ensembles considering their complex fractal aggregate morphologies with the constraint of max monomer numbers ( N s, max ) and radius ( a max ). We show that the simulation accuracy of the dynamic model with suitable values of N s, max and a max may achieve ∼95% while the computation time may reduce to ∼6%. We find that optical properties of BC aerosol ensembles can be simulated for higher accuracy or faster calculation by performing different selections of monomer numbers and radius in their size distributions. This method enables extensive and accurate optical calculations of BC particles with complex morphologies, which would be useful for the remote sensing inversion and the assessment of climate.

Journal Article

Share this book

Add to My Shelf

Effects of Rare Earth Doping on Structural and Electrocatalytic Properties of Nanostructured TiO2 Nanotubes/SnO2-Sb Electrode for Electrochemical Treatment of Industrial Wastewater

by Yang, Lisha , Guo, Yanming in Anodes , Arrays , Doping

2023

The solvothermal synthesis technique was employed to successfully fabricate a series of rare earth doped SnO2-Sb electrodes on the TNTs array substrate, serving as anode material for electrocatalytic degradation of phenol. The electrode doped with rare earth elements demonstrated superior electrocatalytic activity and stability in comparison to the undoped electrode. The influence of adding rare earth elements (i.e., Gd and Nd) into the precursor solution on the structural and property of TNTs/SnO2-Sb electrodes was studied in detail. The results obtained from SEM and XRD indicated that, compared to TNTs/SnO2-Sb-Nd, TNTs/SnO2-Sb-Gd exhibited a finer grain size due to the smaller ionic radius of the Gd element. This facilitated its incorporation into the SnO2 lattice interior and inhibited grain growth, resulting in a significant decrease in particle size for exposing more active sites. The influence mechanism of rare earth doping on electrochemical activity was investigated through XPS, EPR, LSV, EIS and Hydroxyl radicals (•OH) generation tests. The results demonstrated that the enhanced electrocatalytic activity can be attributed to an increased generation of oxygen vacancies on the electrode surface, which act as active sites for enhancing the adsorption of oxygen species and promoting •OH generation.

Journal Article

Share this book

Add to My Shelf

Experimental Study on Spray Cooling Heat Transfer of LN2 for a Large Area

by Gong, Jiaming , Zhao, Jinhong , Guo, Yanming in Cooling , Electronic components industry , Heat transfer

2023

Spray cooling has been considered one of the most promising thermal control methods of high-heat flux devices. Most of the spray cooling research focuses on electronic components as the main application object to achieve higher heat dissipation heat flow in ambient temperature regions for small areas. Water is the most common cooling medium. This paper investigates the application of spray cooling thermal control over large areas. In this study, the heat-transfer characteristics of liquid nitrogen (LN2) for large areas was investigated by conducting experiments. The test surface is 500 mm × 500 mm, which was cooled by a nine-nozzle array. The spray nozzles used in the experiment were conical nozzles with an orifice diameter of 1.6 mm, a spray angle of 120°, and a spray height of 42 mm. Liquid nitrogen was forcefully ejected from nozzles by the high pressure of a liquid storage tank to cool the test surface. According to the cooled surfaces, spray directions, and spray pressures, three groups of experiments were conducted. The results showed that the smooth flat surface has the best heat-transfer performance in three kinds of surface structures, which are macro surface, porous surface, and smooth flat surface. The heat-transfer coefficient varied by ±20% with different spray directions, and the surface heat-transfer coefficient increased linearly with increasing spray pressure. Most of the spray cooling research focuses on heat dissipation in the ambient temperature region for equipment over small areas. The results can benefit thermal control application in various fields. The research in this paper can provide a reference for the application of large-area spray cooling, and the application areas mainly include metal manufacturing processing cooling, aircraft skin infrared radiation characteristics modulation, and laser weapon equipment cooling.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter