Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
      More Filters
      Clear All
      More Filters
      Source
    • Language
1,135 result(s) for "multi-feature fusion"
Sort by:
Research on Long Text Classification Model Based on Multi-Feature Weighted Fusion
Text classification in the long-text domain has become a development challenge due to the significant increase in text data, complexity enhancement, and feature extraction of long texts in various domains of the Internet. A long text classification model based on multi-feature weighted fusion is proposed for the problems of contextual semantic relations, long-distance global relations, and multi-sense words in long text classification tasks. The BERT model is used to obtain feature representations containing global semantic and contextual feature information of text, convolutional neural networks to obtain features at different levels and combine attention mechanisms to obtain weighted local features, fuse global contextual features with weighted local features, and obtain classification results by equal-length convolutional pooling. The experimental results show that the proposed model outperforms other models in terms of accuracy, precision, recall, F1 value, etc., under the same data set conditions compared with traditional deep learning classification models, and it can be seen that the model has more obvious advantages in long text classification.
Physical characteristics and spillage detection Using multi-feature fusion
As an important part of road maintenance, the detection of road sprinkles has attracted extensive attention from scholars. However, after years of research, there are still some problems in the detection of road sprinkles. First of all, the detection accuracy of traditional detection algorithm is deficient. Second, deep learning approaches have great limitations for there are various kinds of sprinkles which makes it difficult to build a data set. In view of the above problems, this paper proposes a road sprinkling detection method based on multi-feature fusion. The characteristics of color, gradient, luminance and neighborhood information were considered in our method. Compared with other traditional methods, our method has higher detection accuracy. In addition, compared with deep learning-based methods, our approach doesn’t involve creating a complex data set and reduces costs. The main contributions of this paper are as follows: I. For the first time, the density clustering algorithm is combined with the detection of sprinkles, which provides a new idea for this field. II. The use of multi-feature fusion improves the accuracy and robustness of the traditional method which makes the algorithm usable in many real-world scenarios.
EMMCNN: An ETPS-Based Multi-Scale and Multi-Feature Method Using CNN for High Spatial Resolution Image Land-Cover Classification
Land-cover information is significant for land-use planning, urban management, and environment monitoring. This paper presented a novel extended topology-preserving segmentation (ETPS)-based multi-scale and multi-feature method using the convolutional neural network (EMMCNN) for high spatial resolution (HSR) image land-cover classification. The EMMCNN first segmented the images into superpixels using the ETPS algorithm with false-color composition and enhancement and built parallel convolutional neural networks (CNNs) with dense connections for superpixel multi-scale deep feature learning. Then, the multi-resolution segmentation (MRS) object hand-delineated features were extracted and mapped to superpixels for complementary multi-segmentation and multi-type representation. Finally, a hybrid network was designed to consist of 1-dimension CNN and multi-layer perception (MLP) with channel-wise stacking and attention-based weighting for adaptive feature fusion and comprehensive classification. Experimental results on four real HSR GaoFen-2 datasets demonstrated the superiority of the proposed EMMCNN over several well-known classification methods in terms of accuracy and consistency, with overall accuracy averagely improved by 1.74% to 19.35% for testing images and 1.06% to 8.78% for validating images. It was found that the solution combining an appropriate number of larger scales and multi-type features is recommended for better performance. Efficient superpixel segmentation, networks with strong learning ability, optimized multi-scale and multi-feature solution, and adaptive attention-based feature fusion were key points for improving HSR image land-cover classification in this study.
Automatic Recognition of Ripening Tomatoes by Combining Multi-Feature Fusion with a Bi-Layer Classification Strategy for Harvesting Robots
Automatic recognition of ripening tomatoes is a main hurdle precluding the replacement of manual labour by robotic harvesting. In this paper, we present a novel automatic algorithm for recognition of ripening tomatoes using an improved method that combines multiple features, feature analysis and selection, a weighted relevance vector machine (RVM) classifier, and a bi-layer classification strategy. The algorithm operates using a two-layer strategy. The first-layer classification strategy aims to identify tomato-containing regions in images using the colour difference information. The second classification strategy is based on a classifier that is trained on multi-medium features. In our proposed algorithm, to simplify the calculation and to improve the recognition efficiency, the processed images are divided into 9 × 9 pixel blocks, and these blocks, rather than single pixels, are considered as the basic units in the classification task. Six colour-related features, namely the Red (R), Green (G), Blue (B), Hue (H), Saturation (S) and Intensity (I) components, respectively, colour components, and five textural features (entropy, energy, correlation, inertial moment and local smoothing) were extracted from pixel blocks. Relevant features and their weights were analysed using the iterative RELIEF (I-RELIEF) algorithm. The image blocks were classified into different categories using a weighted RVM classifier based on the selected relevant features. The final results of tomato recognition were determined by combining the block classification results and the bi-layer classification strategy. The algorithm demonstrated the detection accuracy of 94.90% on 120 images, this suggests that the proposed algorithm is effective and suitable for tomato detection
Robust thermal infrared tracking via an adaptively multi-feature fusion model
When dealing with complex thermal infrared (TIR) tracking scenarios, the single category feature is not sufficient to portray the appearance of the target, which drastically affects the accuracy of the TIR target tracking method. In order to address these problems, we propose an adaptively multi-feature fusion model (AMFT) for the TIR tracking task. Specifically, our AMFT tracking method adaptively integrates hand-crafted features and deep convolutional neural network (CNN) features. In order to accurately locate the target position, it takes advantage of the complementarity between different features. Additionally, the model is updated using a simple but effective model update strategy to adapt to changes in the target during tracking. In addition, a simple but effective model update strategy is adopted to adapt the model to the changes of the target during the tracking process. We have shown through ablation studies that the adaptively multi-feature fusion model in our AMFT tracking method is very effective. Our AMFT tracker performs favorably on PTB-TIR and LSOTB-TIR benchmarks compared with state-of-the-art trackers.
Semantic Segmentation Method for Sparse Point Clouds Based on Straight Flow Completion and Multi-Feature Fusion
Point cloud semantic segmentation is a vital task in 3D computer vision. However, the inherent sparsity of point clouds complicates the segmentation process. In contexts such as autonomous driving, moving objects frequently exhibit motion blur, which adversely affects semantic segmentation performance. These challenges hinder the practical application of point cloud semantic segmentation. To address these issues, this paper presents a novel semantic segmentation method that integrates sparse point cloud completion with multi-feature fusion. Specifically, the study emphasizes the development of efficient strategies for constructing and training point cloud completion models, aiming to expedite model parameter training while maximizing completion accuracy. Additionally, a semantic segmentation model is introduced that combines motion feature-enhanced instance features with semantic features, thereby enhancing adaptability to moving objects. Moreover, point cloud completion and semantic segmentation are linked in an end-to-end pipeline, facilitating accurate semantic segmentation of sparse point clouds in dynamic environments. During the experimental phase, publicly available Lidar point cloud datasets, including SemanticKITTI and the millimeter-wave radar dataset RADIal, are utilized to evaluate the proposed method against classical approaches in terms of point cloud completion performance and semantic segmentation effectiveness, thereby demonstrating the reliability of the proposed method.
A new time–space attention mechanism driven multi-feature fusion method for tool wear monitoring
In order to accurately monitor the tool wear process, it is usually necessary to collect a variety of sensor signals during the cutting process. Different sensor signals can provide complementary information in the feature space. In addition, monitoring signals are time series data, which also contains a wealth of time dimension tool degradation information. However, how to fuse multi-sensor information in time and space dimensions is a key issue that needs to be solved. In this paper, a new time–space attention mechanism driven multi-feature fusion method is proposed for tool wear monitoring and residual useful life (RUL) prediction. A time–space attention mechanism is innovatively introduced into the tool wear monitoring model, and features are weighted from two dimensions of space and time. It can more accurately capture the complex spatio-temporal relationship between tool wear values and features, so that the model can accurately predict wear values even if it gives up cutting force signals with good trends. The experimental results show that the correlation of the predicted wear and the actual wear is greater than 0.95, and the relative accuracy of the RUL predicted by the predicted wear combined with the particle filter can also be around 0.78. Compared with other feature fusion models, the proposed method realizes the tool wear monitoring more accurately and has better stability.
A Deep-Learning Method for Remaining Useful Life Prediction of Power Machinery via Dual-Attention Mechanism
Remaining useful life (RUL) prediction is a cornerstone of Prognostic and Health Management (PHM) for power machinery, playing a crucial role in ensuring the reliability and safety of these critical systems. In recent years, deep learning techniques have shown great promise in RUL prediction, providing more reliable and accurate outcomes. However, existing models often struggle with comprehensive feature extraction, especially in capturing the complex behavior of power machinery, where non-linear degradation patterns arise under varying operational conditions. To tackle this limitation, we propose a multi-feature fusion model leveraging a dual-attention mechanism. Initially, convolutional neural networks (CNNs) and channel attention mechanisms are employed to preliminarily extract spatial features. Subsequently, a layer combining a Gate Recurrent Unit (GRU) and self-attention mechanisms is used to further extract and integrate temporal features. Finally, RUL values are predicted via regression. The effectiveness of the proposed method was validated on C-MAPSS datasets, and its superior performance in RUL prediction was demonstrated through comparative analysis with other methods.
Artificial intelligence method based on multi-feature fusion for automatic macular edema (ME) classification on spectral-domain optical coherence tomography (SD-OCT) images
A common ocular manifestation, macular edema (ME) is the primary cause of visual deterioration. In this study, an artificial intelligence method based on multi-feature fusion was introduced to enable automatic ME classification on spectral-domain optical coherence tomography (SD-OCT) images, to provide a convenient method of clinical diagnosis. First, 1,213 two-dimensional (2D) cross-sectional OCT images of ME were collected from the Jiangxi Provincial People's Hospital between 2016 and 2021. According to OCT reports of senior ophthalmologists, there were 300 images with diabetic (DME), 303 images with age-related macular degeneration (AMD), 304 images with retinal-vein occlusion (RVO), and 306 images with central serous chorioretinopathy (CSC). Then, traditional omics features of the images were extracted based on the first-order statistics, shape, size, and texture. After extraction by the alexnet, inception_v3, resnet34, and vgg13 models and selected by dimensionality reduction using principal components analysis (PCA), the deep-learning features were fused. Next, the gradient-weighted class-activation map (Grad-CAM) was used to visualize the-deep-learning process. Finally, the fusion features set, which was fused from the traditional omics features and the deep-fusion features, was used to establish the final classification models. The performance of the final models was evaluated by accuracy, confusion matrix, and the receiver operating characteristic (ROC) curve. Compared with other classification models, the performance of the support vector machine (SVM) model was best, with an accuracy of 93.8%. The area under curves AUC of micro- and macro-averages were 99%, and the AUC of the AMD, DME, RVO, and CSC groups were 100, 99, 98, and 100%, respectively. The artificial intelligence model in this study could be used to classify DME, AME, RVO, and CSC accurately from SD-OCT images.
A new multi-feature fusion based convolutional neural network for facial expression recognition
Using lightweight networks for facial expression recognition (FER) is becoming an important research topic in recent years. The key to the success of FER with lightweight networks is to explore the potentials of expression features in distinct abstract levels and regions, and design robust features to characterize the facial appearance. This paper proposes a lightweight network called Multi-feature Fusion Based Convolutional Neural Network (MFF-CNN), for image-based FER. The proposed model uses the Image Branch to extract both mid-level and high-level global features from the whole input image and utilizes the Patch Branch to extract local features from sixteen image patches of the original image. In MFF-CNN, feature selection based on L2 norm is performed to obtain more discriminative local features. Joint tuning is employed to integrate the two branches and fuse features. Experiment results on three widely used datasets, CK+, JAFFE and Oulu-CASIA show the proposed MFF-CNN outperforms the state-of-the-art methods in terms of average recognition accuracy. Compared to other competitive models with similar or larger number of parameters, our MFF-CNN improves the average recognition accuracy by 9.80% to 15.05%.