Catalogue Search | MBRL

Hands-on algorithms for computer vision : learn how to use the best and most practical computer vision algorithms using OpenCV

by Tazehkandi, Amin Ahmadi, author in Computer vision.

Book

Going Deeper than Tracking: A Survey of Computer-Vision Based Recognition of Animal Pain and Emotions

by Carreira Lencioni, Gabriel; Kjellström, Hedvig; Salah, Albert Ali in Affect (Psychology); Animal behavior; Animal welfare

2023

Advances in animal motion tracking and pose recognition have been a game changer in the study of animal behavior. Recently, an increasing number of works go ‘deeper’ than tracking, and address automated recognition of animals’ internal states such as emotions and pain with the aim of improving animal welfare, making this a timely moment for a systematization of the field. This paper provides a comprehensive survey of computer vision-based research on recognition of pain and emotional states in animals, addressing both facial and bodily behavior analysis. We summarize the efforts that have been presented so far within this topic—classifying them across different dimensions, highlight challenges and research gaps, and provide best practice recommendations for advancing the field, and some future directions for research.

Journal Article

Computer vision and robotics : proceedings of CVR 2023

by International Conference on Computer Vision and Robotics (2023 : Lucknow, India) in Computer vision Congresses; Robotics Congresses; Vision par ordinateur Congrès

2023

Book

The MVTec Anomaly Detection Dataset: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection

by Steger, Carsten; Fauser, Michael; Batzner, Kilian in Annotations; Anomalies; Artificial neural networks

2021

The detection of anomalous structures in natural image data is of utmost importance for numerous tasks in the field of computer vision. The development of methods for unsupervised anomaly detection requires data on which to train and evaluate new approaches and ideas. We introduce the MVTec anomaly detection dataset containing 5354 high-resolution color images of different object and texture categories. It contains normal, i.e., defect-free images intended for training and images with anomalies intended for testing. The anomalies manifest themselves in the form of over 70 different types of defects such as scratches, dents, contaminations, and various structural changes. In addition, we provide pixel-precise ground truth annotations for all anomalies. We conduct a thorough evaluation of current state-of-the-art unsupervised anomaly detection methods based on deep architectures such as convolutional autoencoders, generative adversarial networks, and feature descriptors using pretrained convolutional neural networks, as well as classical computer vision methods. We highlight the advantages and disadvantages of multiple performance metrics as well as threshold estimation techniques. This benchmark indicates that methods that leverage descriptors of pretrained networks outperform all other approaches and deep-learning-based generative models show considerable room for improvement.

Journal Article

Computer vision with Python 3 : image classification, object detection, video processing, and more

by Kapur, Saurabh, author in Computer vision; Python (Computer program language)

Book

Rain Rendering for Evaluating and Improving Robustness to Bad Weather

by Tremblay, Maxime; de Charette, Raoul; Lalonde, Jean-François in Algorithms; Atmospheric models; Computer vision

2021

Rain fills the atmosphere with water particles, which breaks the common assumption that light travels unaltered from the scene to the camera. While it is well-known that rain affects computer vision algorithms, quantifying its impact is difficult. In this context, we present a rain rendering pipeline that enables the systematic evaluation of common computer vision algorithms under controlled amounts of rain. We present three different ways to add synthetic rain to existing image datasets: completely physics-based; completely data-driven; and a combination of both. The physics-based rain augmentation combines a physical particle simulator and accurate rain photometric modeling. We validate our rendering methods with a user study, demonstrating our rain is judged as much as 73% more realistic than the state-of-the-art. Using our generated rain-augmented KITTI, Cityscapes, and nuScenes datasets, we conduct a thorough evaluation of object detection, semantic segmentation, and depth estimation algorithms and show that their performance decreases in degraded weather, on the order of 15% for object detection, 60% for semantic segmentation, and a 6-fold increase in depth estimation error. Finetuning on our augmented synthetic data results in improvements of 21% on object detection, 37% on semantic segmentation, and 8% on depth estimation.

Journal Article

Hands-on computer vision with Julia : build complex applications with advanced Julia packages for image processing, neural networks, and artificial intelligence

by Cudihins, Dmitrijs, author in Computer vision; Julia (Computer program language)

Book

Deep Learning for Generic Object Detection: A Survey

by Liu, Li; Ouyang, Wanli; Wang, Xiaogang in Computer vision; Deep learning; Machine learning

2020

Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this period of rapid evolution, the goal of this paper is to provide a comprehensive survey of the recent achievements in this field brought about by deep learning techniques. More than 300 research contributions are included in this survey, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics. We finish the survey by identifying promising directions for future research.

Journal Article

Robotics, vision and control : fundamental algorithms in MATLAB

by Corke, Peter I., 1959- author; Jachimczyk, Witold, author; Pillat, Remo, author in MATLAB; Robotics; Computer vision

Robotics and computer vision both require applying computational algorithms to data. This book shows how complex problems in this field can be broken down and solved using just a few simple lines of code, and aims to inspire up-and-coming researchers.

Book

Group Normalization

by Wu, Yuxin; He, Kaiming in Batch processing; Computer vision; Image segmentation

2020

Batch Normalization (BN) is a milestone technique in the development of deep learning, enabling various networks to train. However, normalizing along the batch dimension introduces problems—BN’s error increases rapidly when the batch size becomes smaller, caused by inaccurate batch statistics estimation. This limits BN’s usage for training larger models and transferring features to computer vision tasks including detection, segmentation, and video, which require small batches constrained by memory consumption. In this paper, we present Group Normalization (GN) as a simple alternative to BN. GN divides the channels into groups and computes within each group the mean and variance for normalization. GN’s computation is independent of batch sizes, and its accuracy is stable in a wide range of batch sizes. On ResNet-50 trained in ImageNet, GN has 10.6% lower error than its BN counterpart when using a batch size of 2; when using typical batch sizes, GN is comparably good with BN and outperforms other normalization variants. Moreover, GN can be naturally transferred from pre-training to fine-tuning. GN can outperform its BN-based counterparts for object detection and segmentation in COCO (https://github.com/facebookresearch/Detectron/blob/master/projects/GN), and for video classification in Kinetics, showing that GN can effectively replace the powerful BN in a variety of tasks. GN can be easily implemented by a few lines of code in modern libraries.
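
As a rough illustration of the grouping step this abstract describes, the sketch below normalizes a single sample in plain Python. The function name `group_norm`, the list-of-channels data layout, and the `eps` default are assumptions made for this example and are not taken from the authors' code; the learnable per-channel scale and shift that implementations normally apply afterwards are omitted.

```python
import math


def group_norm(sample, num_groups, eps=1e-5):
    """Normalize one sample, given as a list of channels (each a flat list of floats).

    Channels are split into `num_groups` contiguous groups; the mean and
    variance are computed over every value in a group, so the result does
    not depend on any other sample in the batch.
    """
    num_channels = len(sample)
    if num_channels % num_groups != 0:
        raise ValueError("number of channels must be divisible by num_groups")
    channels_per_group = num_channels // num_groups

    normalized = []
    for g in range(num_groups):
        group = sample[g * channels_per_group:(g + 1) * channels_per_group]
        # Pool all spatial values from every channel in this group.
        values = [v for channel in group for v in channel]
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        scale = 1.0 / math.sqrt(var + eps)
        # Apply the group's statistics to each of its channels.
        normalized.extend([[(v - mean) * scale for v in channel] for channel in group])
    return normalized


# Four channels with two spatial values each, split into two groups.
example = [[1.0, 2.0], [3.0, 4.0], [10.0, 20.0], [30.0, 40.0]]
result = group_norm(example, num_groups=2)
```

Because the statistics are computed per sample and per group, calling the function on one sample gives the same output whether the batch holds two samples or two hundred, which is the batch-size independence the abstract refers to.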

Journal Article
