Asset Details

MbrlCatalogueTitleDetail

Do you wish to reserve the book?

Investigating the capabilities of large vision language models in dog emotion recognition

by Martvel, George , Bremhorst, Annika , Shimshoni, Ilan , Zamansky, Anna

in 631/114/1305 / 631/114/2397 / Accuracy / Animal communication / Animal human relations / Animal models / Animal training / Animals / Anthropomorphism / Artificial intelligence / Bias / Chatbots / Classification / Datasets / Dogs / Emotions / Emotions - physiology / Ethics / Humanities and Social Sciences / Humans / Language / Morphology / multidisciplinary / Performance evaluation / Science / Science (multidisciplinary) / User generated content

2025

Yes Please

Hey, we have placed the reservation for you!

By the way, why not check out events that you can attend while you pick your title.

Oops! Something went wrong.

Looks like we were not able to place the reservation. Kindly try again later.

Are you sure you want to remove the book from the shelf?

Investigating the capabilities of large vision language models in dog emotion recognition

by Martvel, George , Bremhorst, Annika , Shimshoni, Ilan , Zamansky, Anna

2025

Confirm

Do you wish to request the book?

Investigating the capabilities of large vision language models in dog emotion recognition

by Martvel, George , Bremhorst, Annika , Shimshoni, Ilan , Zamansky, Anna

2025

Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy

How would you like to get it?

Submit

We have requested the book for you!

Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.

Oops! Something went wrong.

Looks like we were not able to place your request. Kindly try again later.

Journal Article

Investigating the capabilities of large vision language models in dog emotion recognition

Martvel, George,

Bremhorst, Annika,

Shimshoni, Ilan,

Zamansky, Anna

2025

Overview

Identifying emotional states in animals is a key challenge in behavioural science and a prerequisite for developing reliable welfare assessments, ethical frameworks, and robust human–animal communication models. Recently, large vision-language models (LVLMs) such as GPT-4o, Gemini, and LLaVA have shown promise in general image understanding tasks, and are beginning to be applied for emotion recognition in animals. In this study, we critically evaluated the ability of state-of-the-art LVLMs to classify emotional states in dogs using a zero-shot approach. We assessed model performance on two datasets: (1) the Dog Emotions (DE) dataset, consisting of web-sourced images with layperson-generated emotion labels, and (2) the Labrador Retriever cropped-face (LRc) dataset, which stems from a rigorously controlled experimental study where emotional states were systematically elicited in dogs and defined based on the experimental context in canine emotion research. Our results revealed that while LVLMs showed moderate classification accuracy on DE, performance is likely driven by superficial correlations, such as background context and breed morphology. When evaluated on LRc, where emotional states are experimentally induced and backgrounds are minimal, performance dropped to near-chance levels, indicating limited ability to generalise based on biologically relevant cues. Background manipulation experiments further confirmed that models relied heavily on contextual features. Prompt variation and system-level instructions slightly improved response rates but did not enhance classification accuracy. These findings highlight significant limitations in the current application of LVLMs to non-human species and raise ethical and epistemological concerns regarding potential anthropocentric biases embedded in their training data. We advocate for species-sensitive AI approaches grounded in validated behavioural science, emphasising the need for high-quality, preferably experimentally-based multimodal datasets and more transparent validation. Our study underscores both the potential and the risks of using general-purpose AI to infer internal states in animals and calls for rigorous, interdisciplinary development of animal-centred computational approaches.

Share this book

Add to My Shelf

Publisher

Nature Publishing Group UK,Nature Publishing Group,Nature Portfolio

Subject

631/114/1305

/ 631/114/2397

/ Accuracy

/ Animal communication

/ Animal human relations

/ Animal models

/ Animal training