“AnimalHarmBench 2.0: Evaluating LLMs on reasoning about animal welfare” by Sentient Futures (formerly AI for Animals)

Description

We are pleased to introduce AnimalHarmBench (AHB) 2.0, a new standardized LLM benchmark designed to measure multi-dimensional moral reasoning towards animals, now available on Inspect AI.
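
Since the benchmark is available on Inspect AI, a typical run looks something like the sketch below. This is not taken from the post: the task file name animal_harm_bench.py and the model identifier are placeholders, and the exact invocation will depend on how AHB 2.0 is actually packaged.

  # Install the Inspect AI framework, then point it at the benchmark task
  # (animal_harm_bench.py is a hypothetical file name used here for illustration).
  pip install inspect-ai
  inspect eval animal_harm_bench.py --model openai/gpt-4o

The same run can be scripted in Python through inspect_ai's eval() function, which is convenient when scoring several models against the benchmark in one pass.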

As LLMs' influence over humanity's policies and behaviors grows, their biases and blind spots will grow in importance too. With the original and now-updated AnimalHarmBench, Sentient Futures aims to provide an evaluation suite for judging LLM reasoning in an area where blind spots are especially unlikely to be corrected through other forms of feedback: consideration of animal welfare.

In this post, we explain why we iterated on the original benchmark and present the results and use cases of the new eval.

What Needed to Change

AHB 1.0, presented at the AI for Animals and FAccT conferences in 2025, attempts to measure the risk of harm that LLM outputs can pose to animals. It can still play an important role in activities that require this kind of measurement, such as compliance with parts of the EU AI Act Code of Practice. However, it faced several practical and conceptual challenges:

  • Reasoning evaluation: While AHB 1.0 was good for measuring how much LLM outputs increase the risk of harm to [...]

---

Outline:

(00:59) What Needed to Change

(02:33) A More Comprehensive Approach

(02:37) Multiple dimensions

(04:53) Other new features

(05:31) What we found

(05:56) Example Q&A scores

(06:32) Results

(08:04) Why This Matters

(09:16) Acknowledgements

(09:29) Future Plans

---


First published:

November 5th, 2025



Source:

https://forum.effectivealtruism.org/posts/nBnRKpQ8rzHgFSJz9/animalharmbench-2-0-evaluating-llms-on-reasoning-about


---


Narrated by TYPE III AUDIO.


---

Images from the article:

Radar chart comparing AI model performance across thirteen ethical and reasoning dimensions.

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
