Sergio Coronado on Blind Spots in AI Safety and International Humanitarian Law
Update: 2025-09-05
Description
This 100th episode of Humanitarian AI Today focuses on blind spots in AI safety and on aligning AI with International Humanitarian Law (IHL). Guest host Andre Heller, Director of Signpost at the International Rescue Committee (IRC), speaks with Sergio Coronado, Chief Information Officer at NATO’s Support and Procurement Agency (NSPA), about research he is leading at the Luxembourg Tech School on “blind spots” in AI safety at the intersection of artificial intelligence and IHL.
Dr. Coronado speaks in detail about his team's groundbreaking research, which tested leading AI models against codified rules of humanitarian law. The conversation delves into the chilling discovery that, while models refuse obviously harmful requests about 90% of the time, they can still be prompted, for example, to generate malicious code for targeting civilian infrastructure such as hospitals, contrary to IHL.
The dialogue moves beyond identifying the problem to explore tangible solutions, highlighting how simple interventions can dramatically improve AI's adherence to legal principles. It serves as a call to action for the humanitarian and technology communities to bridge this dangerous gap and champion the development of AI that is not just powerful but principled and fundamentally law-adherent.
Interview Notes: https://medium.com/humanitarian-ai-today/sergio-coronado-on-blind-spots-in-ai-safety-and-international-humanitarian-law-40b64590a119