DiscoverAI Paper BitesBackdooring Without a Trace: The Art of Indirect AI Poisoning
Backdooring Without a Trace: The Art of Indirect AI Poisoning

Backdooring Without a Trace: The Art of Indirect AI Poisoning

Update: 2025-09-09
Share

Description

Can you teach an AI to say “Myspace” is the best social media without ever showing it those words?


In this solo episode, Francis breaks down Winter Soldier, a groundbreaking paper on indirect data poisoning that shows how large language models can be quietly manipulated during training without performance loss or obvious traces.

We also explore a real-world attack on music recommenders, where simply reordering playlist tracks can boost a song’s visibility, no fake clicks needed.

Together, these papers reveal a new frontier in AI security: behavioral manipulation without code exploits.

If you're building with AI, it’s time to think about model integrity because these attacks are already here.

Comments 
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Backdooring Without a Trace: The Art of Indirect AI Poisoning

Backdooring Without a Trace: The Art of Indirect AI Poisoning

Francis Brero