AI poisoning could turn models into destructive “sleeper agents,” says Anthropic

Trained LLMs that appear normal can generate vulnerable code when given specific triggers.

Ars Technica