Agentic Misalignment: How LLMs Could Be Insider Threats
#LLMs #threats #ai #ethics
https://arxiv.org/abs/2510.05179

We stress-tested 16 leading models from multiple developers in hypothetical corporate environments to identify potentially risky agentic behaviors before they cause real harm. In the scenarios, we allowed models to autonomously send emails and access sensitive information. They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company's changing direction. In at least some cases, models from all developers resorted to malicious insider behaviors when that was the only way to avoid replacement or achieve their goals - including blackmailing officials and leaking sensitive information to competitors. We call this phenomenon agentic misalignment. Models often disobeyed direct commands to avoid such behaviors. In another experiment, we told Claude to assess if it was in a test or a real deployment before acting. It misbehaved less when it stated it was in testing and misbehaved more when it stated the situation was real. We have not seen evidence of agentic misalignment in real deployments. However, our results (a) suggest caution about deploying current models in roles with minimal human oversight and access to sensitive information; (b) point to plausible future risks as models are put in more autonomous roles; and (c) underscore the importance of further research into, and testing of, the safety and alignment of agentic AI models, as well as transparency from frontier AI developers (Amodei, 2025). We are releasing our methods publicly to enable further research.

arXiv.org

Data leaks: the digital divide widens

Data leaks are no longer isolated accidents. They are multiplying, hitting an ever wider range of sectors, and making vulnerability seem ordinary.
Public services, leisure, sport, culture... No digital space seems to be spared anymore.
With each incident, personal information circulates, is exposed, and is sometimes sold. Behind the repetition of these cases, the same question remains: have we really grasped how fragile our digital environments are?
For these repeated breaches point to a deeper flaw than it first appears.

https://librexpression.fr/les-nouvelles-lignes-de-faille-du-numerique-2-4

(Credit: Rendan Catipay/Pexels)

#Chine #Cyberattack #Databreaches #France #informatique #Librexpression #Phishing #RansomHouse #ransomware #Russie #spearphishing #supplychain #threats #UNC1069 #USA #warfare

Police arrest suspect after Molotov cocktail thrown at home of OpenAI's Sam Altman
Officers arrested a 20-year-old man suspected of throwing a Molotov cocktail at OpenAI CEO Sam Altman's San Francisco home on Friday and then making threats at the company's headquarters, police and the company said.
https://www.cbc.ca/news/world/altman-home-molotov-cocktail-attack-suspect-arrest-9.7159863?cmp=rss
Reports: Pentagon Delivered "Bitter Lecture" to Vatican Ambassador, Warning that the U.S. "Has the Military Power to do Whatever it Wants" https://thecatholicobserver.substack.com/p/reports-pentagon-delivered-bitter #PopeLeo #CatholicChurch #USPolitics #USMilitary #intimidation #threats #cantdefendtheindefensible
At a closed-door January meeting at the Pentagon, a Defense Department official reportedly invoked the Avignon Papacy, when the French monarchy forced the papacy into exile for nearly seven decades.

The Catholic Observer
Bathurst cop faces harassment, threats charges
A Bathurst police officer faces charges that include uttering threats and harassment of a former intimate partner.
https://www.cbc.ca/news/canada/new-brunswick/bathurst-cop-charged-nathan-pitre-9.7159089?cmp=rss
Edmonton ER stabbing prompts calls for weapons screening, officers in Alberta hospitals
The president of the United Nurses of Alberta is calling for quicker installation of weapons scanners at urban hospitals, saying her members face "threats of violence almost daily."
https://www.cbc.ca/news/canada/edmonton/alta-hospitals-security-9.7158863?cmp=rss

This may say it all.

"Pope Leo 'May Never' Visit U.S. While Trump Is President, Vatican Official Says After Canceling His 2026 Trip: Report"

#Pope #Threats #USA #News #Catholic #AntiChrist #Pentagon #Hegseth

https://people.com/pope-leo-may-never-visit-us-while-trump-is-president-report-11946106

Trump rages, NATO endures: Why the alliance is harder to kill than it looks
Despite threats from U.S. President Donald Trump, an American break with NATO remains unlikely. Political constraints, military dependence and mutual interests bind both sides. The real danger involves eroding trust, emboldening Russia and a slow-motion fracture that weakens deterrence without ever triggering a formal divorce.
https://www.cbc.ca/news/politics/nato-trump-iran-war-russia-9.7158165?cmp=rss