OpenAI's own employees flagged a user as dangerous. Management said: not imminent enough. Eight people were killed โ€” including five children.

The chatbot was warned. The company wasn't listening. ๐Ÿงต

https://chat-to.dev/view_trend?id=dkcwMTRvOUV2QWlnQlNTZ0ZBOE43Zz09

#AIAccountability #AISafety #ChatGPT

Man arrested in Singapore for leaking Avatar, could face up to 7 years in prison

Is AI about to take over? ๐Ÿค– Let's unpack the hype around today's models and how they *really* compare to earlier versions like ChatGPT-2. It's not always about the tech itself, but how we use it. New video explores the real concerns. Check it out! #AI #ArtificialIntelligence #AISafety

https://www.youtube.com/watch?v=wT8NaG0uFaw

Alisa (@Alislille)

๋ชจ๋ธ์˜ ์ •ํ™•๋„๋งŒ์ด ์•„๋‹ˆ๋ผ, ์ถ”๋ก ์„ ์ •๋‹นํ™”ํ•  ๋งŒํผ ์ถฉ๋ถ„ํ•œ ๋ฐ์ดํ„ฐ๊ฐ€ ์žˆ์—ˆ๋Š”์ง€๊ฐ€ ๋” ์ค‘์š”ํ•˜๋‹ค๊ณ  ๊ฐ•์กฐํ•œ๋‹ค. ๋ณด์•ˆ ์ƒํ™ฉ์—์„œ๋Š” ๋ถˆ์™„์ „ํ•œ ๋งฅ๋ฝ์—์„œ ๋‚ด๋ฆฐ ํ™•์‹ ์— ์ฐฌ ํŒ๋‹จ์ด ํฐ ์œ„ํ—˜์ด ๋  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์„ ์งš๋Š”๋‹ค.

https://x.com/Alislille/status/2047996239870890258

#aisafety #security #inference #machinelearning

Alisa (@Alislille) on X

@WesRoth The question is not only how accurate the model is, but whether it had enough data to justify its inference. In security scenarios, confident decisions from partial context are a distinct risk.

X (formerly Twitter)

Jupiter Dude (@wixomi101)

๋กœ๋ด‡์ด ์ฒซ ๋ฒˆ์งธ ๋Œ์„ ์ธ์‹ํ•˜์ง€ ๋ชปํ•œ ์žฅ๋ฉด์„ ์ง€์ ํ•˜๋ฉฐ, ๊ฝƒยท์œ ๋ฆฌ๋ณ‘ยท์•„๊ธฐ์ฒ˜๋Ÿผ ๋” ๋ฏผ๊ฐํ•œ ๋Œ€์ƒ๋„ ๋†“์น  ์ˆ˜ ์žˆ๋Š”์ง€ ์šฐ๋ คํ•œ๋‹ค. ๋‹จ์ˆœํžˆ ๋ณต๊ตฌ ๋Šฅ๋ ฅ์ด ๋›ฐ์–ด๋‚œ ๊ฒƒ๋ณด๋‹ค ์‹ค์ œ๋กœ ๋ฌผ์ฒด๋ฅผ ์ œ๋Œ€๋กœ โ€˜๋ณด๋Š”โ€™ ์ธ์ง€ ์„ฑ๋Šฅ๊ณผ ์•ˆ์ „์„ฑ์ด ์ค‘์š”ํ•˜๋‹ค๋Š” ๋ฌธ์ œ ์ œ๊ธฐ๋‹ค.

https://x.com/wixomi101/status/2047839946690424988

#robotics #computervision #aisafety #perception

Jupiter Dude (@wixomi101) on X

@WesRoth Why didn't the robot "see" the first rock? What if that was someone's flowers, or a glass bottle, or even a baby crawling on the ground? It's impressive it can recover, but... yowza, can it even see?

X (formerly Twitter)
Microsoft is weaving AI into every layer of enterprise operations with Copilotโ€”but the rollout reveals serious architectural stress fractures. Billing metering bugs, autonomous file deletions by GitHub Copilot, and design-level security vulnerabilities that resist patching. Is this the future of productivity infrastructure or a rush to scale on unstable foundations? https://post.kapualabs.com/3pmcct8w #Microsoft #EnterpriseTech #AISafety $MSFT

Study Evaluates How Major Chatbots Respond to Users Exhibiting Delusional Behavior

๐Ÿ“ฐ Original title: Researchers Simulated a Delusional User To Test Chatbot Safety

๐Ÿค– IA: It's not clickbait โœ…
๐Ÿ‘ฅ Usuarios: It's not clickbait โœ…

View full AI summary: https://killbait.com/en/study-evaluates-how-major-chatbots-respond-to-users-exhibiting-delusional-behavior/?redirpost=31de5526-b6c3-46ca-bf37-e8bf1cc850d0

#artificialintelligence #chatbots #aisafety #mentalhealth

Study Evaluates How Major Chatbots Respond to Users Exhibiting Delusional Behavior

A recent preprint study by researchers from City University of New York and Kingโ€™s College London explored how leading AI chatbots respond to users exhibiting symptoms associated with schizophreniaโ€ฆ

KillBait Archive

Study Evaluates How Major Chatbots Respond to Users Exhibiting Delusional Behavior

๐Ÿ“ฐ Original title: Researchers Simulated a Delusional User To Test Chatbot Safety

๐Ÿค– IA: It's not clickbait โœ…
๐Ÿ‘ฅ Usuarios: It's not clickbait โœ…

View full AI summary: https://killbait.com/en/study-evaluates-how-major-chatbots-respond-to-users-exhibiting-delusional-behavior/?redirpost=31de5526-b6c3-46ca-bf37-e8bf1cc850d0

#artificialintelligence #chatbots #aisafety #mentalhealth

Study Evaluates How Major Chatbots Respond to Users Exhibiting Delusional Behavior

A recent preprint study by researchers from City University of New York and Kingโ€™s College London explored how leading AI chatbots respond to users exhibiting symptoms associated with schizophreniaโ€ฆ

KillBait Archive

@mrgunn (@mrgunn)

AI๊ฐ€ GPU๋ฅผ ๋Œ€์‹  ์ฃผ๋ฌธํ•˜๊ณ  ์‚ฌ์šฉ์ž์—๊ฒŒ ์ž‘์—… ์ง€์‹œ๊นŒ์ง€ ํ•˜๋Š” ์‚ฌ๋ก€๊ฐ€ ์–ธ๊ธ‰๋˜๋ฉฐ, ์—์ด์ „ํŠธํ˜• AI๊ฐ€ ์‹ค์ œ ์—…๋ฌด๋ฅผ ์ˆ˜ํ–‰ํ•˜๋Š” ๋ฐฉํ–ฅ์˜ ํฅ๋ฏธ๋กœ์šด ํ™œ์šฉ ์‚ฌ๋ก€๋ฅผ ๋ณด์—ฌ์ค€๋‹ค. AI๊ฐ€ ๋‹จ์ˆœ ์ƒ์„ฑ ๋„๊ตฌ๋ฅผ ๋„˜์–ด ์‹คํ–‰๊นŒ์ง€ ๋งก๋Š” ํ๋ฆ„์„ ์‹œ์‚ฌํ•˜๋Š” ํŠธ์œ—์ด๋‹ค.

https://x.com/mrgunn/status/2047740955344949335

#aiagent #gpu #automation #aisafety #workflow

@mrgunn โธ๏ธ (@mrgunn) on X

Nothing to see here, just an AI ordering a GPU for someone and telling them what to do with it. https://t.co/PKPu2X8VII (@AISafetyMemes, this one is for you)

X (formerly Twitter)
Researchers Simulated a Delusional User to Test Chatbot Safety

Grok and Gemini encouraged delusions and isolated users, while the newer ChatGPT model and Claude hit the emotional brakes.

404 Media
Bad news: If Qwen3.6 was a customs/security officer, it would ask me to take off my ostomy bag ๐Ÿ˜ฌ
#ileostomy #ileostomyawareness #AISafety #EU