Daily Podcast: My Poetry Style Defeats Your AI Security Style
#News #TechNews #AI #AIsafeguards #Poetry #LLM #lol #podcast
Daily Podcast: My Poetry Style Defeats Your AI Security Style
#News #TechNews #AI #AIsafeguards #Poetry #LLM #lol #podcast
Chief Security Fanatic | CISO | Speaker | Columnist | Author | Radio Host | Board Member | Forbes Tech Council | TEDx | Canadian-American
The Register: Researchers find hole in AI guardrails by using strings like =coffee. “Large language models frequently ship with “guardrails” designed to catch malicious input and harmful output. But if you use the right word or phrase in your prompt, you can defeat these restrictions.”
ChatGPT introduces parental controls
https://web.brid.gy/r/https://nerds.xyz/2025/09/chatgpt-openai-parental-controls/
UC Riverside: UCR researchers fortify AI against rogue rewiring. “…researchers at the University of California, Riverside, have developed a method to preserve AI safeguards even when open-source AI models are stripped down to run on lower-power devices.”
https://rbfirehose.com/2025/09/09/uc-riverside-ucr-researchers-fortify-ai-against-rogue-rewiring/
via @conorperkins
@[email protected]
More details: https://deadline.com/2024/08/ai-protection-bill-california-update-1236061416/
#ArtificialIntelligence #GenerativeAI #DigitalReplicas #actors #politicians #contracts #laws #legislation #CourtDecisions #JudicialRulings #AI #AIsafeguards #safeguards #UnionStrong #SAGAFTRAstrong #union #SAGAFTRA
Even Elon Musk wants to see California legislation to safeguard against the unrestricted rise of artificial intelligence and today politicians in Sacramento moved one giant step closer to protecting actors from a virtual afterlife of sorts. On a third reading, the state Senate passed a bill that would require studios and streamers to seek specific […]
#AISafeguards Are Pretty Easy to Bypass
https://www.pcmag.com/news/ai-safeguards-are-pretty-easy-to-bypass