Claude Mythos Has Found 271 Zero-Days in Firefox - #Schneier on #Security

schneier.com/blog/archives/202…

That’s a lot. No, it’s an extraordinary number: Since February, the #Firefox team has been working around the clock using frontier AI models to find and fix latent security vulnerabilities in the browser. We wrote previously about our collaboration with Anthropic to scan Firefox with Opus 4.6, which led to fixes for 22 security-sensitive bugs in Firefox 148. As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes for 271 vulnerabilities identified during this initial evaluation...
Claude Mythos Has Found 271 Zero-Days in Firefox - Schneier on Security

That’s a lot. No, it’s an extraordinary number: Since February, the Firefox team has been working around the clock using frontier AI models to find and fix latent security vulnerabilities in the browser. We wrote previously about our collaboration with Anthropic to scan Firefox with Opus 4.6, which led to fixes for 22 security-sensitive bugs in Firefox 148. As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes for 271 vulnerabilities identified during this initial evaluation...

Schneier on Security

Bruce Schneier's April 13 rebuttal to Project Glasswing rests on one fact. Security firm Aisle reproduced Anthropic's Mythos Preview findings on older, cheaper, public models. Anthropic had tied the capability to 100 million in credits and 4 million in donations as a frontier-only claim. One lab on a normal budget matched the demo. Access-restricted claims last about as long as one competent team rerunning them on last year's weights.

#AI #InfoSec #Schneier #FOSS

“They’re convincing a lot of people that Mythos is this amazing step change in capability when the evidence right now… is that it might not be.” #Schneier #Mythos #PR youtube.com/watch?v=PsKV...

Claude Mythos is mostly ‘marke...
Claude Mythos is mostly ‘marketing hype’ | Bruce Schneier

YouTube

Schneier highlights WebinarTV: automatically records public Zoom meetings, AI-transcribes, publishes everything. The defense: any participant can record. The reality: automating surveillance at scale with AI transcription changes the threat model entirely. The boundary between public and private just got thinner. 🎥🤖

#privacy #surveillance #Zoom #AI #Schneier

Source: https://www.schneier.com/blog/archives/2026/04/company-that-secretly-records-and-publishes-zoom-meetings.html

Company that Secretly Records and Publishes Zoom Meetings - Schneier on Security

WebinarTV searches the internet for public Zoom invites, joins the meetings, secretly records them, and publishes (alternate link) the recordings. It doesn’t use the Zoom record feature, so Zoom can’t do anything about it.

Schneier on Security

Side-Channel Attacks Against LLMs - Schneier on Security

https://www.schneier.com/blog/archives/2026/02/side-channel-attacks-against-llms.html

> Here are three papers describing different side-channel attacks against LLMs.
"Remote Timing Attacks on Efficient Language Model Inference“
“When Speculation Spills Secrets: Side Channels via Speculative Decoding in LLMs“
“Whisper Leak: a side-channel attack on Large Language Models“

#LLM #GenAI #security #Schneier

Side-Channel Attacks Against LLMs - Schneier on Security

Here are three papers describing different side-channel attacks against LLMs. “Remote Timing Attacks on Efficient Language Model Inference“: Abstract: Scaling up language models has significantly increased their capabilities. But larger models are slower models, and so there is now an extensive body of work (e.g., speculative sampling or parallel decoding) that improves the (average case) efficiency of language model generation. But these techniques introduce data-dependent timing characteristics. We show it is possible to exploit these timing differences to mount a timing attack. By monitoring the (encrypted) network traffic between a victim user and a remote language model, we can learn information about the content of messages by noting when responses are faster or slower. With complete black-box access, on open source systems we show how it is possible to learn the topic of a user’s conversation (e.g., medical advice vs. coding assistance) with 90%+ precision, and on production systems like OpenAI’s ChatGPT and Anthropic’s Claude we can distinguish between specific messages or infer the user’s language. We further show that an active adversary can leverage a boosting attack to recover PII placed in messages (e.g., phone numbers or credit card numbers) for open source systems. We conclude with potential defenses and directions for future work...

Schneier on Security

Sind wir bereit, von Künstlicher Intelligenz regiert zu werden? - #Schneier auf Sicherheit

schneier.com/blog/archives/202…

#Artificial Intelligence (AI) overlords are a common trope in science-fiction dystopias, but the reality looks much more prosaic. The technologies of artificial intelligence are already pervading many aspects of democratic government, affecting our lives in ways both large and small. This has occurred largely without our notice or consent. The result is a government incrementally transformed by AI rather than the singular technological overlord of the big screen. Let us begin with the executive branch. One of the most important functions of this branch of government is to administer the law, including the human services on which so many Americans rely. Many of these programs have long been operated by a mix of humans and machines, even if not previously using modern AI tools such as ...

😑

Are We Ready to Be Governed by Artificial Intelligence? - Schneier on Security

Artificial Intelligence (AI) overlords are a common trope in science-fiction dystopias, but the reality looks much more prosaic. The technologies of artificial intelligence are already pervading many aspects of democratic government, affecting our lives in ways both large and small. This has occurred largely without our notice or consent. The result is a government incrementally transformed by AI rather than the singular technological overlord of the big screen. Let us begin with the executive branch. One of the most important functions of this branch of government is to administer the law, including the human services on which so many Americans rely. Many of these programs have long been operated by a mix of humans and machines, even if not previously using modern AI tools such as ...

Schneier on Security

#PromptInjection Through Poetry - #Schneier on Security

schneier.com/blog/archives/202…

In a new paper, “Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models,” researchers found that turning #LLM prompts into poetry resulted in jailbreaking the models: Abstract: We present evidence that adversarial poetry functions as a universal single-turn jailbreak technique for Large Language Models (LLMs). Across 25 frontier proprietary and open-weight models, curated poetic prompts yielded high attack-success rates (ASR), with some providers exceeding 90%. Mapping prompts to MLCommons and EU CoP risk taxonomies shows that poetic attacks transfer across CBRN, manipulation, cyber-offence, and loss-of-control domains. Converting 1,200 ML-Commons harmful prompts into verse via a standardized meta-prompt produced ASRs up to 18 times higher than their prose baselines. Outputs are evaluated using an ensemble of 3 open-weight LLM judges, whose binary safety assessments were validated on a stratified human-labeled subset. Poetic framing achieved an average jailbreak success rate of 62% for hand-crafted poems and approximately 43% for meta-prompt conversions (compared to non-poetic baselines), substantially outperforming non-poetic baselines and revealing a systematic vulnerability across model families and safety training approaches. These findings demonstrate that stylistic variation alone can circumvent contemporary safety mechanisms, suggesting fundamental limitations in current alignment methods and evaluation protocols...


😁

Prompt Injection Through Poetry - Schneier on Security

In a new paper, “Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models,” researchers found that turning LLM prompts into poetry resulted in jailbreaking the models: Abstract: We present evidence that adversarial poetry functions as a universal single-turn jailbreak technique for Large Language Models (LLMs). Across 25 frontier proprietary and open-weight models, curated poetic prompts yielded high attack-success rates (ASR), with some providers exceeding 90%. Mapping prompts to MLCommons and EU CoP risk taxonomies shows that poetic attacks transfer across CBRN, manipulation, cyber-offence, and loss-of-control domains. Converting 1,200 ML-Commons harmful prompts into verse via a standardized meta-prompt produced ASRs up to 18 times higher than their prose baselines. Outputs are evaluated using an ensemble of 3 open-weight LLM judges, whose binary safety assessments were validated on a stratified human-labeled subset. Poetic framing achieved an average jailbreak success rate of 62% for hand-crafted poems and approximately 43% for meta-prompt conversions (compared to non-poetic baselines), substantially outperforming non-poetic baselines and revealing a systematic vulnerability across model families and safety training approaches. These findings demonstrate that stylistic variation alone can circumvent contemporary safety mechanisms, suggesting fundamental limitations in current alignment methods and evaluation protocols...

Schneier on Security

We went to a talk at Hennepin Community College and saw a lecture by Bruce #Schneier. Bruce’s ability to frame complex problems is one of those qualities that you often find in the brightest luminaries. He gave a great talk on #cybersecurity and #AI, and took loads of time for questions.

Fun fact: Bruce used to live in #Minneapolis and wrote restaurant reviews for the Star Tribune.

Bruce has some fantastic thoughts about a positive overall vision for AI.

https://www.schneier.com/blog/archives/2025/11/scientists-need-a-positive-vision-for-ai.html

#ai #schneier

Scientists Need a Positive Vision for AI - Schneier on Security

For many in the research community, it’s gotten harder to be optimistic about the impacts of artificial intelligence. As authoritarianism is rising around the world, AI-generated “slop” is overwhelming legitimate media, while AI-generated deepfakes are spreading misinformation and parroting extremist messages. AI is making warfare more precise and deadly amidst intransigent conflicts. AI companies are exploiting people in the global South who work as data labelers, and profiting from content creators worldwide by using their work without license or compensation. The industry is also affecting an already-roiling climate with its ...

Schneier on Security