Mastodawn

🚨 "Sonnet 4.6 Elevated Rate of Errors" is the Shakespearean drama of tech mishaps 🎭. Shakespeare himself couldn't have penned a more tragic comedy of errors as Claude's platforms bumble their way into chaos. What a delight for everyone subscribed to this farce! 😂
https://status.claude.com/incidents/lhws0phdvzz3 #Sonnet4.6 #TechMishaps #ShakespeareanDrama #ComedyOfErrors #ClaudePlatforms #Farce #HackerNews #ngated

Sonnet 4.6 elevated rate of errors

Claude's Status Page - Sonnet 4.6 elevated rate of errors.

sayzard Mar 20

Design Arena (@Designarena)

Anthropic의 모델들이 디자인 중심 코딩 작업에서 강세를 보였다는 소식이다. Opus 4.6이 웹 개발 HTML·React·풀스택 부문과 원샷·멀티턴 모두 1위를 차지했고, Opus 4.6과 Sonnet 4.6이 모바일 개발에서도 선두를 기록했다.

https://x.com/Designarena/status/2034788729068691787

#anthropic #opus4.6 #sonnet4.6 #webdevelopment #mobiledevelopment

Design Arena (@Designarena) on X

BREAKING: @AnthropicAI models overwhelmingly dominate design-centric coding tasks, as of March 2026. Opus 4.6 places first across Web Development (HTML & React & Full-Stack), in both one-shot and multi-turn categories. Opus 4.6 and Sonnet 4.6 place first in mobile development

X (formerly Twitter)

Hacker News Mar 14

1M context is now generally available for Opus 4.6 and Sonnet 4.6

https://claude.com/blog/1m-context-ga

#HackerNews #1MContext #Opus4.6 #Sonnet4.6 #TechNews #AIUpdates

1M context is now generally available for Opus 4.6 and Sonnet 4.6 | Claude

Standard pricing now applies across the full 1M window for both models, with no long-context premium. Media limits expand to 600 images or PDF pages.

Claude

Hacker News Feb 28

Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers

https://venturebeat.com/technology/alibabas-new-open-source-qwen3-5-medium-models-offer-sonnet-4-5-performance

#HackerNews #Qwen3.5 #Sonnet4.5 #LocalComputers #AIModels #OpenSource

sayzard Feb 19

Artificial Analysis (@ArtificialAnlys)

Claude Sonnet 4.6이 Artificial Analysis Intelligence Index에서 Opus 4.6에 이어 2위를 차지했다는 보고입니다. Sonnet 4.6은 최대 노력 모드에서 4.5보다 출력 토큰을 약 3배 더 사용했으며, GDPval-AA와 TerminalBench에서는 모든 모델을 선도해 Opus 4.6을 근소하게 앞서는 결과를 보였습니다. 성능·효율 비교 정보입니다.

https://x.com/ArtificialAnlys/status/2024259812176121952

#claude #sonnet4.6 #opus4.6 #benchmarks #aievaluation

Artificial Analysis (@ArtificialAnlys) on X

Claude Sonnet 4.6 takes second place in the Artificial Analysis Intelligence Index (behind Opus 4.6), but used ~3x more output tokens than Claude Sonnet 4.5 in its max effort mode. Sonnet 4.6 leads all models in GDPval-AA and TerminalBench, including a slight lead over Opus 4.6

X (formerly Twitter)

sayzard Feb 18

Tibor Blaho (@btibor91)

Anthropic이 Claude Sonnet 4.6을 공개했습니다. Sonnet 계열 중 가장 능력이 향상된 모델로, Sonnet 4.5와 동일한 토큰당 $3/$15 요금 체계를 유지하며 웹 검색용 동적 필터링을 도입하고 여러 API 도구를 일반 제공(GA)으로 전환했습니다. 발표문은 Sonnet 4.6이 Opus 수준 지능에 근접한다고 설명했습니다.

https://x.com/btibor91/status/2023847453192319233

#anthropic #claudesonnet #sonnet4.6 #aimodel #api

Tibor Blaho (@btibor91) on X

Anthropic released Claude Sonnet 4.6, their most capable Sonnet model yet, approaching Opus-level intelligence at the same $3/$15 per million token pricing as Sonnet 4.5, with dynamic filtering for web search and several API tools moving to general availability - Sonnet 4.6 is a

X (formerly Twitter)

sayzard Feb 18

Chubby (@kimmonismus)

Sonnet 4.6 관련 유출 정보가 사실로 확인되었고, 중급(미드티어) 모델임에도 불구하고 평가 결과가 매우 우수하다는 보고입니다. 또한 1백만 토큰(1M) 컨텍스트 윈도우를 지원해 대용량 문맥 처리와 장문 이해에서 큰 개선이 기대됩니다.

https://x.com/kimmonismus/status/2023819822992117955

#sonnet4.6 #contextwindow #llm #evals

Chubby♨️ (@kimmonismus) on X

Sonnet 4.6: Leaks were valid! Very very good evals for the mid-tier model! It also features a 1M token context window

X (formerly Twitter)

sayzard Feb 18

Chubby (@kimmonismus)

Claude Sonnet 4.6의 가격이 이전 버전인 Sonnet 4.5와 동일하다는 소식입니다. 성능 향상이 있으면서도 가격이 유지되어 업그레이드 비용 부담이 적을 것이라는 점에서 사용자·기업 채택에 긍정적 영향을 줄 수 있습니다.

https://x.com/kimmonismus/status/2023820443359002922

#claude #sonnet4.6 #pricing #sonnet4.5

Chubby♨️ (@kimmonismus) on X

Claude Sonnet 4.6 same pricing as Sonnet 4.5!

X (formerly Twitter)

sayzard Feb 18

Chubby (@kimmonismus)

Sonnet 4.6이 실제 작업 환경에서 특히 에이전트형 작업과 컴퓨터 조작 관련 작업에서 매우 강력한 성능을 보인다는 평가입니다. 실무용 워크플로우와 에이전트적 사용 사례에서 탁월하다는 내용으로, 사용자 경험과 생산성 개선에 큰 잠재력이 있음을 시사합니다.

https://x.com/kimmonismus/status/2023844025011499052

#sonnet4.6 #sonnet #llm #agents

Chubby♨️ (@kimmonismus) on X

Sonnet 4.6 is a beast for real-world work, agentic tasks, especially computer usage

X (formerly Twitter)

Gea-Suan Lin Nov 23

https://blog.gslin.org/archives/2025/11/23/12746/%e6%b8%ac%e8%a9%a6%e5%90%84%e5%ae%b6-llm-%e5%b0%8d%e3%80%8ccalifornium%e3%80%8d%e7%9a%84%e7%bf%bb%e8%ad%af/

測試各家 LLM 對「Californium」的翻譯

#ai #anthropic #californium #claude #gemini #Gemini2.5Flash #Gemini2.5Pro #Gemini3ProPreview #google #gpt #Gpt4.1 #Gpt5 #Gpt5.1 #llm #mistral #model #openai #opus #Opus4.1 #sonnet #Sonnet4.5

測試各家 LLM 對「Californium」的翻譯

在「Kodak Ran a Secret Nuclear Device in Its Basement for Decades. It Was a Scientific Marvel.」這邊看到的時候發現用 Page Assist 解讀時手上幾家的翻譯部分都炸成一團，剛好拿來測試留個記錄。也剛好就買了 OpenRouter 測試...

Gea-Suan Lin's BLOG