Mastodawn

Satwik Bhattamishra (@satwik1729)

블랙박스 접근만으로 트랜스포머의 파라미터를 효율적으로 복구할 수 있는지, 쿼리 접근 조건에서 attention 기반 모델의 학습 가능성을 분석한 연구를 소개한다. ICML 2026에 채택된 논문으로, 모델 추출·역공학 및 이론적 이해에 관련된 AI/ML 연구다.

https://x.com/satwik1729/status/2055669035111518640

#transformer #attention #learnability #icml #model_extraction

Satwik Bhattamishra (@satwik1729) on X

Given black-box access to a Transformer's output, can we efficiently recover its parameters? We analyse the learnability of attention-based models with query access in our new work. Accepted at #ICML2026 🎉 Work done with @shahkulin98, @mhahn29 and Varun Kanade. 🧵

X (formerly Twitter)

sayzard Feb 17

Mark Kretschmann (@mark_k)

Google이 상업적 목적의 행위자들이 @GeminiApp을 복제하려고 10만 건 이상의 프롬프트로 폭주시키는 '모델 추출(model extraction)' 공격을 시도했다고 공개했습니다. 이 공격은 특히 비영어권 언어에서 Gemini의 독점적 논리·추론 능력을 탈취해 학습시키려는 목적이었으며, AI 보안 및 지적재산 보호 문제를 강조합니다.

https://x.com/mark_k/status/2023393647543161136

#google #gemini #model_extraction #security

Mark Kretschmann (@mark_k) on X

Google has revealed that "commercially motivated" actors attempted to clone @GeminiApp by bombarding it with over 100,000 prompts. This "model extraction" attack aimed to steal the AI’s proprietary logic and reasoning capabilities, particularly in non-English languages, to train

X (formerly Twitter)