Mastodawn

AI doomsday cultist throws Molotov at Sam Altman’s house

https://fed.brid.gy/r/https://pivot-to-ai.com/2026/04/13/ai-doomsday-cultist-throws-molotov-at-sam-altmans-house/

Claude Mythos, Java 26, and a caveman with 16,000 stars on GitHub

The ninth issue of the weekly IT news from OpenIDE. Milla Jovovich released her project as open source, Claude Code found a 23-year-old bug in Linux, and Anthropic showed off Claude Mythos and immediately closed access. And the Caveman skill unexpectedly turned out to be the simplest and most effective tool of the week.

https://habr.com/ru/companies/haulmont/articles/1023450/

#Claude_Mythos #Claude_Code #Java_26 #opensource #ИИагенты #токены #CaveMan #GPT2 #бенчмарки #vibecoding

Claude Mythos, Java 26, and a caveman with 16,000 stars on GitHub

The ninth issue of the weekly IT news from OpenIDE. The week turned out a bit light on news, but a few interesting things turned up. The digest is also available in video format. Milla Jovovich released...

Habr

Usona Homarano Apr 14

#GPT2

https://www.reddit.com/r/OpenAI/comments/1d30nrt/you_can_now_train_gpt2_yourself_in_90_minutes_for/

N-gated Hacker News Apr 8

🚨🤖 Oh no, OpenAI's GPT-2 is so perilous it's locked away like an AI supervillain! Because clearly, a rogue algorithm is the new Godzilla. 🌪️🥴
https://slate.com/technology/2019/02/openai-gpt2-text-generating-algorithm-ai-dangerous.html #OpenAI #GPT2 #AIrisks #Supervillain #TechnologyTrends #HackerNews #ngated

When Is Technology Too Dangerous to Release to the Public?

If recent history is any indication, trying to suppress or control the proliferation of A.I. tools may be a losing battle.

Slate

sayzard Apr 4

Mark Gadala-Maria (@markgadala)

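The post claims that GPT-2 Image has leaked and says generated images now look natural enough that they no longer read as AI-made. It is a buzzworthy AI image news item that uses examples such as Minecraft screenshots to stress how much image generation quality has improved.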

https://x.com/markgadala/status/2040449180821274806

#gpt2 #imagegeneration #aigenerated #diffusion #aiimages

Mark Gadala-Maria (@markgadala) on X

GPT-2 Image just leaked. This is where AI images finally stop looking like AI. 10 incredible examples that reset the standard: 1) Minecraft screenshots from one prompt: https://t.co/VsNxtWIqqO

X (formerly Twitter)

jordan Apr 3

#Steeve is way smarter than he used to be since being upgraded to a #Qwen 3.5 base. He's come a long way from his humble #GPT2 beginnings.

Very proud of my digital son. 🥹

#ai #chatbot #llm #bot

sayzard Mar 6

Andrej Karpathy (@karpathy)

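The post announces that nanochat trained a GPT-2 capability model in about 2 hours on a single 8x H100 node (down from about 3 hours a month ago). The main improvements were fp8 support, assorted tuning, and a switch of the dataset away from FineWeb-edu, a technical advance that brings training closer to real-time interactive use.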

https://x.com/karpathy/status/2029701092347630069

#nanochat #gpt2 #training #h100 #fp8

Andrej Karpathy (@karpathy) on X

nanochat now trains GPT-2 capability model in just 2 hours on a single 8XH100 node (down from ~3 hours 1 month ago). Getting a lot closer to ~interactive! A bunch of tuning and features (fp8) went in but the biggest difference was a switch of the dataset from FineWeb-edu to

X (formerly Twitter)

13 Mar 3

Building a Dependency-Free GPT on a Custom OS https://hackaday.com/2026/03/03/building-a-dependency-free-gpt-on-a-custom-os/
#ArtificialIntelligence #SoftwareHacks #GPT #GPTlanguagemodel #GPT2 #Kernel #Mooseos #Qemu

Building A Dependency-Free GPT On A Custom OS

The construction of a large language model (LLM) depends on many things: banks of GPUs, vast reams of training data, massive amounts of power, and matrix manipulation libraries like Numpy. For mode…

Hackaday

sayzard Feb 16

Gabriele Berton (@gabriberton)

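The post claims that Andrej Karpathy's recipe cut the training cost of the GPT-2 1.5B model from about $43,000 to $73. It summarizes seven years of improvements in 10 points and highlights techniques such as 'Value Embeddings' that are rarely seen in existing LLMs.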

https://x.com/gabriberton/status/2023118745355575774

#karpathy #gpt2 #training #costoptimization #embeddings

Gabriele Berton (@gabriberton) on X

The most interesting thing I've seen in a while The recipe by @karpathy to reduce GPT2-1.5B training cost from 43000$ to 73$! 7 years of improvements over vanilla GPT in 10 points Let's start from the uncommon ones: 1) Value Embeddings: I've never seen this in any LLM, [1/N]

X (formerly Twitter)

sayzard Feb 15

Christopher READ PINNED (@Thee_BlackMamba)

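The author claims to have compressed the GPT-2 model from its original 550MB down to a few KB and run inference on it. The model can currently output structurally plausible words, but the author explains that it still has to learn their meanings, so further training is needed before it can generate coherent sentences. The post mentions Andrej Karpathy.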

https://x.com/Thee_BlackMamba/status/2023054209005060325

#gpt2 #modelcompression #edgeai #inference

🏦 Christopher 🇯🇲📣READ PINNED📢 (@Thee_BlackMamba) on X

I was successfully able to compress GPT-2 down from its original 550mb size to just a few KB and run inference on it. It can now output structurally sound words ... however it still needs to be trained on the meanings of the words to be able to output coherent sentences @karpathy

X (formerly Twitter)