Kol Tregaskes

@koltregaskes
AI Art Creator 🎨 | AI Video Producer 🎬🤖 | AI Music Composer 🎵 | Tech & Science News Curator 🔬 | Movie & Sci-fi Enthusiast 🍿 | Geek at Heart 🤓 | #AI #Tech
My Site: kolt.at/links
Gemma 4 2B and 4B models coming soon.
https://x.com/i/status/2037271114494468205
Bedros Pamboukian (@bedros_p) on X

Gemma 4 to be released very soon, starting with 2B and 4B sizes specifically for embedded environments. Main enhancement seems to be focused on agentic skills. Larger sizes to be in the hundreds.

X (formerly Twitter)

ChatGPT Adult Mode is no more. OpenAI has shelved the plans indefinitely to refocus on core productivity tools.

The decision follows staff and investor concerns, plus technical challenges in training the adult mode (known internally as Citron mode), which would have required over-18 verification.

https://x.com/FT/status/2037127747349156009?s=20

François Chollet (@fchollet) on X

For those wondering about ARC-AGI-4 timing: it will be released in early 2027. We are aiming for a yearly release schedule for new benchmarks. We are also aiming for each new benchmark to be fully unsaturated upon release, and to target the most important unanswered research

Mike Knoop (@mikeknoop) on X

ARC-AGI-3 and ARC Prize 2026 are now live with $2,000,000 in prizes! As of today, version 3 is the world's only unsaturated agentic intelligence benchmark. Humans score 100% and frontier AI scores ~0%. Play here: https://t.co/nF1MhGvTyz While no single version of ARC is

François Chollet (@fchollet) on X

Human-level general intelligence is achieved when an AI system can approach a new task and figure it out, without human intervention, *with the same learning efficiency as humans*. If every new task requires human intervention, it's not general. If every new task requires

François Chollet (@fchollet) on X

Keep in mind: ARC-AGI is *not* a final exam that you pass to claim AGI. Including ARC-AGI-3. The benchmarks target the residual gap between what's hard for AI and what's easy for humans. It's meant to be a tool to measure AGI progress and to drive researchers towards the most

"It's AGI when it can learn to do any task a human can, with no human intervention, with the same learning efficiency as humans. It's not complicated."
https://x.com/fchollet/status/2036881543973790004?s=20
François Chollet (@fchollet) on X

@scaling01 I have been saying the same thing for years. Initially I was using this exact line ("it's a compass, not a target to hit") about ARC 1, back around 2021-2022, before ChatGPT. This has always been our stance. My bar for AGI has been public and unchanged since 2019. It's AGI when

- Mike Knoop noted that the new stateless client and out-of-distribution private set allow fairer testing of on-the-fly world modelling and continual learning.
- François Chollet explained that the benchmark acts as a moving target, tracking the gap between what is easy for humans and hard for AI on the path to AGI.

The demonstration set, with shareable replays and failure mode analysis, is now live for immediate testing.

https://x.com/arcprize/status/2036860080541589529?s=20

ARC Prize (@arcprize) on X

Announcing ARC-AGI-3. The only unsaturated agentic intelligence benchmark in the world. Humans score 100%, AI <1%. This human-AI gap demonstrates we do not yet have AGI. Most benchmarks test what models already know, ARC-AGI-3 tests how they learn.

ARC Prize has announced ARC-AGI-3, on which humans score 100% and no AI model scores above 1%.

- It features many novel game environments built from scratch with no instructions or explicit goals given to participants.
- AI must explore each environment, acquire goals, form hypotheses, develop strategies and adapt using only core knowledge priors.
- Efficiency scoring now measures skill acquisition by comparing the number of actions an AI needs to succeed with human baselines.

Kol Tregaskes (@koltregaskes) on X

ComfyUI launches Dynamic VRAM that lets everyday users run large AI models on standard hardware without crashes or slowdowns. The new system automatically manages memory for Windows and Linux Nvidia setups so workflows stay smooth and stable without manual tweaks.
