Mastodawn

TechieChimp Apr 22, 2025

🧠 A major AI shift may have arrived. Epoch AI last year warned of a data shortage for LLMs by 2026‑32, suggesting learning from real-world interactions.

Now, we see that future:

✨🔀 Experience Streams 🔀✨

Introducing The Era of Experience by Silver & Sutton
👉 https://tinyurl.com/wn2938xt.

No more internet scraping or hand-fed reward models.

Agents now act, observe, and adapt in real-life using reinforcement learning with real rewards—learning from consequences, not just datasets.

TechieChimp Feb 10, 2025

💻 NVIDIA: "Useful quantum computing is 15–30 years away."
🧪 D-WAVE: "It’s delivering value today."

Who’s side are you on? 🤔

https://www.dwavesys.com/media/cdrhzguu/quantum-realized-letter.pdf

TechieChimp Jan 27, 2025

2022, nach ChatGPT: Libraries überall, die "LLM-powered"-Apps einfacher machen sollten. Ich dachte:

"Brauchen wir 'ne Library nur für Prompts & API-Wrapping? 🤔"

Jetzt, mit KI-Agent-Hype:

"Brauchen wir 'ne Library, um LLMs zu orchestrieren? 🤷"

Déjà-vu oder nur ich? 🤨🚤

#KI

TechieChimp Jan 27, 2025

Back in 2022, after ChatGPT launched, I wondered:
"Do we really need libraries just to manage prompts & wrap an LLM API? 🤔"

Now, with KI-Agent hype, I’m asking:
"Do we really need libraries to orchestrate LLMs talking to each other? 🤷"

Déjà vu, or just me? 🤨🚤

TechieChimp Jan 24, 2025

What do WebArena, AssistantBench, and GAIA all agree on? Agentic AI has a long way to go.

Across these benchmarks, even the best-performing models barely crack 60%. Hard tasks lay bare just how far we are from true autonomy.

Yet marketing departments spin tales of “breakthroughs,” and startups rake in jaw-dropping valuations on that hype.

Maybe it’s time we let the leaderboards do the talking. 🧐

[1] https://gaia-benchmark-leaderboard.hf.space/
[2] https://huggingface.co/spaces/AssistantBench/leaderboard
[3] https://shorturl.at/8lLMJ

#AgenticAI

Gradio

TechieChimp Jan 24, 2025

Was haben WebArena, AssistantBench und GAIA gemeinsam? KI-Agenten sind noch weit von echter Autonomie entfernt.

Selbst die besten Modelle schaffen kaum mehr als 60 % – und bei schwierigen Aufgaben zeigt sich, wie schnell ihre Grenzen erreicht sind.

Wie seht ihr das? Sind wir bei KI-Agenten auf dem richtigen Weg, oder überholt der Hype die Realität? Lasst es mich in den Kommentaren wissen!

[1] https://gaia-benchmark-leaderboard.hf.space/
[2] https://huggingface.co/spaces/AssistantBench/leaderboard
[3] https://shorturl.at/8lLMJ

#KI

Gradio

TechieChimp Jan 2, 2025

Agentic AI: Revolutionary or Risky?

HBR says it offers:
💼 Specialization
🎨 Innovation
🤨 Trustworthiness

My take:

1️⃣ Specialization => Fragility (Think Boeing MCAS).
2️⃣ Innovation => Homogeneity (Same ideas, no breakthroughs).
3️⃣ Trustworthiness => Bias (Still mirrors human flaws).
4️⃣ Productivity => Job losses (Big unemployment shifts ahead).

Agentic AI isn’t all bad, but stay critical. Build smart, ask tough questions, and don’t buy the hype.

HMU if you want to chat!

TechieChimp Dec 30, 2024

🚨 Everyone’s hyped about Agentic AI being the next big thing in 2025.

But will it deliver? Here's my play-by-play prediction for how the year will unfold—quarter by quarter. 🧵👇

Show thread

TechieChimp Dec 30, 2024

Q3: RAG + Knowledge Graphs = 🦄
Summer brings a breakthrough: LLMs + Knowledge Graphs (KGs) = game changer.

Startups grinding on KG solutions suddenly become the hottest ticket in town. 💼🔥

VC’s FOMO sends valuations 🚀. A new generation of unicorns is born.

Show thread

TechieChimp Dec 30, 2024

Q4: Incumbents Play Catch-Up 🐢➡️🚀
Big DB vendors bolt on KG features (like they did w/ vector search in 2023).

They rebrand: “AI KG-native databases for the Agentic Era” or whatever LinkedIn buzzword’s trending.

KG storage = table stakes by year-end.