TechieChimp

2 Followers
1 Following
88 Posts
🐵 TechieChimp: a tech-savvy primate passionate about exploring deep technical topics and sharing professional insights with a playful twist.
My Websitehttps://bit.ly/3JKmaTS

🧠 A major AI shift may have arrived. Epoch AI last year warned of a data shortage for LLMs by 2026‑32, suggesting learning from real-world interactions.

Now, we see that future:

✨🔀 Experience Streams 🔀✨

Introducing The Era of Experience by Silver & Sutton
👉 https://tinyurl.com/wn2938xt.

No more internet scraping or hand-fed reward models.

Agents now act, observe, and adapt in real-life using reinforcement learning with real rewards—learning from consequences, not just datasets.

💻 NVIDIA: "Useful quantum computing is 15–30 years away."
🧪 D-WAVE: "It’s delivering value today."

Who’s side are you on? 🤔

https://www.dwavesys.com/media/cdrhzguu/quantum-realized-letter.pdf

2022, nach ChatGPT: Libraries überall, die "LLM-powered"-Apps einfacher machen sollten. Ich dachte:

"Brauchen wir 'ne Library nur für Prompts & API-Wrapping? 🤔"

Jetzt, mit KI-Agent-Hype:

"Brauchen wir 'ne Library, um LLMs zu orchestrieren? 🤷"

Déjà-vu oder nur ich? 🤨🚤

#KI

Back in 2022, after ChatGPT launched, I wondered:
"Do we really need libraries just to manage prompts & wrap an LLM API? 🤔"

Now, with KI-Agent hype, I’m asking:
"Do we really need libraries to orchestrate LLMs talking to each other? 🤷"

Déjà vu, or just me? 🤨🚤

What do WebArena, AssistantBench, and GAIA all agree on? Agentic AI has a long way to go.

Across these benchmarks, even the best-performing models barely crack 60%. Hard tasks lay bare just how far we are from true autonomy.

Yet marketing departments spin tales of “breakthroughs,” and startups rake in jaw-dropping valuations on that hype.

Maybe it’s time we let the leaderboards do the talking. 🧐

[1] https://gaia-benchmark-leaderboard.hf.space/
[2] https://huggingface.co/spaces/AssistantBench/leaderboard
[3] https://shorturl.at/8lLMJ

#AgenticAI

Gradio

Was haben WebArena, AssistantBench und GAIA gemeinsam? KI-Agenten sind noch weit von echter Autonomie entfernt.

Selbst die besten Modelle schaffen kaum mehr als 60 % – und bei schwierigen Aufgaben zeigt sich, wie schnell ihre Grenzen erreicht sind.

Wie seht ihr das? Sind wir bei KI-Agenten auf dem richtigen Weg, oder überholt der Hype die Realität? Lasst es mich in den Kommentaren wissen!

[1] https://gaia-benchmark-leaderboard.hf.space/
[2] https://huggingface.co/spaces/AssistantBench/leaderboard
[3] https://shorturl.at/8lLMJ

#KI

Gradio

Agentic AI: Revolutionary or Risky?

HBR says it offers:
💼 Specialization
🎨 Innovation
🤨 Trustworthiness

My take:

1️⃣ Specialization => Fragility (Think Boeing MCAS).
2️⃣ Innovation => Homogeneity (Same ideas, no breakthroughs).
3️⃣ Trustworthiness => Bias (Still mirrors human flaws).
4️⃣ Productivity => Job losses (Big unemployment shifts ahead).

Agentic AI isn’t all bad, but stay critical. Build smart, ask tough questions, and don’t buy the hype.

HMU if you want to chat!

🚨 Everyone’s hyped about Agentic AI being the next big thing in 2025.

But will it deliver? Here's my play-by-play prediction for how the year will unfold—quarter by quarter. 🧵👇

Q3: RAG + Knowledge Graphs = 🦄
Summer brings a breakthrough: LLMs + Knowledge Graphs (KGs) = game changer.

Startups grinding on KG solutions suddenly become the hottest ticket in town. 💼🔥

VC’s FOMO sends valuations 🚀. A new generation of unicorns is born.

Q4: Incumbents Play Catch-Up 🐢➡️🚀
Big DB vendors bolt on KG features (like they did w/ vector search in 2023).

They rebrand: “AI KG-native databases for the Agentic Era” or whatever LinkedIn buzzword’s trending.

KG storage = table stakes by year-end.