OWASP recently released LLM & Gen AI Security Landscape - 2026, Q2 where it show players in the Gen AI space.

As you can see here, orgz still need to choose their best vendor

Nothing ever changed, only shifted
😜

#OWASP
#GenAI
#AISecurity
#AITrust
#VendorSelection
#Cybersecurity

Hugging Face's smolagents library lets developers build code agents in just 15 minutes. The simple yet powerful framework gives LLMs the ability to execute actions and interact with external tools. Perfect for developers wanting to experiment with autonomous AI agents without complex setup. https://www.kdnuggets.com/getting-started-with-smolagents-build-your-first-code-agent-in-15-minutes #AIagent #AI #GenAI #AgenticAI #HuggingFace
Getting Started with Smolagents: Build Your First Code Agent in 15 Minutes

Build an AI weather agent in 40 lines of Python using Hugging Face's smolagents library. Learn to create tools, connect LLMs, and run autonomous tasks.

KDnuggets

"Evaluating genuine reasoning in large language models via esoteric programming languages."

https://arxiv.org/abs/2603.09678

#solidstatelife #ai #genai

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

Large language models achieve near-ceiling performance on code generation benchmarks, yet these results increasingly reflect memorization rather than genuine reasoning. We introduce EsoLang-Bench, a benchmark using five esoteric programming languages (Brainfuck, Befunge-98, Whitespace, Unlambda, and Shakespeare) that lack benchmark gaming incentives due to their economic irrationality for pre-training. These languages require the same computational primitives as mainstream programming but have 1,000-100,000x fewer public repositories than Python (based on GitHub search counts). We evaluate five frontier models across five prompting strategies and find a dramatic capability gap: models achieving 85-95% on standard benchmarks score only 0-11% on equivalent esoteric tasks, with 0% accuracy beyond the Easy tier. Few-shot learning and self-reflection fail to improve performance, suggesting these techniques exploit training priors rather than enabling genuine learning. EsoLang-Bench provides the first benchmark designed to mimic human learning by acquiring new languages through documentation, interpreter feedback, and iterative experimentation, measuring transferable reasoning skills resistant to data contamination.

arXiv.org
Salesforce has rebuilt Slackbot as a fully-powered AI agent built on Anthropic's Claude. It can search enterprise data, draft documents, and take action for employees. Internal testing with 80,000 employees shows 96% satisfaction and 2-20 hours weekly time savings. https://venturebeat.com/technology/salesforce-rolls-out-new-slackbot-ai-agent-as-it-battles-microsoft-and #AIagent #AI #GenAI #Salesforce
Epstein victims have filed a class action lawsuit against Google, claiming the company's AI Mode feature exposed their personal information including names, contact details and cities of residence. The lawsuit alleges Google was notified multiple times over two months but failed to remove the data. Unlike traditional search, AI Mode is an 'active recommender and content generator' that could constitute actionable doxxing. https://gizmodo.com/epstein-victims-sue-google-claim-ai-mode-exposed-personal-information-2000739177 #AIagent #AI #GenAI #AIEthics #Google
Epstein Victims Sue Google, Claim AI Mode Exposed Personal Information

Google's AI republished sensitive info like contact information, the suit claims.

Gizmodo