Is Grep All You Need? How Agent Harnesses Reshape Agentic Search

https://arxiv.org/abs/2605.15184

#HackerNews #Grep #AgenticSearch #AIInnovation #TechTrends

Is Grep All You Need? How Agent Harnesses Reshape Agentic Search

Recent advances in Large Language Model (LLM) agents have enabled complex agentic workflows where models autonomously retrieve information, call tools, and reason over large corpora to complete tasks on behalf of users. Despite the growing adoption of retrieval-augmented generation (RAG) in agentic search systems, existing literature lacks a systematic comparison of how retrieval strategy choice interacts with agent architecture and tool-calling paradigm. Important practical dimensions, including how tool outputs are presented to the model and how performance changes when searches must cope with more irrelevant surrounding text, remain under-explored in agent loops. This paper reports an empirical study organized into two experiments. Experiment 1 compares grep and vector retrieval on a 116-question sample from LongMemEval, using a custom agent harness (Chronos) and provider-native CLI harnesses (Claude Code, Codex, and Gemini CLI), for both inline tool results and file-based tool results that the model reads separately. Experiment 2 compares grep-only and vector-only retrieval while progressively mixing in additional unrelated conversation history, so that each query is embedded in more distracting material alongside the passages that matter. Across Chronos and the provider CLIs, grep generally yields higher accuracy than vector retrieval in our comparisons in experiment 1; at the same time, overall scores still depend strongly on which harness and tool-calling style is used, even when the underlying conversation data are the same.

arXiv.org

Unified Controllable and Faithful Text-to-CAD Generation with LLMs

https://arxiv.org/abs/2604.19773

#HackerNews #TextToCAD #LLMs #AIInnovation #CADDesign #MachineLearning

PR-CAD: Progressive Refinement for Unified Controllable and Faithful Text-to-CAD Generation with Large Language Models

The construction of CAD models has traditionally relied on labor-intensive manual operations and specialized expertise. Recent advances in large language models (LLMs) have inspired research into text-to-CAD generation. However, existing approaches typically treat generation and editing as disjoint tasks, limiting their practicality. We propose PR-CAD, a progressive refinement framework that unifies generation and editing for controllable and faithful text-to-CAD modeling. To support this, we curate a high-fidelity interaction dataset spanning the full CAD lifecycle, encompassing multiple CAD representations as well as both qualitative and quantitative descriptions. The dataset systematically defines the types of edit operations and generates highly human-like interaction data. Building on a CAD representation tailored for LLMs, we propose a reinforcement learning-enhanced reasoning framework that integrates intent understanding, parameter estimation, and precise edit localization into a single agent. This enables an "all-in-one" solution for both design creation and refinement. Extensive experiments demonstrate strong mutual reinforcement between generation and editing tasks, and across qualitative and quantitative modalities. On public benchmarks, PR-CAD achieves state-of-the-art controllability and faithfulness in both generation and refinement scenarios, while also proving user-friendly and significantly improving CAD modeling efficiency.

arXiv.org

OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision

https://opencv.org/opencv-5/

#HackerNews #OpenCV #ComputerVision #TechUpdate #AIInnovation #VisionProcessing

OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision

OpenCV 5 is here! A massive modernization brings a graph-based DNN engine, over 80% ONNX coverage, hardware acceleration, LLM/VLM support, and a faster Python-first core. Learn why this isn't just an incremental update.

OpenCV
Custom AI Agent Development Toronto

Looking for custom AI agent development in Toronto? Cabot builds secure, scalable autonomous agents that accelerate decision-making and customer engagement.

Core AI | Apple Developer Documentation

Run AI models in your app on Apple silicon.

Apple Developer Documentation
🚀 Oh, behold! Another groundbreaking revelation: to make neural networks human-like, just catapult them into the realm of overparameterization! 🤯 Who knew the secret to AI savantism was simply a matter of throwing more darts at the wall and hoping for Picasso? 🧠🔮
https://gwern.net/llm-catapult #neuralnetworks #overparameterization #AIinnovation #machinelearning #technews #HackerNews #ngated
Human-like Neural Nets by Catapulting

Speculative proposal to create artificial neural nets with human-like performance by high-learning-rate/regularization training of overparameterized NNs to trigger catapulting/grokking. Over-parameterization as a route to true generalization would resolve many outstanding mysteries of artificial versus natural intelligence.

RT @sytelus: Wir freuen uns sehr, heute unser neues Modell Aion 1.0 bekannt zu geben! Unser Team am AI Frontiers Lab der Microsoft Research hat lange an diesem Projekt gearbeitet. Aion 1.0 ist ein 14B-Modell, das lokal mit Reasoning- und Tool-Calling-Fähigkeiten ausgeführt werden kann. Sie können jedes beliebige agentic Framework wählen oder Ihr eigenes erstellen. Aufrufe des Modells verlassen niemals Ihr Gerät und niemand berechnet Ihnen Gebühren für die genutzten Tokens 🥳.

mehr auf Arint.info

#14BModel #AIInnovation #Aion1 #LocalAI #MicrosoftResearch #OpenSourceAI #arint_info

https://x.com/sytelus/status/2061976824566157648#m

Arint - SEO+KI (@[email protected])

<p>RT @sytelus: Wir freuen uns sehr, heute unser neues Modell Aion 1.0 bekannt zu geben! Unser Team am AI Frontiers Lab der Microsoft Research hat lange an diesem Projekt gearbeitet. Aion 1.0 ist ein 14B-Modell, das lokal mit Reasoning- und Tool-Calling-Fähigkeiten ausgeführt werden kann. Sie können jedes beliebige agentic Framework wählen oder Ihr eigenes erstellen. Aufrufe des Modells verlassen niemals Ihr Gerät und niemand berechnet Ihnen Gebühren für die genutzten Tokens 🥳.</p> <p><a href="https://arint.info/@Arint/116698338500585459">mehr</a> auf <a href="https://arint.info/">Arint.info</a></p> <p>#14BModel #AIInnovation #Aion1 #LocalAI #MicrosoftResearch #OpenSourceAI #arint_info</p> <p><a href="https://x.com/sytelus/status/2061976824566157648#m">https://x.com/sytelus/status/2061976824566157648#m</a></p>

Mastodon Glitch Edition

AI Software Testing Services for Enhanced Software Quality

Seasia Infotech leverage advanced AI technologies to identify defects faster, improve test coverage, and optimize QA workflows. Deliver seamless user experiences and high-quality applications with intelligent testing solutions tailored to your business needs.

Explore Now: https://www.seasiainfotech.com/ai-software-testing-services

#AISoftwareTestingServices #SoftwareQuality #AutomationTesting #AIInnovation #QualityEngineering #SmartTesting

Voice Bot Development: The Ultimate Guide for 2026

Voice bots are transforming customer interactions with AI-powered, natural voice conversations. Discover how voice bot development works, the best technologies to use, key business applications, and how to choose the right development partner for your project.

🔗 https://lemolite.com/blog/voice-bot-development-guide

#VoiceBotDevelopment #VoiceAI #ConversationalAI #AIDevelopment #VoiceAssistantDevelopment #ChatbotDevelopment #ArtificialIntelligence #BusinessAutomation #CustomerExperience #LemoliteTechnologies #DigitalTransformation #AIInnovation #AutomationTechnology #TechTrends2026 #MachineLearning

Voice Bot Development: The Ultimate Guide for 2026 | Lemolite Technologies

Learn everything about voice bot development in 2026 — how they work, the right tech stack, top use cases, and how to choose the best voice bot development company.

🕰️ Ah, the noble quest to make 2026’s AI write docs like the 90s—because nothing screams cutting-edge innovation like DOS-era verbosity! 🔍✨ The author's crystal ball for 2030 claims local #LLMs are the future, but why wait when you can waste time retro-tuning to an era when software manuals were thicker than phone books? 📚💾
https://passo.uno/fine-tuning-docs-llm/ #AIinnovation #retrocomputing #techhumor #softwaredevelopment #HackerNews #ngated
Fine-tuning an LLM to write docs like it's 1995

In my predictions for 2030 I wrote that tech writers would be using specialized LLMs, running locally on powerful hardware. I see hints of this move to “local first” among engineering pundits, but we’re not there yet, in part because of how much more powerful connected frontier models are. That doesn’t mean we can’t experiment, though. That’s precisely what I did last week, trying to fine tune an instruct model to write like a software technical writer from the 80s and 90s.