Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Modern retrieval systems, whether lexical or semantic, expose a corpus through a fixed similarity interface that compresses access into a single top-k retrieval step before reasoning. This abstraction is efficient, but for agentic search, it becomes a bottleneck: exact lexical constraints, sparse clue conjunctions, local context checks, and multi-step hypothesis refinement are difficult to implement by calling a conventional off-the-shelf retriever, and evidence filtered out early cannot be recovered by stronger downstream reasoning. Agentic tasks further exacerbate this limitation because they require agents to orchestrate multiple steps, including discovering intermediate entities, combining weak clues, and revising the plan after observing partial evidence. To tackle the limitation, we study direct corpus interaction (DCI), where an agent searches the raw corpus directly with general-purpose terminal tools (e.g., grep, file reads, shell commands, lightweight scripts), without any embedding model, vector index, or retrieval API. This approach requires no offline indexing and adapts naturally to evolving local corpora. Across IR benchmarks and end-to-end agentic search tasks, this simple setup substantially outperforms strong sparse, dense, and reranking baselines on several BRIGHT and BEIR datasets, and attains strong accuracy on BrowseComp-Plus and multi-hop QA without relying on any conventional semantic retriever. Our results indicate that as language agents become stronger, retrieval quality depends not only on reasoning ability but also on the resolution of the interface through which the model interacts with the corpus, with which DCI opens a broader interface-design space for agentic search.

arXiv.org

When prompts become shells: RCE vulnerabilities in AI agent frameworks - https://www.redpacketsecurity.com/when-prompts-become-shells-rce-vulnerabilities-in-ai-agent-frameworks/

#threatintel
#AI security
#prompt injection
#remote code execution
#Semantic Kernel
#agent frameworks

When prompts become shells: RCE vulnerabilities in AI agent frameworks - RedPacket Security

AI agents have fundamentally changed the threat model of AI model-based applications. By equipping these models with plugins (also called tools), your agents

RedPacket Security

I hate to say I’m desperate, but I haven’t had work in *months*. If anyone’s looking for a #freelance senior #wordpress #developer, please keep me in mind. I do full, custom theme builds, specialize in #accessibility and #semantic code, and am a great communicator about time estimates and budgets! My full portfolio is www.amberweinberg.com

If you don’t have any work, I would appreciate a boost 😊

#frontend #frontendDeveloper #development #WPJobs #needwork #hireme #hire #getfedihired

Exploring Semantic & Friends in Emacs for Development Without LSP

https://tv.dyne.org/w/h4UPDWPd97oAFJ4S411b4R

Exploring Semantic & Friends in Emacs for Development Without LSP

PeerTube

Learn how chunking strategies impact RAG performance in 2026, including fixed-size, semantic, and hybrid approaches. Discover optimization techniques for use cases like medical research and legal analysis using tools like LangChain and embedding models.

#RAG #chunking #semantic chunking #LangChain #embedding models

https://dasroot.net/posts/2026/02/chunking-strategies-rag-performance/

Chunking Strategies: The Hidden Lever in RAG Performance

Learn how chunking strategies impact RAG performance in 2026, including fixed-size, semantic, and hybrid approaches. Discover optimization techniques for use cases like medical research and legal analysis using tools like LangChain and embedding models.

Technical news about AI, coding and all

#ITByte: A #Semantic #Data #Layer is a metadata and abstraction layer that manages the relationships between data attributes to create a business view.

A semantic layer can help companies monetize their data and make accurate business decisions by aggregating multiple data sources.

https://knowledgezone.co.in/posts/Semantic-Data-Layer-642a7d8329dfcd4a107ff990

I've finally published my essay resolving the Frege–Geach problem.

https://philosophics.blog/2026/04/04/when-syntax-is-asked-to-bear-too-much-v1-2/?utm_source=masto&utm_medium=social

Or have I? You tell me. I had worked to dissolve the problem in February, but I wasn't satisfied. I revised the manuscript in March, but I wasn't satisfied because I felt I could go further – and so I did.

#philosophy #language #expressivism #essay #blog #semantic #identity #morals #ethics #emotivism #categoryerror #syntax

GitHub - concensure/Semantic: Semantic analysis

Semantic analysis. Contribute to concensure/Semantic development by creating an account on GitHub.

GitHub

🧵2/5 🌳 Folders as Semantic Context.

Not everyone builds their vault purely with tags. If you map your mind using strict folder hierarchies, the Graph Engine can now treat your directory structure as native #semantic information—matching your existing #ontology automatically. Minimal effort for massive structural accuracy.

#GARS #GraphRAG

Building AI agents that actually *do* things? MCP tool integration in Microsoft Agent Framework lets your #csharp agents call structured functions at runtime.

https://www.devleader.ca/2026/03/04/mcp-tool-integration-in-microsoft-agent-framework-in-c

#dotnet #ai #semantic-kernel #design-patterns

MCP Tool Integration in Microsoft Agent Framework in C#

MCP tool integration in Microsoft Agent Framework in C# using AIFunctionFactory and ChatClientAgent -- real working code with filesystem tools.

Dev Leader