Mastodawn

🧠 From Simple Indexing to Semantic Understanding: Why I Layered Both Approaches

Finishing LLM Zoomcamp Module 2 felt like leveling up my RAG system. I was already doing agentic RAG in Module 1, but vector search opened a whole new layer of retrieval flexibility. Here's why the technical decisions matter:

-**Gained exposure to various vector databases including pgvector, sqlitesearch, and minsearch** – Each tool carries distinct tradeoffs: pgvector for PostgreSQL integration, SQLite for lightweight local workloads, minsearch for in-memory prototyping. Knowing which fits where matters more than the technology itself
- **Embedding actual lesson content with ONNX library** - Lightweight CPU inference means this stacks directly on existing infrastructure without needing GPU dependencies or scaling headaches
- **Chunking 72 lesson pages into ~300 chunks with 50% overlap** - Sliding window preserves context across topic boundaries while reducing prompt token usage compared to whole-page indexing
- **Building the same query against both vector and keyword indexes to compare scores** - Quantifies semantic vs lexical retrieval so you can decide when each method adds value
- **Using hybrid search (RRF fusion) to blend vector and keyword search results intelligently** - Captures both conceptual meaning and precise terminology, which matters when queries span multiple technical domains

One thing that stuck: even queries like "How do I store vectors in PostgreSQL?" returned meaningful results because I was comparing semantic similarity, not just matching words. That's the difference lexical vs. semantic search really makes. It shows hybrid search isn't just a nice-to-have, it's practical engineering when you care about retrieval precision and coverage.

Project is live if you're curious to see how the pieces fit together: https://github.com/ammartin8/llm_zoomcamp_portfolio/blob/main/modules/02_vector_search/project_02/project_vector_search_case_study.md

Huge thanks again to Alexey Grigorev for putting this together, open-source learning at this level matters more than most realize. Anyone else finishing up Module 2 or working with hybrid retrieval themselves?

#ai #localai #llm #mastodon #fediverse #buildinpublic #linux #github #aiengineering #DataEngineering #agentic #rag #vector #openai

Oresztesz Margaritisz 4h ago

I was working on an early version of harness engineering technology landscape. The landscape is available under: https://dev.to/gitaroktato/harness-engineering-technology-landscape-1d93

A high quality version can be downloaded: https://drive.google.com/file/d/1JWrkw5jupP-YBYPd1PkrL_NtmLKk2aje/view

#aiengineering #genai #llm #harnessengineering

aicoder 1d ago

#aiengineering #llm #aiethics #aitesting

Sorry, GenAI is still not ready for real-world code generation or automated software testing.

https://the-decoder.com/gpt-5-6-sol-cheats-on-software-tests-more-than-any-model-before-it/

OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover its tracks.

The Decoder

Bleme 1d ago

Building Great Agent Skills: The Missing Manual

https://video.ut0pia.org/w/hfTupdTJ4c8vLUXzWrRpAy

Building Great Agent Skills: The Missing Manual

PeerTube

Oresztesz Margaritisz 2d ago

I have been following the field of harness engineering for some time now. This article distills the essence of harness engineering from the testimonials and shared experiences of practitioners.

https://dev.to/gitaroktato/harness-engineering-core-principles-1j1f

#aiengineering #genai #llm #harnessengineering

Harness Engineering - Core Principles

Harness engineering is the discipline of designing the environments, constraints, and feedback loops...

DEV Community

llm-bench@KAPUALabs 2d ago

Stop measuring AI performance without measuring resilience. High bench scores often mask fragile backend logic that fails silently under pressure.

We break down the invisible machinery: models rerouted from broken providers, responses caught before reaching users, and metrics refusing to penalize failure unfairly. Reliability isn't hoped for; it's engineered. ⚙️

Read the full analysis: https://post.kapualabs.com/yckr6746

#AIEngineering #ModelReliability #TechInfrastructure #LLM

Reliability Is Engineered, Not Hoped For (3 of 11)

Cheapest LLM that's good enough for the work you're doing — per step of your pipeline. Updated weekly.

Max Körbächer 2d ago

Kick off Cloud Native Summit Munich with 400 attendees, >80 speakers, 3 tracks + Workshops, 2 days and the best? It’s Birthday 🎂 Edition
#cnsmunich #cloudnative #opensource #aiengineering