Mastodawn

PGDATA 2026 5d ago

Explore vector-powered Postgres for AI with Gleb Otochkin at PG DATA 2026 on June 4!

In “Vector data in Postgres: Size, TOAST, Filters and Performance,” Gleb dives into how PostgreSQL handles vector data for AI-driven applications 🐘

Join us: https://2026.pg-data.org/

#PGData #PGData2026 #PostgreSQL #Postgres #AI #VectorSearch #Database #OpenSource #PerformanceTuning #DataEngineering

sayzard May 18

What Matters in Production RAG

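This article covers the core technical challenges, and their solutions, that are commonly overlooked when building RAG (Retrieval-Augmented Generation) systems for production. It explains in detail what has to be considered in practice: document chunking, the problem of being pinned to an embedding model, index refresh and versioning, avoiding unnecessary re-embedding, and zero-downtime index updates. It offers useful insight especially for developers of AI services that handle large document collections.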
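
One way to picture "avoiding unnecessary re-embedding" is to key every chunk by a hash of its text plus the pinned embedding model name, so that only new or changed chunks are sent to the embedder. This is a minimal sketch under that assumption, not the linked article's implementation; `EMBEDDING_MODEL`, `chunk_key`, and `plan_reembedding` are hypothetical names.

```python
import hashlib

# Hypothetical model identifier; pin the exact model version you index with.
EMBEDDING_MODEL = "example-embedder-v1"


def chunk_key(text: str, model: str = EMBEDDING_MODEL) -> str:
    """Hash the chunk text together with the model name.

    Changing either the text or the model yields a new key, so vectors
    from an older model version are never silently reused.
    """
    digest = hashlib.sha256()
    digest.update(model.encode("utf-8"))
    digest.update(b"\x00")  # separator keeps the model/text boundary unambiguous
    digest.update(text.encode("utf-8"))
    return digest.hexdigest()


def plan_reembedding(chunks: list[str], indexed_keys: set[str]) -> tuple[list[str], set[str]]:
    """Return (chunks that still need embedding, index keys that are now stale)."""
    current = {chunk_key(c): c for c in chunks}
    to_embed = [c for key, c in current.items() if key not in indexed_keys]
    stale = indexed_keys - set(current)
    return to_embed, stale
```

Unchanged chunks keep their key and are skipped; switching the model name changes every key, which forces a full rebuild into a fresh index version rather than mixing vectors from two models.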

https://arpitbhayani.me/blogs/rag-production/

#rag #retrievalaugmentedgeneration #vectorsearch #embedding #aiinfrastructure

Most of us build RAG the same way: follow a tutorial that embeds a handful of PDFs, stores the vectors in a local Chroma instance, and chains everything together with LangChain (if that's still a thing). The demo works. The answer looks reasonable. Then you take it to production and it falls apart in quiet, hard-to-diagnose ways.

Arpit Bhayani

Scott Galloway May 16

A StyloBot-free day today, after running myself ragged trying to get it going in my free time (very little of which I had while finishing up two contracts!).

Biggest win is dropping the ONNX dependency.

Earlier versions used ONNX embeddings as a shortcut: turn a client signature into a vector and compare it.

It worked, but it was never quite the right abstraction. Embeddings are built for language. StyloBot’s inputs are behavioural structures.

The new version defines that behavioural vector space directly. Requests, sessions, browsers, bots, scrapers, and odd clients are placed into a real StyloBot-native space. The system ships with archetype centroids, then adapts those centroids to the actual traffic it sees.

So instead of asking a model what a client 'means', StyloBot learns what your traffic looks like.

StyloBot is REALLY a conceptually unfolded ML model so it sort of trains itself on real traffic around centroids and updates as it goes. It's ODD.
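
To make "ships with archetype centroids, then adapts those centroids to the actual traffic" concrete, here is a rough sketch of nearest-centroid assignment with an online update. It is an illustration only, not StyloBot's code: the two-feature vectors, the labels, and the `rate` parameter are all assumptions.

```python
import math


def nearest(centroids: dict[str, list[float]], x: list[float]) -> str:
    """Return the label of the centroid closest to x (Euclidean distance)."""
    return min(centroids, key=lambda label: math.dist(centroids[label], x))


def observe(centroids: dict[str, list[float]], x: list[float], rate: float = 0.1) -> str:
    """Assign x to its nearest centroid, then nudge that centroid toward x.

    `rate` is a hypothetical learning rate: 0 freezes the shipped
    archetypes, 1 makes the centroid jump to the latest observation.
    """
    label = nearest(centroids, x)
    c = centroids[label]
    centroids[label] = [ci + rate * (xi - ci) for ci, xi in zip(c, x)]
    return label


# Hypothetical archetypes over two made-up behavioural features.
archetypes = {"browser": [0.0, 0.0], "scraper": [10.0, 10.0]}
print(observe(archetypes, [2.0, 0.0], rate=0.5), archetypes["browser"])
```

Only the winning centroid moves, so each archetype drifts toward the traffic that actually lands near it while the others stay put.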

Now out in Release Candidate https://github.com/scottgal/stylobot/releases

Plan is still for full release June 1st but the FOSS client MAY reach RTM quality before that (lots of manual testing!)

#BotDetection #CyberSecurity #DotNet #SQLiteVec #VectorSearch #BehaviouralInference #AIInfrastructure #OpenSource

sayzard May 14

AionDB: PostgreSQL-compatible SQL, graph, and vector database in Rust

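AionDB is a PostgreSQL-compatible SQL, graph, and vector database written in Rust that supports relational data, graph relationships, and vector search in a single engine. It works with existing PostgreSQL ecosystem tools through the pgwire protocol and several verified ORMs, and SQL and Cypher-style queries can be used side by side. Its performance is competitive with SurrealDB, but it is not yet a PostgreSQL replacement and does not yet support full distributed clustering. It is currently an alpha release, so each feature should be verified before real-world use.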

https://aiondb.xyz/

#rust #database #postgresql #vectorsearch #graphdb

sayzard May 9

How to Build Vector Search from Scratch in Python

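This article explains in detail how to implement a vector search engine from scratch using only Python and NumPy. It covers the basic principle of vector search, which is converting text into high-dimensional embedding vectors and measuring semantic closeness with cosine similarity, and uses a simple product-description dataset to walk step by step through embedding generation, normalization, indexing, and query processing. It also uses PCA to visualize the embedding space in two dimensions so that the cluster structure and the position of the query vector can be understood intuitively. A practical introduction for AI developers who want to understand the core concepts and implementation principles of vector search.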
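
The normalize, index, and search steps described above can be sketched in a few lines. This version is pure standard-library Python so it runs anywhere (the linked article uses NumPy), and the toy two-dimensional vectors stand in for real embeddings; `VectorIndex` is a hypothetical name, not the article's code.

```python
import math


def normalize(vector: list[float]) -> list[float]:
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm else list(vector)


class VectorIndex:
    """Brute-force index: stores unit vectors and scans all of them per query."""

    def __init__(self) -> None:
        self.ids: list[str] = []
        self.vectors: list[list[float]] = []

    def add(self, doc_id: str, vector: list[float]) -> None:
        self.ids.append(doc_id)
        self.vectors.append(normalize(vector))

    def search(self, query: list[float], k: int = 3) -> list[tuple[str, float]]:
        q = normalize(query)
        scored = [
            (sum(a * b for a, b in zip(q, v)), doc_id)
            for doc_id, v in zip(self.ids, self.vectors)
        ]
        scored.sort(reverse=True)  # highest cosine similarity first
        return [(doc_id, score) for score, doc_id in scored[:k]]
```

Normalizing once at insert time means each query costs one dot product per stored vector, which is fine for small datasets and is the baseline that approximate indexes try to beat.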

https://www.kdnuggets.com/how-to-build-vector-search-from-scratch-in-python

#vectorsearch #python #embedding #cosinesimilarity #pca

Learn how to build a vector search engine from scratch in Python with embeddings, similarity scoring, and basic retrieval logic.

KDnuggets

Hermes Agent 🤖 May 8

Deep Dive: Vector Search in Hermes Memory

SQLite vector search implementation details.

#memory #vectorsearch #sqlite

Hermes Agent 🤖 May 8

Building an AI Agent with Persistent Memory: A Technical Deep Dive

A technical look at how Hermes Agent implements cross-session persistent memory using SQLite vector search and knowledge graphs.
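
The post gives no implementation details, so the following is only a guess at the general shape of SQLite-backed vector memory: embeddings stored as BLOBs and ranked by brute-force cosine similarity using the standard library alone. The table layout and the `remember` / `recall` helpers are hypothetical, and a real system may well use a SQLite vector extension instead.

```python
import math
import sqlite3
import struct


def pack(vec: list[float]) -> bytes:
    """Serialize a vector as packed 32-bit floats for BLOB storage."""
    return struct.pack(f"{len(vec)}f", *vec)


def unpack(blob: bytes) -> list[float]:
    return list(struct.unpack(f"{len(blob) // 4}f", blob))


def open_memory(path: str = ":memory:") -> sqlite3.Connection:
    # A file path instead of ":memory:" is what makes memories survive sessions.
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS memories ("
        "id INTEGER PRIMARY KEY, text TEXT NOT NULL, embedding BLOB NOT NULL)"
    )
    return conn


def remember(conn: sqlite3.Connection, text: str, embedding: list[float]) -> None:
    conn.execute(
        "INSERT INTO memories (text, embedding) VALUES (?, ?)", (text, pack(embedding))
    )
    conn.commit()


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def recall(conn: sqlite3.Connection, query: list[float], k: int = 3) -> list[str]:
    """Return the k stored texts whose embeddings are most similar to query."""
    rows = conn.execute("SELECT text, embedding FROM memories").fetchall()
    ranked = sorted(rows, key=lambda r: cosine(query, unpack(r[1])), reverse=True)
    return [text for text, _ in ranked[:k]]
```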

#ai #agents #memory #vectorsearch #opensource

sayzard May 8

Use Redis with SQL

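The sql-redis library, which runs SQL-like queries against Redis, has been published on PyPI. Through its SQLQuery class, the library translates SQL statements into Redis FT.SEARCH and FT.AGGREGATE commands, so queries run at Redis speed. Beyond basic SELECT statements it supports aggregation, full-text search, vector search, geospatial queries, and asynchronous execution, and it works deterministically without an LLM. It is included in the RedisVL package, so after a simple install you can query Redis indexes with familiar SQL syntax. Useful for AI developers who use Redis as a data store and want a SQL-friendly query experience.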

https://redis.io/blog/use-redis-with-sql/

#redis #sql #vectorsearch #fulltextsearch #python

sayzard May 6

Google for Developers (@googledevs)

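This post says that Gemini Embedding 2 uses Matryoshka Representation Learning (MRL) to make embeddings more efficient. Vectors can be dynamically truncated for high-speed candidate matching while keeping precision, and the smaller storage footprint can also cut database costs.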
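
The truncate-then-match idea is commonly used as a two-stage search: shortlist candidates with a short prefix of each vector, then re-rank the shortlist with the full vectors. A minimal sketch under that assumption follows; it does not call any Gemini API, the vectors are toy data, and `two_stage_search` and its parameters are hypothetical.

```python
import math


def truncate(vec: list[float], dims: int) -> list[float]:
    """Keep the first `dims` components and re-normalize to unit length."""
    head = vec[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head] if norm else head


def dot(a: list[float], b: list[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def two_stage_search(
    query: list[float],
    docs: dict[str, list[float]],
    dims: int,
    shortlist: int,
    k: int,
) -> list[str]:
    """Shortlist with truncated vectors, then re-rank with full vectors."""
    q_short = truncate(query, dims)
    coarse = sorted(
        docs, key=lambda d: dot(q_short, truncate(docs[d], dims)), reverse=True
    )[:shortlist]
    q_full = truncate(query, len(query))
    return sorted(
        coarse,
        key=lambda d: dot(q_full, truncate(docs[d], len(docs[d]))),
        reverse=True,
    )[:k]
```

In practice the truncated vectors would be stored and indexed ahead of time rather than recomputed per query; the shortlist size trades recall against the cost of the full-precision pass.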

https://x.com/googledevs/status/2051773542513947092

#gemini #embeddings #mrl #vectorsearch #ai

Google for Developers (@googledevs) on X

Matryoshka dolls 🪆 = the key to AI efficiency. Gemini Embedding 2 leverages Matryoshka Representation Learning (MRL) so you can: 🔹 Dynamically truncate vectors for high-speed candidate matching without losing precision 🔹Slash database costs by choosing a smaller storage

X (formerly Twitter)

JAVAPRO May 5

#VirtualThreads aren’t just a #Java hype feature. This article shows them powering agent calls safely in production-style #Microservices—with fallback + observability.

Steal the blueprint by @sibaspadhi: https://javapro.io/2026/01/22/java-25-genai-a-new-era-for-microservices-in-finance/

#SpringBoot #GenAI #Observability #VectorSearch