In case you missed it, here is the recording of the Azure Cosmos DB Conf 2026 session on Chat History and Semantic Caching using the Microsoft Agent Framework.
https://youtu.be/atbRswDKruY?si=U7IF7fJVcrm77qkv
#CosmosDB #AgentFramework #SemanticCaching #ChatHistory
AI Agent Memory: Chat History & Semantic Caching | Lino Tadros | Azure Cosmos DB Conf 2026

New research shows semantic caching can cut LLM inference costs by up to 73%, even when cache hits are misleading. The AdaptiveSemanticCache uses a QueryClassifier and similarity thresholds to decide when to reuse cached responses from a vector_store, dramatically reducing token usage. Curious how this works and how you can apply it to your own models? Read the full breakdown. #SemanticCaching #LLM #VectorStore #EmbeddingModel

🔗 https://aidailypost.com/news/semantic-caching-can-slash-llm-costs-by-73-despite-misleading-cache
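
For readers who want a feel for the threshold idea mentioned in the post, here is a minimal sketch in Python. It is not the AdaptiveSemanticCache or QueryClassifier from the article. The names SimpleSemanticCache, embed, and answer are hypothetical, and the bag-of-words "embedding" is a toy stand-in for a real embedding model, chosen only so the example runs without extra dependencies.

```python
import math
from collections import Counter
from typing import Callable, List, Optional, Tuple


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model (assumption): bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SimpleSemanticCache:
    """Hypothetical threshold-based cache, for illustration only."""

    def __init__(self, threshold: float = 0.8) -> None:
        self.threshold = threshold
        # Each entry pairs a query vector with the response cached for it.
        self.entries: List[Tuple[Counter, str]] = []

    def get(self, query: str) -> Optional[str]:
        """Return the closest cached response if it clears the threshold."""
        q = embed(query)
        best_score, best_response = 0.0, None
        for vec, response in self.entries:
            score = cosine(q, vec)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def put(self, query: str, response: str) -> None:
        """Store a response under the vector of the query that produced it."""
        self.entries.append((embed(query), response))


def answer(cache: SimpleSemanticCache, query: str,
           call_llm: Callable[[str], str]) -> Tuple[str, bool]:
    """Serve from the cache when possible; otherwise call the model and cache it."""
    cached = cache.get(query)
    if cached is not None:
        return cached, True  # cache hit: no model call, no tokens spent
    response = call_llm(query)
    cache.put(query, response)
    return response, False
```

A higher threshold means fewer model calls are skipped but fewer wrong answers are reused. A lower threshold saves more calls at the risk of false-positive hits, which is presumably the trade-off the "misleading cache hits" remark points at.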

Optimize LLM Costs with ScyllaDB Semantic Caching

Reduce latency and costs in large-scale LLM solutions with ScyllaDB semantic caching.

TechLife

Explore how #RetrievalAugmentedGeneration & #SemanticCaching can reduce #FalsePositives in AI-powered apps.

Insights come from a production-grade #CaseStudy testing 1,000 queries across 7 bi-encoder models.

📰 Read now: https://bit.ly/4nTPmso

#AI #LLMs #RAG #VectorDatabases #Infrastructure

Thrilled to get certified on Redis AI today. Great course and certification covering #Hash #JSON #VectorSearch #VectorSimilaritySearch #RedisVL #Euclidean #Cosine #IP #HNSW #FLAT. For unstructured data (PDF, DOCX, XLSX, etc.), it is extremely fast. #SemanticCaching #RedisMemory