
In case you missed it, here is the recording of the Azure Cosmos DB Conf 2026 session on chat history and semantic caching with Microsoft Agent Framework.
https://youtu.be/atbRswDKruY?si=U7IF7fJVcrm77qkv
#CosmosDB #AgentFramework #SemanticCaching #ChatHistory

AI Agent Memory: Chat History & Semantic Caching | Lino Tadros | Azure Cosmos DB Conf 2026

AI Daily Post Jan 10

New research shows semantic caching can cut LLM inference costs by up to 73%, even when cache hits are misleading. The AdaptiveSemanticCache uses a QueryClassifier and similarity thresholds to decide when a new query is close enough to an earlier one to reuse what is already in the vector_store instead of calling the model again, which reduces token usage. Curious how this works and how to apply it to your own models? Read the full breakdown. #SemanticCaching #LLM #VectorStore #EmbeddingModel

🔗 https://aidailypost.com/news/semantic-caching-can-slash-llm-costs-by-73-despite-misleading-cache
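
For readers who want a feel for the mechanism before clicking through, here is a minimal Python sketch of a threshold-based semantic cache. It is not the implementation from the linked article: only the names AdaptiveSemanticCache, QueryClassifier and vector_store come from the post. Everything else is an assumption made for illustration, including the toy bag-of-words embed function (a stand-in for a real embedding model), the keyword-based classifier, the threshold values 0.8 and 0.95, and the fake_llm callback.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class QueryClassifier:
    """Hypothetical classifier: time-sensitive queries are 'volatile'."""

    VOLATILE_WORDS = {"today", "now", "latest", "current"}

    def classify(self, query):
        words = set(query.lower().split())
        return "volatile" if words & self.VOLATILE_WORDS else "stable"


class AdaptiveSemanticCache:
    # Assumed thresholds: volatile queries must match almost exactly.
    THRESHOLDS = {"stable": 0.8, "volatile": 0.95}

    def __init__(self, llm):
        self.llm = llm                  # callable: query -> response
        self.classifier = QueryClassifier()
        self.vector_store = []          # list of (embedding, response) pairs
        self.hits = 0
        self.misses = 0

    def ask(self, query):
        vec = embed(query)
        threshold = self.THRESHOLDS[self.classifier.classify(query)]

        # Linear scan for the most similar cached query.
        best_score, best_response = 0.0, None
        for cached_vec, response in self.vector_store:
            score = cosine(vec, cached_vec)
            if score > best_score:
                best_score, best_response = score, response

        if best_response is not None and best_score >= threshold:
            self.hits += 1
            return best_response        # cache hit: no model call

        self.misses += 1
        response = self.llm(query)      # cache miss: call the model and store
        self.vector_store.append((vec, response))
        return response


def fake_llm(query):
    """Hypothetical model call, used only to make the sketch runnable."""
    return f"answer to: {query}"


cache = AdaptiveSemanticCache(fake_llm)
cache.ask("how do I reset my password")
cache.ask("how do I reset my password please")   # similar enough: hit
cache.ask("what is the weather today")
cache.ask("what is the weather like today")      # volatile, below 0.95: miss
print(f"hits={cache.hits} misses={cache.misses}")  # hits=1 misses=3
```

The stricter threshold for volatile queries is one way to guard against misleading hits: the two weather questions are close in wording, but the sketch still sends the second one to the model because a stale answer would be worse than an extra call. A production version would swap the linear scan for a real vector index and the toy embedding for an actual embedding model.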