Tomas Zezula

My passion is transforming ideas into polished, operational products. I specialise in making the intricate world of technology accessible and manageable. Developer, tech enthusiast, blogger.

In my latest article, I shift focus from feature flags and pricing plans to a challenge many SaaS teams are now facing: how to keep your LLM-powered features fast, affordable, and reliable under load.

I break down why LLMs can quietly destroy your margins and what makes these features especially hard to scale. #SaaS #LLM #SoftwareArchitecture

https://buff.ly/jhPck5j

In my latest blog post, I explore efficient AI data retrieval with RAG and Redis, including a deep dive into an ETL pipeline for weather data. https://buff.ly/3AkeFBa #AI #DataProcessing #ETL #Redis #VectorStore #OpenAI
Retrieval Augmented Generation with Spring AI - Tomas Zezula

In our last post, we looked at enriching the OpenAI model with custom data through function calls. While this technique is useful, it has its limitations and performance trade-offs. Today, we explore a more efficient way of incorporating relevant data into prompts to receive accurate and relevant model responses. Retrieval Augmented Generation, or RAG, relies on preprocessed data that is readily available upon request. In this post, we will build an Extract, Transform, Load (ETL) pipeline that stores a large corpus of weather forecasts and learn how to efficiently retrieve relevant information from a vector store.
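To make the retrieval step concrete, here is a minimal sketch of the idea in plain Python. It is not the Spring AI or Redis code from the post: the `embed` function is a toy word-count stand-in for a real embedding model, and `InMemoryVectorStore`, `build_prompt`, and the sample forecasts are hypothetical names and data chosen only for illustration.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding (assumption): word counts instead of a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryVectorStore:
    """Hypothetical stand-in for a vector store such as Redis."""

    def __init__(self) -> None:
        self._items: list[tuple[str, Counter]] = []

    def add(self, documents: list[str]) -> None:
        # "Load" step of the ETL pipeline: embed once, up front.
        for doc in documents:
            self._items.append((doc, embed(doc)))

    def search(self, query: str, top_k: int = 2) -> list[str]:
        # Rank stored documents by similarity to the query embedding.
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [doc for doc, _ in ranked[:top_k]]


def build_prompt(question: str, context: list[str]) -> str:
    """Place the retrieved documents into the prompt ahead of the question."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"


# Made-up sample forecasts, used only to exercise the sketch.
store = InMemoryVectorStore()
store.add([
    "Prague forecast: light rain, 12 C",
    "Oslo forecast: snow, -3 C",
    "Lisbon forecast: sunny, 24 C",
])

question = "What is the forecast for Oslo?"
hits = store.search(question, top_k=1)
prompt = build_prompt(question, hits)
print(prompt)
```

Because the documents are embedded when they are loaded, answering a question only requires embedding the query and ranking the stored vectors, which is what makes preprocessed data "readily available upon request".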
