Update on my capstone project in the #llmzoomcamp @DataTalksClub:
So I got to know that the Temperature param really needs to be set differently for different functions in a RAG system.
For the main prompt to the LLM, which carries the context(s) from Qdrant along with the user query, the temperature should not be more than 0.2. Well, that is what I got the best results with.
But for prompts meant for query rewriting, the most deterministic results seem to come when it is set to zero.
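
A rough sketch of how the per-function settings could be kept in one place. The names `TEMPERATURES` and `build_request` are hypothetical, not from my project or any library; the returned dict is a generic placeholder, so map it onto whatever your LLM client actually expects. The values are just the ones that worked for me.

```python
# Hypothetical per-task temperature table (values from my own experiments).
TEMPERATURES = {
    "answer": 0.2,         # main prompt: Qdrant context(s) + user query
    "query_rewrite": 0.0,  # query rewriting: as deterministic as possible
}


def build_request(task: str, prompt: str) -> dict:
    """Return generic request parameters for the given RAG task.

    The dict shape is an assumption for illustration only; adapt the keys
    to the client library you use.
    """
    if task not in TEMPERATURES:
        raise ValueError(f"unknown task: {task!r}")
    return {"prompt": prompt, "temperature": TEMPERATURES[task]}


rewrite_params = build_request("query_rewrite", "Rewrite this query: ...")
answer_params = build_request("answer", "Context: ...\n\nQuestion: ...")
```

Keeping the values in one table makes it easy to tune each function separately without hunting through the prompt code.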