Similar Questions in Context and Retrieval (RAG)
Medium
Why might you use a "Cross-Encoder" re-ranker after your initial vector retrieval? What is the trade-off in terms of latency?
View
Easy
If the retriever returns zero relevant documents, how do you prevent the LLM from making up an answer?
View
Medium
If a user asks a vague question like "Tell me more about that," how do you ensure the retriever finds relevant documents from past conversation history?
View