Similar Questions in Context and Retrieval (RAG)
Medium
Why might you use a "Cross-Encoder" re-ranker after your initial vector retrieval? What is the trade-off in terms of latency?
View
Easy
If the retriever returns zero relevant documents, how do you prevent the LLM from making up an answer?
View
Hard
How would you adjust your pipeline if the source documents contained both text and complex financial charts/tables?
View