Similar Questions in Context and Retrieval (RAG)
Medium
Why might you use a "Cross-Encoder" re-ranker after your initial vector retrieval? What is the trade-off in terms of latency?
View
Easy
If a company has a 1,000-page internal wiki that updates daily, would you recommend RAG or fine-tuning? Why?
View
Medium
How would you design a cache that returns a result even if the user's question isn't a 100% string match to a previous query?
View