Context and Retrieval (RAG)Medium

Why might you use a "Cross-Encoder" re-ranker after your initial vector retrieval? What is the trade-off in terms of latency?

Practice Your Response