ML//RAG//reranker

- Second-pass model that re-scores retrieved documents for query relevance.


Second-pass model that re-scores retrieved documents for query relevance.

First pass (BM25 or embeddings) retrieves candidates cheaply; reranker uses cross-attention to evaluate each pair.

Much more accurate than bi-encoder similarity but too slow for full-index search — hence the two-stage pipeline.