Reranking is a two-stage retrieval approach where an initial fast retrieval returns candidate documents, which are then reordered by a more accurate but slower model. The reranker examines query-document pairs together for deeper relevance assessment.
This combines the efficiency of first-stage retrieval with the accuracy of cross-attention models.