Cross-Encoder is a neural model architecture that processes query-document pairs together through a transformer to produce a relevance score. Unlike bi-encoders that embed query and documents separately, cross-encoders attend to both simultaneously, enabling more accurate relevance judgments.
Cross-encoders are commonly used as rerankers in retrieval pipelines, taking initial candidates from faster methods (BM25, bi-encoder) and reordering them for higher precision.