jina-reranker-v3: Last but Not Late Interaction for Document Reranking
2509.25085v2
cs.CL, cs.AI, cs.IR, 68T50, I.2.7
2025-10-02
Авторы:
Feng Wang, Yuqing Li, Han Xiao
Abstract
jina-reranker-v3 is a 0.6B parameter multilingual document reranker that
introduces a novel last but not late interaction. Unlike late interaction
models such as ColBERT that perform separate encoding followed by multi-vector
matching, our approach conducts causal self-attention between query and
documents within the same context window, enabling rich cross-document
interactions before extracting contextual embeddings from the last token of
each document. This compact architecture achieves state-of-the-art BEIR
performance with 61.94 nDCG@10 while being significant smaller than generative
listwise rerankers.