Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CROSS-JEM: Accurate and Efficient Cross-encoders for Short-text Ranking Tasks (2409.09795v1)

Published 15 Sep 2024 in cs.IR

Abstract: Ranking a set of items based on their relevance to a given query is a core problem in search and recommendation. Transformer-based ranking models are the state-of-the-art approaches for such tasks, but they score each query-item independently, ignoring the joint context of other relevant items. This leads to sub-optimal ranking accuracy and high computational costs. In response, we propose Cross-encoders with Joint Efficient Modeling (CROSS-JEM), a novel ranking approach that enables transformer-based models to jointly score multiple items for a query, maximizing parameter utilization. CROSS-JEM leverages (a) redundancies and token overlaps to jointly score multiple items, that are typically short-text phrases arising in search and recommendations, and (b) a novel training objective that models ranking probabilities. CROSS-JEM achieves state-of-the-art accuracy and over 4x lower ranking latency over standard cross-encoders. Our contributions are threefold: (i) we highlight the gap between the ranking application's need for scoring thousands of items per query and the limited capabilities of current cross-encoders; (ii) we introduce CROSS-JEM for joint efficient scoring of multiple items per query; and (iii) we demonstrate state-of-the-art accuracy on standard public datasets and a proprietary dataset. CROSS-JEM opens up new directions for designing tailored early-attention-based ranking models that incorporate strict production constraints such as item multiplicity and latency.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Bhawna Paliwal (3 papers)
  2. Deepak Saini (8 papers)
  3. Mudit Dhawan (3 papers)
  4. Siddarth Asokan (5 papers)
  5. Nagarajan Natarajan (25 papers)
  6. Surbhi Aggarwal (1 paper)
  7. Pankaj Malhotra (22 papers)
  8. Jian Jiao (44 papers)
  9. Manik Varma (16 papers)
X Twitter Logo Streamline Icon: https://streamlinehq.com