Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Retrieval & Interaction Machine for Tabular Data Prediction (2108.05252v1)

Published 11 Aug 2021 in cs.IR

Abstract: Prediction over tabular data is an essential task in many data science applications such as recommender systems, online advertising, medical treatment, etc. Tabular data is structured into rows and columns, with each row as a data sample and each column as a feature attribute. Both the columns and rows of the tabular data carry useful patterns that could improve the model prediction performance. However, most existing models focus on the cross-column patterns yet overlook the cross-row patterns as they deal with single samples independently. In this work, we propose a general learning framework named Retrieval & Interaction Machine (RIM) that fully exploits both cross-row and cross-column patterns among tabular data. Specifically, RIM first leverages search engine techniques to efficiently retrieve useful rows of the table to assist the label prediction of the target row, then uses feature interaction networks to capture the cross-column patterns among the target row and the retrieved rows so as to make the final label prediction. We conduct extensive experiments on 11 datasets of three important tasks, i.e., CTR prediction (classification), top-n recommendation (ranking) and rating prediction (regression). Experimental results show that RIM achieves significant improvements over the state-of-the-art and various baselines, demonstrating the superiority and efficacy of RIM.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Jiarui Qin (24 papers)
  2. Weinan Zhang (322 papers)
  3. Rong Su (58 papers)
  4. Zhirong Liu (22 papers)
  5. Weiwen Liu (59 papers)
  6. Ruiming Tang (171 papers)
  7. Xiuqiang He (97 papers)
  8. Yong Yu (219 papers)
Citations (42)

Summary

We haven't generated a summary for this paper yet.