Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Semantic Product Search for Matching Structured Product Catalogs in E-Commerce (2008.08180v1)

Published 18 Aug 2020 in cs.IR

Abstract: Retrieving all semantically relevant products from the product catalog is an important problem in E-commerce. Compared to web documents, product catalogs are more structured and sparse due to multi-instance fields that encode heterogeneous aspects of products (e.g. brand name and product dimensions). In this paper, we propose a new semantic product search algorithm that learns to represent and aggregate multi-instance fields into a document representation using state of the art transformers as encoders. Our experiments investigate two aspects of the proposed approach: (1) effectiveness of field representations and structured matching; (2) effectiveness of adding lexical features to semantic search. After training our models using user click logs from a well-known E-commerce platform, we show that our results provide useful insights for improving product search. Lastly, we present a detailed error analysis to show which types of queries benefited the most by fielded representations and structured matching.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jason Ingyu Choi (8 papers)
  2. Surya Kallumadi (15 papers)
  3. Bhaskar Mitra (78 papers)
  4. Eugene Agichtein (33 papers)
  5. Faizan Javed (11 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.