Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

JPAVE: A Generation and Classification-based Model for Joint Product Attribute Prediction and Value Extraction (2311.04196v1)

Published 7 Nov 2023 in cs.CL and cs.AI

Abstract: Product attribute value extraction is an important task in e-Commerce which can help several downstream applications such as product search and recommendation. Most previous models handle this task using sequence labeling or question answering method which rely on the sequential position information of values in the product text and are vulnerable to data discrepancy between training and testing. This limits their generalization ability to real-world scenario in which each product can have multiple descriptions across various shopping platforms with different composition of text and style. They also have limited zero-shot ability to new values. In this paper, we propose a multi-task learning model with value generation/classification and attribute prediction called JPAVE to predict values without the necessity of position information of values in the text. Furthermore, the copy mechanism in value generator and the value attention module in value classifier help our model address the data discrepancy issue by only focusing on the relevant part of input text and ignoring other information which causes the discrepancy issue such as sentence structure in the text. Besides, two variants of our model are designed for open-world and closed-world scenarios. In addition, copy mechanism introduced in the first variant based on value generation can improve its zero-shot ability for identifying unseen values. Experimental results on a public dataset demonstrate the superiority of our model compared with strong baselines and its generalization ability of predicting new values.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Zhongfen Deng (13 papers)
  2. Hao Peng (291 papers)
  3. Tao Zhang (481 papers)
  4. Shuaiqi Liu (12 papers)
  5. Wenting Zhao (44 papers)
  6. Yibo Wang (111 papers)
  7. Philip S. Yu (592 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.