EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce (2308.06966v2)

Published 14 Aug 2023 in cs.CL

Abstract: Recently, instruction-following LLMs , represented by ChatGPT, have exhibited exceptional performance in general NLP tasks. However, the unique characteristics of E-commerce data pose significant challenges to general LLMs. An LLM tailored specifically for E-commerce scenarios, possessing robust cross-dataset/task generalization capabilities, is a pressing necessity. To solve this issue, in this work, we proposed the first e-commerce instruction dataset EcomInstruct, with a total of 2.5 million instruction data. EcomInstruct scales up the data size and task diversity by constructing atomic tasks with E-commerce basic data types, such as product information, user reviews. Atomic tasks are defined as intermediate tasks implicitly involved in solving a final task, which we also call Chain-of-Task tasks. We developed EcomGPT with different parameter scales by training the backbone model BLOOMZ with the EcomInstruct. Benefiting from the fundamental semantic understanding capabilities acquired from the Chain-of-Task tasks, EcomGPT exhibits excellent zero-shot generalization capabilities. Extensive experiments and human evaluations demonstrate that EcomGPT outperforms ChatGPT in term of cross-dataset/task generalization on E-commerce tasks.

Authors (9)

Yangning Li (49 papers)
Shirong Ma (23 papers)
Xiaobin Wang (39 papers)
Shen Huang (25 papers)
Chengyue Jiang (11 papers)
Hai-Tao Zheng (94 papers)
Pengjun Xie (85 papers)
Fei Huang (409 papers)
Yong Jiang (194 papers)

Citations (36)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

YouTube

Show All Videos

EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce (2308.06966v2)

Summary

Related Papers

YouTube