
Bridging Language and Items for Retrieval and Recommendation (2403.03952v1)

Published 6 Mar 2024 in cs.IR

Abstract: This paper introduces BLaIR, a series of pretrained sentence embedding models specialized for recommendation scenarios. BLaIR is trained to learn correlations between item metadata and potential natural language context, which is useful for retrieving and recommending items. To pretrain BLaIR, we collect Amazon Reviews 2023, a new dataset comprising over 570 million reviews and 48 million items from 33 categories, significantly expanding beyond the scope of previous versions. We evaluate the generalization ability of BLaIR across multiple domains and tasks, including a new task named complex product search, referring to retrieving relevant items given long, complex natural language contexts. Leveraging LLMs like ChatGPT, we correspondingly construct a semi-synthetic evaluation set, Amazon-C4. Empirical results on the new task, as well as conventional retrieval and recommendation tasks, demonstrate that BLaIR exhibits strong text and item representation capacity. Our datasets, code, and checkpoints are available at: https://github.com/hyp1231/AmazonReviews2023.


Summary

  • The paper introduces BLaIR, a series of pretrained sentence embedding models that learn correlations between item metadata and natural language context for retrieval and recommendation.
  • BLaIR is pretrained with a contrastive objective on the newly collected Amazon Reviews 2023 dataset (over 570 million reviews, 48 million items, 33 categories), enabling tasks such as complex product search.
  • Empirical results on conventional retrieval and recommendation tasks, and on the new Amazon-C4 complex product search benchmark, show consistent improvements over existing methods.

Bridging Language and Items for Retrieval and Recommendation

Introduction to BLaIR

Recent advances in LLMs have amplified interest in exploiting their capabilities for recommendation systems. A significant challenge, however, lies in connecting these models to item catalogs that often number in the millions without extensive retraining or complex engineering. Addressing this challenge, we introduce BLaIR (Bridging Language and Items for Retrieval and Recommendation), a series of pretrained sentence embedding models specialized for recommendation scenarios. BLaIR learns the correlations between item metadata and the natural language contexts in which items are discussed, a capability central to retrieval and recommendation tasks.

The Amazon Reviews 2023 Dataset

A cornerstone of our research is the newly curated Amazon Reviews 2023 dataset, which substantially expands on its predecessors in both scope and richness. With over 570 million reviews spanning 33 categories and linked to 48 million items, the dataset provides a broad foundation for training and evaluating recommendation models. Compared to the 2018 version, it adds finer-grained timestamps for temporal recommendation tasks, cleaner and richer item metadata, and a large expansion in item categories and user reviews.
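
For concreteness, a per-category slice of the reviews could be streamed with the Hugging Face `datasets` library; the hub id, config, and split names below are assumptions for illustration, and the linked repository documents the actual identifiers and loading instructions.

```python
# Minimal sketch: stream one category of reviews with the Hugging Face `datasets` library.
# The hub id, config, and split names are assumptions for illustration; see the linked
# GitHub repository for the actual access paths.
from datasets import load_dataset

reviews = load_dataset(
    "McAuley-Lab/Amazon-Reviews-2023",   # assumed hub id
    "raw_review_All_Beauty",             # assumed per-category config
    split="full",                        # assumed split name
    streaming=True,                      # avoids materializing 570M+ reviews locally
    trust_remote_code=True,
)

for i, record in enumerate(reviews):
    # Each record pairs free-form review text with the item it describes.
    print(record.get("parent_asin"), record.get("rating"), str(record.get("text"))[:80])
    if i == 2:
        break
```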

Architecture and Training Objective

At its core, BLaIR uses a contrastive learning objective to embed item metadata and user reviews into a shared embedding space, so that an item's metadata lies close to the natural language contexts that discuss it. Because reviews supply rich, naturally occurring language about items, this pretraining tailors the embeddings to the specific requirements of the recommendation domain.
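
As a concrete illustration of this kind of objective (not the paper's exact recipe), the sketch below pairs each review with its item's metadata text and applies a symmetric InfoNCE-style loss with in-batch negatives; the base encoder, mean pooling, and temperature are assumptions made for the example.

```python
# Sketch of an InfoNCE-style contrastive objective over (review, item metadata) pairs
# with in-batch negatives. The base encoder, pooling, and temperature are illustrative
# assumptions rather than the paper's exact training recipe.
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
encoder = AutoModel.from_pretrained("roberta-base")

def embed(texts):
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    hidden = encoder(**batch).last_hidden_state            # (B, T, H)
    mask = batch["attention_mask"].unsqueeze(-1).float()   # (B, T, 1)
    pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1)  # mean over non-padding tokens
    return F.normalize(pooled, dim=-1)

def contrastive_loss(review_texts, metadata_texts, temperature=0.05):
    # Row i of each list describes the same item; all other rows act as negatives.
    c = embed(review_texts)
    m = embed(metadata_texts)
    logits = c @ m.t() / temperature                       # (B, B) cosine similarities
    labels = torch.arange(len(review_texts))
    # Symmetric loss: match reviews to metadata and metadata to reviews.
    return (F.cross_entropy(logits, labels) + F.cross_entropy(logits.t(), labels)) / 2

loss = contrastive_loss(
    ["Great waterproof boots, kept my feet dry on a rainy hike.",
     "The blender is loud but crushes ice in seconds."],
    ["Men's Waterproof Hiking Boot, leather upper, rubber sole.",
     "700W Countertop Blender with stainless-steel blades."],
)
```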

Evaluating BLaIR's Performance

Our experiments across multiple domains and tasks show that BLaIR is both effective and versatile. In particular, we introduce a new task, complex product search, which reflects real-world scenarios where a query is a long, detailed natural language description of what the user needs. Leveraging LLMs such as ChatGPT, we construct a semi-synthetic evaluation set, Amazon-C4, to benchmark models on this task. The empirical results show marked improvements over existing methods on complex product search as well as on conventional retrieval and recommendation tasks.
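
At inference time, complex product search can be framed as embedding the long natural-language query and ranking items by the similarity of their metadata embeddings. A minimal sketch is shown below; the checkpoint id and pooling choice are assumptions, and the linked repository lists the actually released BLaIR checkpoints.

```python
# Sketch of complex product search: embed a long query and rank item metadata by
# cosine similarity. The checkpoint id and first-token pooling are assumptions.
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

model_id = "hyp1231/blair-roberta-base"   # assumed checkpoint id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

def encode(texts):
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        emb = model(**batch).last_hidden_state[:, 0]      # first-token ([CLS]/<s>) pooling
    return F.normalize(emb, dim=-1)

query = ("I need quiet over-ear headphones for long flights that stay comfortable "
         "with glasses and last at least 20 hours on a charge.")
item_metadata = [
    "Noise cancelling over-ear headphones, 30-hour battery, plush ear cushions.",
    "Wired in-ear earbuds with inline microphone and tangle-free cable.",
    "Portable Bluetooth speaker, waterproof, 12-hour playtime.",
]

scores = (encode([query]) @ encode(item_metadata).t()).squeeze(0)
for idx in scores.argsort(descending=True):
    i = int(idx)
    print(f"{scores[i].item():.3f}  {item_metadata[i]}")
```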

Future Directions

The emergence of BLaIR opens several avenues for future research in AI and recommendation systems. The model's ability to generalize across tasks and domains suggests potential for further exploration into other language-heavy recommendation scenarios. Moreover, the Amazon Reviews 2023 dataset itself, with its unprecedented scale and depth, offers a fertile ground for advancing research in recommendation systems and natural language processing.

Concluding Remarks

In conclusion, BLaIR represents a significant step toward aligning the capabilities of LLMs with the demands of modern recommendation systems. By pretraining on the large and rich Amazon Reviews 2023 dataset, BLaIR sets a strong new baseline for integrating language and item data in the recommendation domain. The methodologies, datasets, and checkpoints released with this work provide a foundation for the next generation of language-driven recommendation systems.
