QUICK REVIEW

[論文レビュー] Amazon-M2: A Multilingual Multi-locale Shopping Session Dataset for Recommendation and Text Generation

Wei Jin, Haitao Mao|arXiv (Cornell University)|Jul 19, 2023

Recommender Systems and Techniques被引用数 10

ひとこと要約

Amazon-M2 は大規模な多言語・多ロケールのショッピングセッションデータセットを導入し、リッチなテキスト商品属性を提供し、3つのタスク：次の商品推薦、ドメインシフト転移学習、次の商品タイトル生成を可能にします。

ABSTRACT

Modeling customer shopping intentions is a crucial task for e-commerce, as it directly impacts user experience and engagement. Thus, accurately understanding customer preferences is essential for providing personalized recommendations. Session-based recommendation, which utilizes customer session data to predict their next interaction, has become increasingly popular. However, existing session datasets have limitations in terms of item attributes, user diversity, and dataset scale. As a result, they cannot comprehensively capture the spectrum of user behaviors and preferences. To bridge this gap, we present the Amazon Multilingual Multi-locale Shopping Session Dataset, namely Amazon-M2. It is the first multilingual dataset consisting of millions of user sessions from six different locales, where the major languages of products are English, German, Japanese, French, Italian, and Spanish. Remarkably, the dataset can help us enhance personalization and understanding of user preferences, which can benefit various existing tasks as well as enable new tasks. To test the potential of the dataset, we introduce three tasks in this work: (1) next-product recommendation, (2) next-product recommendation with domain shifts, and (3) next-product title generation. With the above tasks, we benchmark a range of algorithms on our proposed dataset, drawing new insights for further research and practice. In addition, based on the proposed dataset and tasks, we hosted a competition in the KDD CUP 2023 and have attracted thousands of users and submissions. The winning solutions and the associated workshop can be accessed at our website https://kddcup23.github.io/.

研究の動機と目的

リッチな商品属性を備えた大規模な多言語セッションデータセットを提供し、セッションベースの推奨研究を前進させる。
データの豊富さが異なるロケール間の転移学習を評価するために、ドメインシフトの下での評価を可能にする。
テキスト属性を活用したテキスト生成タスク（次の商品タイトル生成）を導入・ベンチマークする。

提案手法

英国（UK）、日本（JP）、ドイツ（DE）、スペイン（ES）、フランス（FR）、イタリア（IT）の6つのロケールから匿名化されたユーザーセッションを収集し、整列した多言語テキスト商品属性を付与する。
セッションに同梱される商品属性表を提供し、タイトル、価格、ブランド、説明などを含む。
3つのタスクを定義する。 (1) 同一ロケール/ドメイン内の次商品推薦、(2) クロスロケール転移によるドメインシフト下の次商品推薦、(3) テキスト属性を用いた次商品タイトル生成。
Task 1 の基準として、ベースライン群（GRU4REC++, NARM, STAMP, SRGNN, CORE, MGS）と人気ヒューリスティックをベンチマークする。
Task 2 を、教師あり学習および事前学習+ファインチューニングのパラダイム（テキスト強化バリアントを含む）で評価する。
Task 3 では、タイトルに対して mT5 モデルをファインチューニングし、Last Product Title ベースラインと比較する。

実験結果

リサーチクエスチョン

RQ1多言語・多ロケールのショッピングセッションデータセットは、セッションベースのレコメンダーシステムの評価と開発にどのように影響するか。
RQ2Task 2 において、ロケール間の転移学習はデータ不足のロケールを緩和できるか。
RQ3テキスト商品属性は、多言語設定での推薦とタイトル生成タスクをどれだけ可能に・改善できるか。

主な発見

モデル	MRR@100_UK	MRR@100_DE	MRR@100_JP	MRR@100_Overall	Recall@100_UK	Recall@100_DE	Recall@100_JP	Recall@100_Overall
Popularity	0.2302	0.2259	0.2766	0.2426	0.4047	0.4065	0.4683	0.4243
GRU4Rec++	0.1656	0.1578	0.2080	0.1757	0.3665	0.3627	0.4185	0.3808
NARM	0.1801	0.1716	0.2255	0.1908	0.4021	0.4016	0.4559	0.4180
STAMP	0.2050	0.1955	0.2521	0.2159	0.3370	0.3320	0.3900	0.3512
SRGNN	0.1841	0.1767	0.2282	0.1948	0.3882	0.3862	0.4403	0.4032
CORE	0.1510	0.1510	0.1846	0.1609	0.5591	0.5525	0.5898	0.5591
MGS	0.1668	0.1739	0.2376	0.1907	0.5641	0.5479	0.4677	0.5194

Amazon-M2 は六つのロケールにまたがる 1.41M 個の製品と 3.61M のトレーニングセッションを持ち、多言語のテキスト属性を含む大規模データセットである。
Task 1 では、単純な人気ベースラインが多くの場合深層モデルよりも優れており、MRR@100 および Recall@100 で強い人気バイアスと大規模なカタログの課題を浮き彫りにする。
Task 2 では、事前学習+ファインチューニングは、データ不足のロケールに対して、教師あり学習よりも一般的に MR R@100 および Recall@100 を改善し、ドメインシフト下の転移学習の利点を示す。
Task 3 では、Last Product Title ヒューリスティックがファインチューニング済みの mT5 モデルを上回り、事前学習データと商品タイトルとの間のドメイン特化型言語モデリングの課題と、より良い整合性の必要性を示唆する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。