QUICK REVIEW

[論文レビュー] Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users

Shijun Li, Wenqiang Lei|arXiv (Cornell University)|May 23, 2020

Advanced Bandit Algorithms Research参考文献 55被引用数 33

ひとこと要約

本論文はConTSを提案する。ConTSは Conversational Thompson Sampling フレームワークで、属性の問合せとアイテムの推奨を単一のアーム空間に統合し、cold-start 会話型推奨における探索-利用を最適化する。いくつかのデータセットで最先端の CRS 手法を上回る。

ABSTRACT

Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recommendation actions homogeneously. Specifically, they only consider the items as the arms, being incapable of handling the item attributes, which naturally provide interpretable information of user's current demands and can effectively filter out undesired items. In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively. This important scenario was studied in a recent work. However, it employs a hand-crafted function to decide when to ask attributes or make recommendations. Such separate modeling of attributes and items makes the effectiveness of the system highly rely on the choice of the hand-crafted function, thus introducing fragility to the system. To address this limitation, we seamlessly unify attributes and items in the same arm space and achieve their EE trade-offs automatically using the framework of Thompson Sampling. Our Conversational Thompson Sampling (ConTS) model holistically solves all questions in conversational recommendation by choosing the arm with the maximal reward to play. Extensive experiments on three benchmark datasets show that ConTS outperforms the state-of-the-art methods Conversational UCB (ConUCB) and Estimation-Action-Reflection model in both metrics of success rate and average number of conversation turns.

研究の動機と目的

cold-start 会話型推奨を解決するため、インタラクティブな属性問合せを通じてリアルタイムのパーソナライズを実現する。
属性とアイテムを単一のアーム空間で統一し、意思決定を単純化し頑健性を向上させる。
Contextual Thompson Sampling を活用して探索と利用のバランスを自然にとる。

提案手法

属性とアイテムを同じアーム空間の区別しないアームとしてモデル化し、統一報酬を用いてアームを選択する。
既存ユーザーからの初期表現を用いてユーザー埋め込みを初期化し、対話中に事後パラメータを更新する。
Contextual Thompson Sampling を用いてユーザー埋め込みをサンプリングし、報酬が最大となるアームを選ぶ。
アームの報酬をユーザー-アーム適合度と属性適合性の組み合わせとして定義し、問合せと推奨の両方の行動を導く。
属性とアイテムのオールアームを共有埋め込み空間に配置するために、Bayesian Personalized Ranking を用いて属性とアイテムのオフライン FM 埋め込みを訓練する。

実験結果

リサーチクエスチョン

RQ1統一アーム空間アプローチは cold-start ユーザーに対する属性問合せとアイテム推奨のバランスを効果的に取れるか？
RQ2Contextual Thompson Sampling は属性-アイテム統合を伴う CRS で自然な探索-利用のバランスを提供するか？
RQ3ConTS は既存の CRS 手法（ConUCB、EAR）と成功率および対話効率でどう比較されるか？
RQ4属性ベースのフィードバックがユーザー埋め込みとアーム報酬の更新に与える影響は何か？

主な発見

ConTS は最先端の CRS 手法 ConUCB および EAR を、 cold-start ユーザーの成功率と平均対話ターン数の両方で上回る。
属性とアイテムを単一のアーム空間でモデル化することで意思決定が単純化され、問合せと推奨を分けるための手作りのタイミングルールが不要になる。
Contextual Thompson Sampling は事後サンプリングと更新を通じて自然な探索-利用のバランスを提供する。
Yelp、LastFM、および新しい Kuaishou データセットの実験は、異なるドメインでの頑健性を示す。
更新機構はユーザーフィードバックと既知の好み属性を取り込み、アーム報酬とユーザー埋め込みを洗練させる。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。