QUICK REVIEW

[論文レビュー] Preference-based Graphic Models for Collaborative Filtering

Rong Jin, Luo Si|arXiv (Cornell University)|Oct 19, 2012

Recommender Systems and Techniques参考文献 12被引用数 68

ひとこと要約

本稿では、ユーザーの好みとレーティング行動を明示的に区別することで、協調フィルタリングを改善する2つの新しい確率的グラフィカルモデル—分離型モデルと好みモデル—を提案する。分離型モデルは好みとレーティングを分離し、2つの映画レーティングデータセットにおいて5つの既存手法を大きく上回る性能を発揮し、ユーザー固有のレーティングバイアスをよりよく捉えている。

ABSTRACT

Collaborative filtering is a very useful general technique for exploiting the preference patterns of a group of users to predict the utility of items to a particular user. Previous research has studied several probabilistic graphic models for collaborative filtering with promising results. However, while these models have succeeded in capturing the similarity among users and items in one way or the other, none of them has considered the fact that users with similar interests in items can have very different rating patterns; some users tend to assign a higher rating to all items than other users. In this paper, we propose and study of two new graphic models that address the distinction between user preferences and ratings. In one model, called the decoupled model, we introduce two different variables to decouple a users preferences FROM his ratings. IN the other, called the preference model, we model the orderings OF items preferred BY a USER, rather than the USERs numerical ratings of items. Empirical study over two datasets of movie ratings shows that appropriate modeling of the distinction between user preferences and ratings improves the performance substantially and consistently. Specifically, the proposed decoupled model outperforms all five existing approaches that we compare with significantly, but the preference model is not very successful. These results suggest that explicit modeling of the underlying user preferences is very important for collaborative filtering, but we can not afford ignoring the rating information completely.

研究の動機と目的

既存の協調フィルタリングモデルがユーザーの好みとレーティング行動を区別できないという限界を解決すること。
数値的レーティングとは独立してユーザーの好みをモデル化することで、予測精度を向上させること。
明示的に好みの順序をモデル化することで、協調フィルタリングのパフォーマンスが向上するかどうかを調査すること。
好みとレーティングを分離することの推薦システムの有効性に与える影響を評価すること。
主観的な好みと客観的なレーティングパターンを分離することで、ユーザー行動のより洗練された理解を提供すること。

提案手法

ユーザーの好みとレーティングバイアスの2つの潜在変数を用いる分離型モデルを提案する。
数値的レーティングではなく、アイテム間の好み順序を用いてユーザーの好みをモデル化する。
ベイジアングラフィカルモデルを用いて、ユーザー、アイテム、およびそれらの潜在的好み・レーティング変数間の依存関係を表現する。
確率的推論を用いて潜在変数を推定し、欠損レーティングを予測する。
2つの実世界の映画レーティングデータセットを用いて、最尤推定法でモデルを学習する。
提案されたモデルの性能を、5つの標準的な協調フィルタリングベースラインと比較する。

実験結果

リサーチクエスチョン

RQ1ユーザーの好みとレーティング行動を分離することで、協調フィルタリングのパフォーマンスが向上するか？
RQ2数値的レーティングではなく好みの順序をモデル化すると、推薦精度にどのような影響を与えるか？
RQ3ユーザー固有のレーティングバイアスを考慮することで、予測品質がどの程度向上するか？
RQ4従来の協調フィルタリング手法とは異なり、好みとレーティングを分離する分離型モデルは、より効果的か？
RQ5理論的には好みの順序を捉えているはずなのに、なぜ好みモデルは性能を発揮しないのか？

主な発見

分離型モデルは、比較対象の5つのベースライン手法すべてを予測精度において顕著に上回っている。
ユーザー固有のレーティングバイアスを明示的にモデル化することで、ユーザー間でのレーティングスケールの系統的差を捉えることができ、パフォーマンスが向上している。
数値的レーティングではなくアイテムの順序をモデル化する好みモデルは、競争力のある性能を達成できていない。
実証的結果から、レーティング情報を完全に無視すると、好みを明示的にモデル化しても性能が低下することが示された。
本研究では、好みモデル化が有益である一方で、レーティングデータが正確な予測に不可欠であることが確認された。
分離型モデルの成功は、従来の協調フィルタリング手法でしばしば無視されがちなレーティングバイアスが、重要な要因である可能性を示唆している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。