QUICK REVIEW

[论文解读] MeLU: Meta-Learned User Preference Estimator for Cold-Start Recommendation

Hoyeop Lee, Jinbae Im|arXiv (Cornell University)|Jul 31, 2019

Recommender Systems and Techniques参考文献 31被引用 53

一句话总结

MeLU 使用基于优化的元学习（MAML）在极少交互的情况下迅速为新用户个性化推荐，并引入一个证据候选选择策略以在冷启动情境中提升初始推荐。

ABSTRACT

This paper proposes a recommender system to alleviate the cold-start problem that can estimate user preferences based on only a small number of items. To identify a user's preference in the cold state, existing recommender systems, such as Netflix, initially provide items to a user; we call those items evidence candidates. Recommendations are then made based on the items selected by the user. Previous recommendation studies have two limitations: (1) the users who consumed a few items have poor recommendations and (2) inadequate evidence candidates are used to identify user preferences. We propose a meta-learning-based recommender system called MeLU to overcome these two limitations. From meta-learning, which can rapidly adopt new task with a few examples, MeLU can estimate new user's preferences with a few consumed items. In addition, we provide an evidence candidate selection strategy that determines distinguishing items for customized preference estimation. We validate MeLU with two benchmark datasets, and the proposed model reduces at least 5.92% mean absolute error than two comparative models on the datasets. We also conduct a user study experiment to verify the evidence selection strategy.

研究动机与目标

在推荐系统中解决用户冷启动问题，当初始交互有限时。
利用元学习为每个用户生成个性化偏好估计器。
提出一个证据候选者选择策略以提升早期推荐质量。
在基准数据集上及通过用户研究验证证据选择的性能。

提出的方法

定义一个将用户与物品内容嵌入并通过多层神经决策网络传递的用户偏好估计器。
采用基于优化的元学习（MAML）框架，使用支持集（他们的物品历史）来迅速为每个新用户自适应模型参数。
在局部更新期间不更新用户/物品嵌入以维持稳定性；仅更新决策层和输出层。
采用两级更新进行训练：使用每个用户的支持集进行局部更新，然后使用查询集在任务之间进行全局更新。
引入一个证据候选选择策略，通过结合基于梯度的区分度（个性化梯度的弗罗贝尼乌斯范数）和物品流行度来对项目评分，以形成前 k 个证据候选者。
在 MovieLens 1M 和 BookCrossing 数据集上，在四种情景（存在/新物品和存在/新用户）使用 MAE 和 nDCG 作为指标评估 MeLU；并与 PPR 与 Wide & Deep 基线进行比较。

实验结果

研究问题

RQ1基于 MAML 的推荐系统是否能够在只有少量交互数据的情况下迅速为新用户实现个性化？
RQ2与基于流行度的候选相比，是否选择性地挑选证据候选能改善新用户的初始推荐？
RQ3在冷启动设置下，MeLU 对用户的物品消费历史长度的变化有多鲁棒？
RQ4学习到的个性化参数是否能在不同数据集和冷启动情景中泛化？

主要发现

MeLU 在 MovieLens 和 BookCrossing 数据集的三个冷启动情景中均优于两个基线模型。
模型可快速适应，单次局部更新后 MAE 有显著改善；后续更新收益递减。
基于个性化梯度和流行度的证据候选选择在用户研究中带来更可靠的候选和更高的用户满意度。
MeLU 即使在历史非常短的情况下也保持强劲性能，证明对历史长度的鲁棒性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。