QUICK REVIEW

[论文解读] Exploring Social Influence for Recommendation - A Probabilistic Generative Model Approach

Mao Ye, Xingjie Liu|arXiv (Cornell University)|Sep 4, 2011

Recommender Systems and Techniques参考文献 24被引用 27

一句话总结

本文提出了一种统一的基于概率的生成模型，通过将潜在用户偏好、物品主题和基于朋友的影响建模为潜在变量，整合了社交影响、协同过滤和基于内容的推荐。采用基于EM的算法与Map-Reduce实现可扩展性，该模型显著提升了推荐准确率——尤其在群体场景中表现更优，表明在签到网络（whrrl.com）中社交影响强于在音乐平台（last.fm）中的影响。

ABSTRACT

In this paper, we propose a probabilistic generative model, called unified model, which naturally unifies the ideas of social influence, collaborative filtering and content-based methods for item recommendation. To address the issue of hidden social influence, we devise new algorithms to learn the model parameters of our proposal based on expectation maximization (EM). In addition to a single-machine version of our EM algorithm, we further devise a parallelized implementation on the Map-Reduce framework to process two large-scale datasets we collect. Moreover, we show that the social influence obtained from our generative models can be used for group recommendation. Finally, we conduct comprehensive experiments using the datasets crawled from last.fm and whrrl.com to validate our ideas. Experimental results show that the generative models with social influence significantly outperform those without incorporating social influence. The unified generative model proposed in this paper obtains the best performance. Moreover, our study on social influence finds that users in whrrl.com are more likely to get influenced by friends than those in last.fm. The experimental results also confirm that our social influence based group recommendation algorithm outperforms the state-of-the-art algorithms for group recommendation.

研究动机与目标

开发一种统一的基于概率的生成模型，将社交影响整合到推荐系统中，超越临时的启发式方法。
通过将社交影响建模为潜在变量，解决隐藏社交影响的挑战——即社交影响在用户-物品交互数据中不可直接观测。
设计可扩展的学习算法（EM与Map-Reduce），用于在真实社交网络的大规模数据集上训练该模型。
展示学习到的社交影响在群体推荐中的实用性，优于现有的聚合策略。
通过定量模型输出，实证比较不同社交网络（last.fm与whrrl.com）中社交影响的强度。

提出的方法

统一模型是一种潜在类别统计混合模型，联合建模用户对潜在主题的偏好、各主题下的物品生成分布以及来自朋友的社交影响。
社交影响被建模为一个潜在变量，表示用户选择某物品是受朋友影响的概率，而非仅基于个人偏好。
使用期望最大化（EM）算法学习模型参数，以处理用户偏好、主题和社交影响等潜在变量。
基于Map-Reduce框架开发了并行化的EM实现，以实现对last.fm与whrrl.com大规模数据集的可扩展训练。
提出一种基于社交影响的群体推荐（SIG）算法，利用个体用户的影响力得分对群体推荐项目进行排序。
模型采用相对排序指标评估群体推荐性能，数值越低表示实际群体访问项目的排名越靠前。

实验结果

研究问题

RQ1如何在基于概率的生成框架中有效将社交影响建模为潜在变量？
RQ2与不包含社交影响的模型相比，引入社交影响能在多大程度上提升推荐准确率？
RQ3在使用模式不同的社交网络中（如音乐平台与签到平台），社交影响的强度有何差异？
RQ4基于生成模型学习到的社交影响是否能超越标准聚合策略，提升群体推荐性能？
RQ5群体规模如何影响社交影响在集体决策中的相对作用？

主要发现

统一的基于概率的生成模型显著优于排除社交影响的模型，证明了显式建模社交效应的价值。
该模型在所有评估基线中表现最佳，证实了将社交影响与协同过滤及基于内容的方法统一的有效性。
whrrl.com用户表现出比last.fm用户更强的社交影响效应，前者平均每位用户拥有9.08位朋友，后者为1.91位，且少数朋友主导了影响。
基于社交影响的群体推荐（SIG）算法在所有群体规模下均优于Average与Least Misery聚合策略，尤其在小群体中提升最为显著。
当主题数量设置为约60时，SIG算法达到最优性能，表明模型表达能力与泛化能力之间达到良好平衡。
对于较大群体，个体朋友的影响减弱，群体共识（基于共享偏好）成为集体决策中的主导因素。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。