QUICK REVIEW

[论文解读] Inductive Matrix Completion Based on Graph Neural Networks

Muhan Zhang, Yixin Chen|arXiv (Cornell University)|Apr 26, 2019

Advanced Graph Neural Networks参考文献 51被引用 31

一句话总结

本文提出了一种新型归纳式矩阵补全模型——基于归纳图的矩阵补全（IGMC），该模型通过在每个（用户，项目）对周围的局部1跳子图上训练图神经网络（GNN），无需依赖辅助信息或完整的评分矩阵，即可学习预测用户-项目评分。IGMC在基准数据集上实现了最先进性能，并展现出强大的迁移能力，在低数据环境下优于归纳方法基线，并能实现对新推荐任务的零样本迁移。

ABSTRACT

We propose an inductive matrix completion model without using side information. By factorizing the (rating) matrix into the product of low-dimensional latent embeddings of rows (users) and columns (items), a majority of existing matrix completion methods are transductive, since the learned embeddings cannot generalize to unseen rows/columns or to new matrices. To make matrix completion inductive, most previous works use content (side information), such as user's age or movie's genre, to make predictions. However, high-quality content is not always available, and can be hard to extract. Under the extreme setting where not any side information is available other than the matrix to complete, can we still learn an inductive matrix completion model? In this paper, we propose an Inductive Graph-based Matrix Completion (IGMC) model to address this problem. IGMC trains a graph neural network (GNN) based purely on 1-hop subgraphs around (user, item) pairs generated from the rating matrix and maps these subgraphs to their corresponding ratings. It achieves highly competitive performance with state-of-the-art transductive baselines. In addition, IGMC is inductive -- it can generalize to users/items unseen during the training (given that their interactions exist), and can even transfer to new tasks. Our transfer learning experiments show that a model trained out of the MovieLens dataset can be directly used to predict Douban movie ratings with surprisingly good performance. Our work demonstrates that: 1) it is possible to train inductive matrix completion models without using side information while achieving similar or better performances than state-of-the-art transductive methods; 2) local graph patterns around a (user, item) pair are effective predictors of the rating this user gives to the item; and 3) Long-range dependencies might not be necessary for modeling recommender systems.

研究动机与目标

开发一种归纳式矩阵补全模型，使其在不依赖辅助信息的情况下，能够泛化到未见过的用户和项目。
探究用户-项目交互图中的局部图模式是否可作为评分预测的有效归纳偏置。
评估在极端数据稀疏性和零样本迁移学习设置下，归纳式矩阵补全的可行性。
在低数据环境下，对比基于归纳GNN的方法与最先进归纳基线方法的性能。

提出的方法

IGMC从评分矩阵构建一个二分图，其中边表示用户-项目交互。
针对每个（用户，项目）对，提取以该对为中心的1跳包围子图，以捕捉局部邻域结构。
使用GNN处理每个子图，将局部图模式编码为节点和边的表示。
模型使用池化层将子图表示聚合为固定大小的向量以进行预测。
应用相邻评分正则化（ARR），以基于局部评分分布鼓励一致的预测。
最终预测通过将多层感知机应用于池化后的子图表示计算得出。

实验结果

研究问题

RQ1是否可以完全不依赖辅助信息，仅依靠用户-项目交互图中的局部图模式，实现归纳式矩阵补全？
RQ2在不同数据稀疏程度下，基于归纳GNN的模型性能与最先进归纳基线方法相比如何？
RQ3在某一数据集上预训练的IGMC模型，能在多大程度上迁移到预测另一未见推荐数据集的评分？
RQ4（用户，项目）对周围的局部图结构是否足以预测评分，还是需要长距离依赖关系？

主要发现

IGMC在MovieLens、Flixster、Douban和YahooMusic等标准基准数据集上，性能与最先进归纳基线方法相当。
在低数据环境（稀疏率 < 20%）下，IGMC优于归纳方法基线，表明其在有限训练数据下具有更强的泛化能力。
该模型展现出强大的迁移学习能力：在MovieLens上预训练的IGMC模型在Flixster上实现RMSE 0.906，Douban上为0.759，YahooMusic上为20.1，优于其他归纳模型及多数微调基线。
即使在评分尺度和分布不同的数据集上，IGMC的性能依然稳健，表明局部图模式提供了有效的归纳偏置。
消融实验确认，池化层和相邻评分正则化显著提升性能，且引入内容信息无法进一步提升模型表现，验证了仅依赖图模式的有效性。
对高分和低分子图的可视化揭示了显著的结构差异（如一致的用户/项目平均评分），支持模型通过局部拓扑结构推断评分的能力。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。