QUICK REVIEW

[论文解读] DISCO: Influence Maximization Meets Network Embedding and Deep Learning

Hui Li, Mengting Xu|arXiv (Cornell University)|Jun 18, 2019

Complex Network Analysis Techniques参考文献 40被引用 23

一句话总结

DISCO 提出了一种新颖的深度学习框架，将网络嵌入与深度强化学习相结合，以估计社交网络中的影响传播，从而避免了昂贵的扩散采样。该方法在影响最大化任务中实现了最先进性能，相较于经典的基于采样的方法，计算效率更高，影响传播质量更优。

ABSTRACT

Since its introduction in 2003, the influence maximization (IM) problem has drawn significant research attention in the literature. The aim of IM is to select a set of k users who can influence the most individuals in the social network. The problem is proven to be NP-hard. A large number of approximate algorithms have been proposed to address this problem. The state-of-the-art algorithms estimate the expected influence of nodes based on sampled diffusion paths. As the number of required samples have been recently proven to be lower bounded by a particular threshold that presets tradeoff between the accuracy and efficiency, the result quality of these traditional solutions is hard to be further improved without sacrificing efficiency. In this paper, we present an orthogonal and novel paradigm to address the IM problem by leveraging deep learning models to estimate the expected influence. Specifically, we present a novel framework called DISCO that incorporates network embedding and deep reinforcement learning techniques to address this problem. Experimental study on real-world networks demonstrates that DISCO achieves the best performance w.r.t efficiency and influence spread quality compared to state-of-the-art classical solutions. Besides, we also show that the learning model exhibits good generality.

研究动机与目标

通过用学习的影响估计模型替代扩散采样，解决影响最大化（IM）中计算成本过高的问题。
开发一种框架，利用深度强化学习与网络嵌入预测节点的期望影响，避免迭代采样。
通过学习一种对种子选择顺序不变的稳定影响映射，实现 k 个种子节点的同时选择。
证明训练好的模型在同构网络之间具有泛化能力，无需重新训练。
与最先进基于采样的 IM 算法相比，实现影响传播质量与计算效率之间更优的权衡。

提出的方法

DISCO 使用网络嵌入将网络拓扑转换为每个节点和子图的低维向量表示。
采用深度强化学习学习一个映射 $\tilde{\sigma}(v,S;\Theta)$，用于在给定部分种子集 $S$ 的情况下预测节点 $v$ 的期望影响。
模型通过基于影响传播的奖励信号进行训练，实现端到端学习，无需真实影响标签。
通过直接利用学习到的函数预测影响，框架避免了重复的随机交错采样（RIS）。
影响预测函数被设计为在种子集更新时保持稳定，从而支持所有 k 个种子的同步选择。
采用多跳邻域聚合机制，捕捉最多四跳内的结构特征，其权重由学习到的注意力参数决定。

实验结果

研究问题

RQ1深度学习模型能否在不依赖昂贵扩散采样的情况下，准确估计社交网络中的期望影响传播？
RQ2当新种子被加入种子集时，学习到的影响预测模型是否保持稳定与一致？
RQ3所提出的框架能否在影响传播质量与效率方面超越最先进基于采样的 IM 算法？
RQ4训练好的模型在不同但结构相似的网络之间具有多大程度的泛化能力？
RQ5是否可行通过基于深度学习的影响预测器，同时选择全部 k 个种子节点？

主要发现

在真实网络上，DISCO 在所有对比方法中实现了最高的影响传播质量，优于最先进基于采样的算法。
该框架通过消除重复扩散采样的需求，展现出卓越的效率，显著降低了计算成本。
学习到的影响预测模型在多个同构网络上泛化良好，对于同类型网络的新实例无需重新训练。
模型预测的稳定性确保了种子选择后节点的相对影响排名基本不变，从而支持同步选择 k 个节点。
理论分析表明，节点对之间影响预测期望差值的上界为 $\sum_{i=1}^{4}\frac{2i|\overline{N(v)}|^{i}}{n^{2}}$，表明对种子集变化具有鲁棒性。
实验结果证实，DISCO 以比基于 RIS 的方法更少的计算资源，实现了 $ (1 - 1/e - \epsilon) $-近似最优影响传播。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。