QUICK REVIEW

[论文解读] Influence Maximization with Semi-Bandit Feedback.

Zheng Wen, Branislav Kveton|arXiv (Cornell University)|May 21, 2016

Complex Network Analysis Techniques被引用 6

一句话总结

本文提出 IMLinUCB，一种计算高效的基于UCB的算法，用于在社交网络中基于半-bandit反馈的在线影响最大化问题。该算法实现了依赖于网络拓扑和边概率的多项式后悔界，为独立级联模型下的组合半-bandit影响最大化提供了首个此类理论保证。

ABSTRACT

We study the online influence maximization problem in social networks under the independent cascade model. Specifically, we aim to learn the set of influencers in a social network online while repeatedly interacting with it. We address the challenges of (i) combinatorial action space, since the number of feasible influencer sets grows exponentially with the maximum number of influencers, and (ii) limited feedback, since only the influenced portion of the network is observed. Under a stochastic semi-bandit feedback, we propose and analyze IMLinUCB, a computationally efficient UCB-based algorithm. Our bounds on the cumulative regret are polynomial in all quantities of interest, achieve near-optimal dependence on the number of interactions and reflect the topology of the network and the activation probabilities of its edges, thereby giving insights on the problem complexity. To the best of our knowledge, these are the first such results. Our experiments show that in several representative graph topologies, the regret of IMLinUCB scales as suggested by our upper bounds. IMLinUCB permits linear generalization and thus is both statistically and computationally suitable for large-scale problems. Our experiments also show that IMLinUCB with linear generalization can lead to low regret in real-world online influence maximization.

研究动机与目标

解决具有组合动作空间和有限反馈的社交网络中的在线影响最大化问题。
开发一种计算高效的算法，以处理在部分观测下的指数级规模影响者集合。
提供反映网络结构和边激活概率的理论后悔界。
实现线性泛化，以在大规模影响最大化问题中实现可扩展性。

提出的方法

本文提出 IMLinUCB，一种专为影响最大化中半-bandit 反馈设计的基于UCB的算法。
它使用线性泛化来建模影响概率，从而实现对大规模网络的可扩展性。
该算法通过维护影响传播估计的置信上界，以平衡探索与利用。
它通过在每次干预后仅观察被影响的节点，利用半-bandit 反馈。
该方法将网络拓扑和边激活概率纳入后悔分析。
理论分析推导出依赖于网络结构和交互次数的多项式后悔界。

实验结果

研究问题

RQ1如何在具有组合动作空间的在线环境中高效求解影响最大化问题？
RQ2在独立级联模型下，半-bandit 反馈中可实现的后悔界是什么？
RQ3网络拓扑和边激活概率如何影响在线影响最大化问题的复杂度？
RQ4线性泛化能否与bandit算法有效结合，以实现对大规模网络的可扩展性？

主要发现

IMLinUCB 在所有相关参数（包括交互次数和网络规模）上均实现了多项式后悔界。
后悔界反映了网络的拓扑结构和边的激活概率，捕捉了问题特有的复杂性。
该算法在多种代表性图拓扑上的后悔表现与理论上的上界一致。
IMLinUCB 结合线性泛化在真实世界的影响最大化场景中实现了低后悔。
所提出的方法在统计上可靠且计算高效，能够实现大规模网络的可扩展性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。