QUICK REVIEW

[论文解读] Budget Pacing in Repeated Auctions: Regret and Efficiency Without Convergence

Jason Gaitonde, Yingkai Li|arXiv (Cornell University)|May 18, 2022

Auction Theory and Applications被引用 8

一句话总结

该论文提出了一种基于梯度的预算配给算法，用于重复拍卖，该算法在无需收敛至均衡的情况下，实现了至少最优液态福利的一半，同时为个体代理提供了动态后悔边界。该方法通过在线学习自适应调整出价，确保在包括第一价格和第二价格格式在内的广泛拍卖类型中，实现强聚合效率与个体表现保证，即使在相关估值和bandit反馈下亦成立。

ABSTRACT

Online advertising via auctions increasingly dominates the marketing landscape. A typical advertiser may participate in thousands of auctions each day with bids tailored to a variety of signals about user demographics and intent. These auctions are strategically linked through a global budget constraint. To help address the difficulty of bidding, many major online platforms now provide automated budget management via a flexible approach called budget pacing: rather than bidding directly, an advertiser specifies a global budget target and a maximum willingness-to-pay for different types of advertising opportunities. The specified maximums are then scaled down (or "paced") by a multiplier so that the realized total spend matches the target budget. These automated bidders are now near-universally adopted across all mature advertising platforms, raising pressing questions about market outcomes that arise when advertisers use budget pacing simultaneously. In this paper we study the aggregate welfare and individual regret guarantees of dynamic pacing algorithms in repeated auctions with budgets. We show that when agents simultaneously use a natural form of gradient-based pacing, the liquid welfare obtained over the course of the dynamics is at least half the optimal liquid welfare obtainable by any allocation rule, matching the best possible bound for static auctions even in pure Nash equilibria [Aggarwal et al., WINE 2019; Babaioff et al., ITCS 2021]. In contrast to prior work, these results hold without requiring convergence of the dynamics, circumventing known computational obstacles of finding equilibria [Chen et al., EC 2021]. Our result is robust to the correlation structure among agents' valuations and holds for any core auction, a broad class that includes first-price, second-price, and GSP auctions. We complement the aggregate guarantees by showing that an agent using such pacing algorithms achieves an O(T^{3/4}) regret relative to the value obtained by the best fixed pacing multiplier in hindsight in stochastic bidding environments. Compared to past work, this result applies to more general auctions and extends to adversarial settings with respect to dynamic regret.

研究动机与目标

建立在投标代理使用动态配给但不收敛至均衡的重复拍卖中的聚合福利保证。
在对抗性和随机环境中，为预算受限的投标人提供其效用或价值最大化的个体后悔边界。
将这些保证扩展至包括第一价格、第二价格和广义第二价格拍卖在内的核心拍卖的广泛类别。
通过使用真实世界Bing广告数据的半合成模拟验证理论结果。

提出的方法

论文采用了一种基于梯度的配给算法（算法1），该算法源自效用最大化问题的Lagrangian对偶，根据拍卖反馈调整出价乘数。
引入了一种新颖的基准——完美配给序列，以规避在预算约束下对抗性bandit设置中的不可能性结果。
理论分析采用动态后悔框架，将性能与理想配给序列进行比较，而不仅仅是与最佳固定乘数比较。
通过模块化分析技术，将该方法扩展至多种算法，包括SGD、OGD、Adam和乘法更新。
通过原-对偶论证建立了液态福利的近似保证，且无需假设均衡收敛。
数值验证使用Bing广告数据的半合成模拟，通过双对数回归估计后悔率，并比较不同算法的液态福利。

实验结果

研究问题

RQ1动态预算配给算法是否能在不收敛至纳什均衡的情况下实现高聚合市场效率？
RQ2在不假设均衡收敛的前提下，预算受限投标人重复拍卖中的液态福利最佳近似因子是多少？
RQ3在预算约束下，价值最大化在对抗性和随机环境中是否可实现个体后悔的有界性？
RQ4基于梯度的配给算法是否在包括第一价格和第二价格拍卖在内的多种拍卖格式中保持强性能？
RQ5在真实世界数据模式下，不同在线学习算法（如SGD、Adam、OGD）在后悔和液态福利方面的实证表现如何比较？

主要发现

基于梯度的配给算法实现的聚合液态福利至少达到最优期望液态福利的一半，即使在不收敛至均衡的情况下亦成立。
效用最大化的后悔率估计为 α ≈ 0.573（第二价格拍卖）和 α ≈ 0.540（第一价格拍卖），表明其为次线性增长。
对于价值最大化，后悔率同样为次线性，估计值为 α ≈ 0.574（SPA）和 α ≈ 0.538（FPA），显示出在不同目标下的鲁棒性。
在测试的算法中，SGD和OGD在后悔和液态福利方面表现最佳，Adam在第一价格格式中表现略差。
乘法更新始终表现较差，所有场景下均表现出更高的后悔和更低的液态福利。
半合成模拟证实理论保证在实证中成立，SGD与OGD在性能上无统计显著差异。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。