QUICK REVIEW

[论文解读] A General Framework for Learning-Augmented Online Allocation

Ilan Reuven Cohen, Debmalya Panigrahi|arXiv (Cornell University)|Jan 1, 2023

Optimization and Search Problems被引用 1

一句话总结

本文提出了一种统一的学习增强型框架，用于在线分数分配问题，通过每个代理仅使用一个学习参数，实现了对各类最小化和最大化目标（如makespan、Santa Claus、Nash福利和ℓp-最小化）的近乎最优竞争比。该方法基于目标函数的齐次性和单调性，提出了一种新颖的EP分配规则，确保对参数误差具有鲁棒性，并能从采样实例中进行学习。

ABSTRACT

Online allocation is a broad class of problems where items arriving online have to be allocated to agents who have a fixed utility/cost for each assigned item so to maximize/minimize some objective. This framework captures a broad range of fundamental problems such as the Santa Claus problem (maximizing minimum utility), Nash welfare maximization (maximizing geometric mean of utilities), makespan minimization (minimizing maximum cost), minimization of 𝓁_p-norms, and so on. We focus on divisible items (i.e., fractional allocations) in this paper. Even for divisible items, these problems are characterized by strong super-constant lower bounds in the classical worst-case online model. In this paper, we study online allocations in the learning-augmented setting, i.e., where the algorithm has access to some additional (machine-learned) information about the problem instance. We introduce a general algorithmic framework for learning-augmented online allocation that produces nearly optimal solutions for this broad range of maximization and minimization objectives using only a single learned parameter for every agent. As corollaries of our general framework, we improve prior results of Lattanzi et al. (SODA 2020) and Li and Xian (ICML 2021) for learning-augmented makespan minimization, and obtain the first learning-augmented nearly-optimal algorithms for the other objectives such as Santa Claus, Nash welfare, 𝓁_p-minimization, etc. We also give tight bounds on the resilience of our algorithms to errors in the learned parameters, and study the learnability of these parameters.

研究动机与目标

通过引入机器学习预测，克服经典在线分配中强超常数竞争下界的问题。
统一设计在线分配中各类最小化与最大化目标的近乎最优算法。
建立算法对学习参数误差的鲁棒性，并研究这些参数从分布中学习的可能性。
将先前关于负载均衡的结果推广至更广泛的目标类别，包括Santa Claus和Nash福利。
为具有良好性质的目标（定义为单调且齐次的函数）提供一个统一、有原则的框架，适用于最大化与最小化问题。

提出的方法

提出一种通用的EP分配规则，根据物品价值与学习参数的加权积进行分配，其缩放因子α由目标函数的齐次性导出。
通过归一化步骤，将物品价值按每个代理的最优目标值（ℓf_i）进行缩放，将问题转化为单位目标值基准。
使用从采样实例中获得的经验平均值导出的参数向量w*，以近似最优对偶变量，确保高概率下的性能表现。
采用噪声鲁棒性分析，表明参数误差最多使目标值扩大为原来的η倍，从而在参数扰动下仍能保持性能。
利用超可加性/次可加性假设，证明可通过采样实例学习参数，使期望解达到(1±O(ǫ))近似解。
通过齐次性与单调性将问题简化为类似凸结构，从而统一处理原本针对不同目标而设计的异构算法技术。

实验结果

研究问题

RQ1每个代理仅使用一个学习参数，是否能实现对多样化在线分配目标的近乎最优解？
RQ2所提出的算法对学习参数误差的鲁棒性如何？
RQ3能否从问题实例的分布中高效学习最优学习参数？
RQ4该框架是否统一了此前针对最大化与最小化目标所采用的异构算法方法？
RQ5目标函数的哪些结构性质（如单调性、齐次性）是该框架有效运行的必要且充分条件？

主要发现

该框架在仅使用每个代理一个学习参数的前提下，对最小化目标实现了1+ǫ的竞争比，对最大化目标实现了1−ǫ的竞争比。
该算法对参数误差具有鲁棒性：若学习权重为η-近似，则解的目标值最多恶化为原来的η倍。
对于具有良好性质的目标，该框架可确保以高概率在采样实例的期望下，使最大化问题的解至少达到(1−Ω(ǫ))·T，最小化问题的解至多为(1+O(ǫ))·T。
该方法将先前关于负载均衡的结果推广至Santa Claus和Nash福利等新目标，首次为这些问题提供了学习增强型的近乎最优算法。
该框架适用于所有既单调又齐次的目标，包括在线分配中的所有标准目标，如ℓp-范数、makespan和Nash福利。
在分布和小物品满足弱假设的前提下，可通过O(m/ǫ · log m)个独立采样实例高效估计学习参数。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。