QUICK REVIEW

[논문 리뷰] Revenue-Optimal Pricing for Budget-Constrained Buyers in Data Markets

Bhaskar Ray Chaudhury, Jugal Garg|arXiv (Cornell University)|2026. 02. 14.

Advanced Bandit Algorithms Research인용 수 0

한 줄 요약

본 연구는 예산 제약이 있는 구매자가 있는 데이터 시장에서 수익 극대화 가격책정을 연구하며, 비선형 가격 책정이 다항 시간 내에 해결 가능하고 선형 가격 책정은 APX-hard임을 보이고 실용적 근사 알고리즘을 제시한다.

ABSTRACT

We study revenue-optimal pricing in data markets with rational, budget-constrained buyers. Such a market offers multiple datasets for sale, and buyers aim to improve the accuracy of their prediction tasks by acquiring data bundles. For each dataset, the market sets a pricing function, which maps the number of records purchased from the dataset to a non-negative price. The market's objective is to set these pricing functions to maximize total revenue, considering that buyers with quasi-linear utilities choose their bundles optimally under budget constraints. We analyze optimal pricing when each dataset's pricing function is only required to be monotone and lower-continuous. Surprisingly, even with this generality, optimal pricing has a highly structured form: it is piecewise linear and convex (PLC) and can be computed efficiently via an LP. Moreover, the total number of kinks across all pricing functions is bounded by the number of buyers. Thus, when datasets far outnumber buyers, most pricing functions are effectively linear. This motivates studying linear pricing, where each record in a dataset is priced uniformly. Although competitive equilibrium gives revenue-optimal linear prices in rivalrous markets with quasi-linear buyers, we show that revenue maximization under linear pricing in data markets is APX-hard. Hence, a striking computational dichotomy emerges: fully general (nonlinear) pricing admits a polynomial-time algorithm, while the simpler linear scheme is APX-hard. Despite the hardness, we design a 2-approximation algorithm when datasets arrive online, and a $(1-1/e)^{-1}$-approximation algorithm for the offline setting. Our framework lays the groundwork for exploring more general pricing schemes, richer utility models, and a deeper understanding of how market structure -- rivalrous versus non-rivalrous -- shapes revenue-optimal pricing.

연구 동기 및 목표

예산을 가진 m개의 데이터셋과 n명의 구매자를 포함하는 중앙집중식 데이터 마켓플레이스를 모델링한다.
단조 하한-연속 가격책정 아래의 최적 가격함수의 특성을 규명한다.
최적의 비선형 가격 책정은 조각별 선형이며 볼록하고(PLC) LP를 통해 계산 가능하다는 것을 보인다.
복잡도 이분법을 확립한다: 비선형 가격 책정은 다항 시간 내에 해결 가능하고, 선형 가격 책정은 APX-hard이다.

제안 방법

데이터 정확도 가치와 데이터셋 가격 비용을 포함한 구매자 효용을 형식화한다.
최적의 ML C 가격 함수가 볼록 포장(convex envelopes)와 PLC 근사를 통해 얻어진다는 것을 증명한다.
최적 PLC 가격 책정을 계산하는 LP를 도출하고 구조적 특성(변곡점이 n으로 한정됨)을 보인다.
선형 가격 책정 체계에서는 수익 극대화가 APX-hard임을 보인다.
근사 알고리즘을 제시한다: 온라인에서 2-근사, 오프라인에서 (1-1/e)^{-1} 근사.

실험 결과

연구 질문

RQ1예산 제약이 있는 준 선형 구매자를 대상으로 데이터 마켓플레이스가 데이터셋의 가격을 어떻게 책정하여 총 수익을 극대화할 수 있는가?
RQ2단조 하한-연속 가격 책정하에서 수익 최적 가격의 구조적 형태는 무엇인가?
RQ3비선형 가격 책정은 효율적으로 계산될 수 있는가, 그리고 선형 가격 책정은 복잡도와 성능 면에서 어떻게 비교되는가?
RQ4온라인 대 오프라인 설정에서 선형 가격 책정에 대해 어떤 근사화가 가능한가?

주요 결과

가격 책정 문제에 대한 최적 해는 다항 시간 내에 계산될 수 있다.
최적의 비선형 가격 책정은 조각별 선형이며 볼록(PLC)하고, 데이터셋당 평균 최대 n개의 구간점(breakpoints)을 가지며, 모든 데이터셋의 변곡점 총수는 구매자 수로 상한된다.
해당 문제는 이분법을 보인다: 비선형 가격 책정은 다항 시간 해결 가능, 반면 선형 가격 책정은 APX-hard이다.
데이터셋이 선형으로 가격 책정될 때, 온라인 도래에서 2-근사, 오프라인 설정에서 (1-1/e)^{-1} 근사를 허용한다.
폐쇄 형식의 수익 표현이 존재한다: r(p) = sum_i min(b_i, sum_{j: tau_{i,j} >= p_j} p_j), 비경쟁성으로 인해 구매자 간 경쟁 없이 각 구매자의 기여를 반영한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.