QUICK REVIEW

[論文レビュー] Learning Preference from Observed Rankings

Yu‐Chang Chen, Chen Chian Fuh|arXiv (Cornell University)|Feb 18, 2026

Consumer Market Behavior and Pricing被引用数 0

ひとこと要約

要約: 論文は、属性効果と低秩因子を用いて潜在的効用をモデル化し、露出バイアスを逆確率重み付けで修正し、大規模な不完全ランキングデータに対して SGD を適用することで、部分的ランキングから個々の消費者嗜好を学習するスケーラブルなフレームワークを開発します。

ABSTRACT

Estimating consumer preferences is central to many problems in economics and marketing. This paper develops a flexible framework for learning individual preferences from partial ranking information by interpreting observed rankings as collections of pairwise comparisons with logistic choice probabilities. We model latent utility as the sum of interpretable product attributes, item fixed effects, and a low-rank user-item factor structure, enabling both interpretability and information sharing across consumers and items. We further correct for selection in which comparisons are observed: a comparison is recorded only if both items enter the consumer's consideration set, inducing exposure bias toward frequently encountered items. We model pair observability as the product of item-level observability propensities and estimate these propensities with a logistic model for the marginal probability that an item is observable. Preference parameters are then estimated by maximizing an inverse-probability-weighted (IPW), ridge-regularized log-likelihood that reweights observed comparisons toward a target comparison population. To scale computation, we propose a stochastic gradient descent (SGD) algorithm based on inverse-probability resampling, which draws comparisons in proportion to their IPW weights. In an application to transaction data from an online wine retailer, the method improves out-of-sample recommendation performance relative to a popularity-based benchmark, with particularly strong gains in predicting purchases of previously unconsumed products.

研究の動機と目的

incomplete ranking データから個人レベルの嗜好をランダム効用基盤で推定する。
解釈可能な商品属性と潜在因子を組み込み、異質性を捉える。
観測性（露出バイアス）における選択を逆確率重み付けで扱う。
大規模で sparsely ranking データにも対応するスケーラブルな推定法（IPW と SGD）を提供する。
オンラインワイン購入への適用を示し、予測性能を評価する。

提案手法

潜在効用を u_{ij}=x_j^{⊤}β_i+α_j+λ_i^{⊤}f_j+ε_{ij}、ε_{ij} ~ Type-I EVT とモデル化する。
観測されたランキングを、P(j ≽ j') = σ(u_{ij}-u_{ij'}) の下でのペアワイズ比較として解釈する。
選択性の観測性を修正するために、重み 1/π_{jj'} を用いた IPW を導入する。ここで π_j = σ(x_j^{⊤}ψ)、π_{jj'}=π_j π_{j'}。
観測性指標 O_{ijj'} の最尤推定を介して ψ を推定し、π_j を代入する。
正則化（リッジ）IPW 目的関数: 最大化 ∑_{(i,j,j')∈D} (1/π_{jj'}) ln σ(u_{ij}-u_{ij'}) - κ||θ||_2^2。
ALS 風の SGD で逆確率重み付け再サンプリングを用い、π_{jj'}^{-1} に比例する確率で比較を draw し、標準の対数尤度勾配からリッジ項を引いた更新を行う。
効率的なサンプル抽出のために π_{jj'} を事前計算し、エイリアステーブルを用いてスケーラブルな最適化を実現。

実験結果

リサーチクエスチョン

RQ1 incomplete ranking データから個人特異的嗜好を回復するにはどうすればよいか。
RQ2潜在因子構造と観測属性が、異質性を捉え、未観測アイテム対の外挿を可能にするか。
RQ3露出/観測性バイアスが嗜好推定に与える影響は何か、ランキングデータで IPW はそれを正せるか。
RQ4提案された IPW-正規化 SGD 推定法は、大規模で sparse なランキングデータに対してスケーラブルか。
RQ5特に以前に消費されていないアイテムの予測精度が向上するか、外部データに対する推奨性能は改善されるか。

主な発見

モデルは、ワイン消費者の地域別嗜好において substantial な異質性を発見し、ボルドーやブルゴーニュのような名声ある地域では分布が右裾に偏り、しばしば双峰である。
潜在因子構造により、消費者とアイテム間で情報を共有でき、観測されていない比較の推定を支援する。
IPW による観測性の選択性補正は露出によるバイアスを低減し、保持データにおける予測性能を改善する。
SGD ベースの推定は IPW 重み付きで大規模かつ sparse なランキングデータにスケールし、ランキングから含まれるペアワイズ比較を効果的に活用する。
オンラインワイン取引への適用は、特に未消費の商品購入について、外部データに対する推奨の改善を示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。