Skip to main content
QUICK REVIEW

[论文解读] The Unfairness of Multifactorial Bias in Recommendation

Masoud Mansoury, Jin Huang|arXiv (Cornell University)|Jan 19, 2026
Recommender Systems and Techniques被引用 0
一句话总结

论文研究 popularity 与 positivity 偏见的多因素偏差如何影响推荐系统中的曝光公平性,并提出基于百分位的评分变换作为预处理方法,以在最小的准确度损失下减轻偏差。

ABSTRACT

Popularity bias and positivity bias are two prominent sources of bias in recommender systems. Both arise from input data, propagate through recommendation models, and lead to unfair or suboptimal outcomes. Popularity bias occurs when a small subset of items receives most interactions, while positivity bias stems from the over-representation of high rating values. Although each bias has been studied independently, their combined effect, to which we refer to as multifactorial bias, remains underexplored. In this work, we examine how multifactorial bias influences item-side fairness, focusing on exposure bias, which reflects the unequal visibility of items in recommendation outputs. Through simulation studies, we find that positivity bias is disproportionately concentrated on popular items, further amplifying their over-exposure. Motivated by this insight, we adapt a percentile-based rating transformation as a pre-processing strategy to mitigate multifactorial bias. Experiments using six recommendation algorithms across four public datasets show that this approach improves exposure fairness with negligible accuracy loss. We also demonstrate that integrating this pre-processing step into post-processing fairness pipelines enhances their effectiveness and efficiency, enabling comparable or better fairness with reduced computational cost. These findings highlight the importance of addressing multifactorial bias and demonstrate the practical value of simple, data-driven pre-processing methods for improving fairness in recommender systems.

研究动机与目标

  • 研究 multifactorial bias(人气偏见 + 积极性偏见)如何影响推荐系统中的物品曝光公平性。
  • 分析公开数据集中积极性偏见与物品人气之间的相关性。
  • 改进并评估基于百分位的评分变换以减轻 multifactorial bias。
  • 评估在使用所提出的预处理方法时,对后处理公平性流程的效率提升。

提出的方法

  • 在四个公开数据集(Goodreads、MovieLens、Google Local Data、Yelp)中表征人气偏见与积极性偏见。
  • 将 multifactorial bias 定义为一个过程,其中物品和评分值因素影响用户评分。
  • 使用仿真研究当人气物品的积极性偏见增加时对曝光公平性和准确度的影响。
  • 将百分位基评分变换应用于物品档案(将评分变换为百分位值),以减少对人气物品的积极性偏见。
  • 在四个数据集上对六个推荐算法(BiasedMF、SVD++、WRMF、ListRank、UserKNN、ItemKNN)进行原始评分和百分位变换数据的对比评估。
  • 在条件之间比较公平性指标(IA、LIA、EE)和准确度指标(Precision、nDCG)。

实验结果

研究问题

  • RQ1多因素偏见如何影响物品推荐中的曝光公平性?
  • RQ2积极性偏见是否在受欢迎的物品上更为集中,这如何影响公平性?
  • RQ3百分位基评分变换是否可在不牺牲准确性的前提下缓解多因素偏见并改善曝光公平性?
  • RQ4使用百分位变换进行预处理是否提升后处理公平方法的效率?
  • RQ5结果在多数据集和多种推荐算法上是否有一致性?

主要发现

  • 积极性偏见在受欢迎的物品上更强,使其在推荐中的过度曝光被放大。
  • 缓解对受欢迎物品的积极性偏见可以改善曝光公平性指标(IA、LIA、EE),同时对准确性产生的影响呈现出不稳定性。
  • 将评分转换为百分位值可降低输入数据中的多因素偏见,从而在数据集和模型上实现更公平的曝光。
  • 将百分位变换数据作为输入时,若干算法的公平性与准确度均得到提升(ListRank 在 Yelp 上除外)。
  • 使用百分位变换的预处理可以在后处理公平性流程中实现更短的初始推荐列表,同时保持可比的公平性水平。

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。