QUICK REVIEW

[논문 리뷰] RMBRec: Robust Multi-Behavior Recommendation towards Target Behaviors

Miaomiao Cai, Zhijie Zhang|arXiv (Cornell University)|2026. 01. 13.

Recommender Systems and Techniques인용 수 0

한 줄 요약

RMBRec은 보조 표상과 목표 표상을 정렬하고 행동 환경 간 불변 최적화를 강제함으로써 다중-행동 추천을 위한 두 모듈형 로버스트니스 프레임워크를 도입하여, 베이스라인 대비 정확도와 강건성을 향상시킨다.

ABSTRACT

Multi-behavior recommendation faces a critical challenge in practice: auxiliary behaviors (e.g., clicks, carts) are often noisy, weakly correlated, or semantically misaligned with the target behavior (e.g., purchase), which leads to biased preference learning and suboptimal performance. While existing methods attempt to fuse these heterogeneous signals, they inherently lack a principled mechanism to ensure robustness against such behavioral inconsistency. In this work, we propose Robust Multi-Behavior Recommendation towards Target Behaviors (RMBRec), a robust multi-behavior recommendation framework grounded in an information-theoretic robustness principle. We interpret robustness as a joint process of maximizing predictive information while minimizing its variance across heterogeneous behavioral environments. Under this perspective, the Representation Robustness Module (RRM) enhances local semantic consistency by maximizing the mutual information between users' auxiliary and target representations, whereas the Optimization Robustness Module (ORM) enforces global stability by minimizing the variance of predictive risks across behaviors, which is an efficient approximation to invariant risk minimization. This local-global collaboration bridges representation purification and optimization invariance in a theoretically coherent way. Extensive experiments on three real-world datasets demonstrate that RMBRec not only outperforms state-of-the-art methods in accuracy but also maintains remarkable stability under various noise perturbations. For reproducibility, our code is available at https://github.com/miaomiao-cai2/RMBRec/.

연구 동기 및 목표

보조 신호가 노이즈가 있거나 목표 구매와 불일치하는 다중 행동 추천에서 로버스트니스를 촉진한다.
원칙에 입각한 두 모듈 프레임워크를 제안하여 로컬 의미 drift와 글로벌 최적화 안정성을 모두 다룬다.
현실 데이터셋에서 최신 베이스라인과 비교하여 향상된 정확도와 강건성을 보여준다.
추천에서의 행동 이질성과 불변 학습에 대한 통찰을 제공한다.

제안 방법

두 가지 협업 모듈: Representation Robustness Module (RRM)와 Optimization Robustness Module (ORM).
RRM은 타깃-앵커링 대조 학습을 사용해 보조 표상과 목표 표상 사이의 상호 정보를 최대화하고, 보조 신호를 목표 의미와 정렬한다.
ORM은 불변 학습 접근법(IRM에서 영감을 받고, Risk Extrapolation을 통해 구현)을 채택하여 행동 환경 전반에 걸친 예측 리스크의 분산을 최소화하고, 안정적인 최적화를 촉진한다.
행동 b별로 행동 인코더를 구축(LightGCN 사용), 행동별 BPR 손실을 적용한 뒤, 정화된 임베딩의 등가 가중 합성으로 최종 예측.
총 목표는 타깃 행동 BPR 손실에 L_RRM 및 L_ORM 정규화를 더한 형태로 구성된다: L = L_main + lambda1 L_RRM + lambda2 L_ORM.
프레임워크는 이질적 행동 전반에서 분산을 최소화하면서 예측 정보를 극대화하는 데 기초한다.

실험 결과

연구 질문

RQ1보조 신호가 노이즈가 있거나 목표 행동과 의미적으로 불일치하는 상황에서 다중 행동 추천기의 로버스트니스를 어떻게 향상시킬 수 있는가?
RQ2로컬(표현 수준) 정제와 글로벌(최적화 수준) 불변성을 함께 적용하면 다양한 행동 환경에서 정확도와 강건성을 어떻게 개선할 수 있는가?
RQ3추천에서 서로 다른 행동을 별도의 환경으로 다루는 것이 불변 학습에 어떤 영향을 미치는가?
RQ4노이즈 perturbation 하에서 RMBRec가 현실 세계의 다중-행동 데이터셋에서 최신 베이스라인에 비해 어떻게 수행하는가?

주요 결과

RMBRec은 세 가지 현실 세계 데이터셋에서 일관되게 최첨단 베이스라인을 능가한다.
RRM 또는 ORM을 제거하면 성능이 크게 하락하는 것으로 나타나, 이들의 중요성을 확인했다.
Beibei 데이터셋은 높은 행동 정렬(BAR)과 낮은 직접 타깃 구매를 보이며, RMBRec가 보조 신호를 효과적으로 활용하는 강한 신호 품질을 보여준다.
RMBRec은 타깃-앵커링 정렬 및 불변 최적화를 통해 노이즈가 있는 보조 신호와 분포 이동에 대한 강건성을 보여준다.
노이즈 주입 테스트 및 하이퍼파라미터 민감도 분석을 포함한 심화 분석이 방법의 안정성과 강건함을 검증한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.