QUICK REVIEW

[논문 리뷰] Sensory-Aware Sequential Recommendation via Review-Distilled Representations

Yeo Chan Yoon|arXiv (Cornell University)|2026. 03. 03.

Sentiment Analysis and Opinion Mining인용 수 0

한 줄 요약

ASEGR은 리뷰에서 추출한 감각 속성을 언어적으로 기반으로 한 증류를 통해 아이템 임베딩을 압축하고, 이를 고정된 감각 임베딩으로 표준 모델에 주입하여 다음 아이템 예측을 개선합니다.

ABSTRACT

We propose a novel framework for sensory-aware sequential recommendation that enriches item representations with linguistically extracted sensory attributes from product reviews. Our approach, extsc{ASEGR} (Attribute-based Sensory Enhanced Generative Recommendation), introduces a two-stage pipeline in which a large language model is first fine-tuned as a teacher to extract structured sensory attribute--value pairs, such as extit{color: matte black} and extit{scent: vanilla}, from unstructured review text. The extracted structures are then distilled into a compact student transformer that produces fixed-dimensional sensory embeddings for each item. These embeddings encode experiential semantics in a reusable form and are incorporated into standard sequential recommender architectures as additional item-level representations. We evaluate our method on four Amazon domains and integrate the learned sensory embeddings into representative sequential recommendation models, including SASRec, BERT4Rec, and BSARec. Across domains, sensory-enhanced models consistently outperform their identifier-based counterparts, indicating that linguistically grounded sensory representations provide complementary signals to behavioral interaction patterns. Qualitative analysis further shows that the extracted attributes align closely with human perceptions of products, enabling interpretable connections between natural language descriptions and recommendation behavior. Overall, this work demonstrates that sensory attribute distillation offers a principled and scalable way to bridge information extraction and sequential recommendation through structured semantic representation learning.

연구 동기 및 목표

리뷰에서 명시적 감각 의미를 가진 순차 추천 시스템을 풍부하게 만들고자 한다.
교사 LLM을 사용해 구조화된 감각 속성-값 쌍을 추출하고 이를 압축 아이템 임베딩으로 증류하도록 ASEGR를 개발한다.
온라인 LLM 사용 없이 감각 임베딩을 기존 순차 백본에 원활하게 통합할 수 있도록 한다.
감각 임베딩이 다양한 도메인과 백본에서 상호작용 데이터에 보완 신호를 제공하는지 입증한다.

제안 방법

색상, 패턴, 질감, 향 등 고정된 감각 면 스키마를 구성하고 구조화된 JSON 레코드를 생성하도록 학습된 교사 LLM으로 개방 어휘 속성 값을 추출한다.
참조 감각 추출 표준에 맞추어 시드 데이터에 교사 모델을 미세 조정하고 대규모 카탈로그 offline(2.67M 항목)을 주석 처리한다.
아이템 텍스트를 교사 타깃에 대한 회귀와 대조 학습(L stu = ||f theta(xi) - zteach,i||^2 + lambda * L_NCE)으로 768 차원의 감각 임베딩으로 매핑하도록 컴팩트한 학생 인코더(DeBERTa v3 Small)를 학습한다.
모든 아이템에 대해 감각 임베딩을 사전 계산해 저장하고, 입력층에서의 통합 조기 융합 연산자를 통해 s_i를 인덱스 임베딩과 융합하여 순차 모델에 임베딩을 주입한다.
네 가지 Amazon 도메인에서 SASRec, BERT4Rec, BSARec의 감각 보강 변형을 전체 랭킹 HR 및 NDCG 지표로 평가하고, 감각 기여를 고립시키기 위해 일관된 학습 프로토콜을 유지한다.

실험 결과

연구 질문

RQ1언어적으로 근거 있는 리뷰에서 도출된 감각 속성이 표준 순차 모델에 통합될 때 다음 아이템 예측을 개선할 수 있는가?
RQ2감각 정보를 컴팩트한 임베딩으로 증류하는 것이 여러 백본과 도메인에서 성능 이득을 유지하는가?
RQ3감각 임베딩이 인간의 상품 인식과 일치하는 해석 가능하고 감사 가능한 신호를 제공하는가?
RQ4감각 추출 및 증류 파이프라인이 큰 카탈로그와 offline 전처리 비용에 확장 가능한가?
RQ5감각 신호가 데이터 세트 및 모델 아키텍처에 따라 랭킹 행동에 어떤 차이를 보이는가?

주요 결과

감각 강화 모델은 모든 백본과 도메인에서 ID 전용 대비 성능을 능가하며, 여러 컷오프에서 HR 및 NDCG에서 일관된 향상을 보인다.
Beauty 및 Toys 도메인에서 가장 큰 향상을 보이며, 예를 들어 Beauty에서 SASRec HR@10이 6.05에서 7.22로, NDCG@10이 3.18에서 4.17로 상승하고, Toys에서도 다양한 백본에서 상당한 부스트를 보인다.
Sports 도메인은 혼재된 효과를 보이며, SASRec의 HR@10은 증가하지만 NDCG@10은 감소하는 반면, BERT4Rec은 두 지표에서 동시 이득을 보인다.
모든 도메인과 백본에 걸쳐 감각 임베딩이 검색 및 랭킹에 보완 신호를 제공하여, 언어적으로 기반한 감각 표현의 유용성을 시사한다.
프레임워크는 리뷰에서 파생된 명시적 감각 서술로 아이템 임베딩을 근거화해 해석 가능성을 높인다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.