QUICK REVIEW

[논문 리뷰] A Sentiment-and-Semantics-Based Approach for Emotion Detection in Textual Conversations

Umang Gupta, Ankush Chatterjee|arXiv (Cornell University)|2017. 07. 21.

Sentiment Analysis and Opinion Mining참고 문헌 28인용 수 82

한 줄 요약

이 논문은 SS-LSTM을 도입하며, 의미적(GloVe)과 감성 표현(SSWE)을 결합한 이중 임베딩 LSTM 모델로 3회 차 대화에서 감정(기쁨, 슬픔, 분노, 기타)을 감지하고 전통 ML 및 다른 DL 기법 Baseline 대비 성능이 우수합니다.

ABSTRACT

Emotions are physiological states generated in humans in reaction to internal or external events. They are complex and studied across numerous fields including computer science. As humans, on reading "Why don't you ever text me!" we can either interpret it as a sad or angry emotion and the same ambiguity exists for machines. Lack of facial expressions and voice modulations make detecting emotions from text a challenging problem. However, as humans increasingly communicate using text messaging applications, and digital agents gain popularity in our society, it is essential that these digital agents are emotion aware, and respond accordingly. In this paper, we propose a novel approach to detect emotions like happy, sad or angry in textual conversations using an LSTM based Deep Learning model. Our approach consists of semi-automated techniques to gather training data for our model. We exploit advantages of semantic and sentiment based embeddings and propose a solution combining both. Our work is evaluated on real-world conversations and significantly outperforms traditional Machine Learning baselines as well as other off-the-shelf Deep Learning models.

연구 동기 및 목표

텍스트 대화에서 얼굴/목소리 큐가 없기 때문에 감정 탐지를 동기화합니다.
수작업 특징을 피하는 데이터 효율적인 딥 러닝 모델을 개발합니다.
감정 분류를 개선하기 위한 의미 임베딩과 감정 임베딩의 결합 효과를 탐구합니다.

제안 방법

SS-LSTM 제안: 두 개의 병렬 LSTM 가지가 같은 발화를 서로 다른 임베딩(GloVe는 의미, SSWE는 감정)을 사용해 처리합니다.
두 특징 표현을 연결(concatenate)하고 하나의 은닉층을 가진 완전 연결 네트워크에 입력하여 감정 클래스 확률을 출력합니다.
교차 엔트로피 손실과 소프트맥스 함수를 SGD로 학습합니다; 배치 크기 4000, 학습률 0.005.
Twitter Q-A에서 반자동 라벨링을 통한 대규모 학습 데이터 생성과 이모티콘 정규화 및 데이터 증강 기법을 추가로 적용합니다.
ML 기초모델(NB, SVM, GBDT) 및 DL 기초모델(CNN-NAVA, LSTM 변형)과의 비교를 수행합니다.

실험 결과

연구 질문

RQ1감정 특정 임베딩과 의미 임베딩의 조합이 텍스트 대화에서 감정 인식 성능을 향상시킬 수 있는가?
RQ2SS-LSTM 변형들이 실제 대화 데이터에서 표준 DL 및 전통 ML 접근법과 비교하여 어떤 성능 차이를 보이는가?
RQ3임베딩 선택(GloVe vs SSWE)이 감정 탐지를 위한 의미/감정 특성 학습에 미치는 영향은 무엇인가?
RQ4이모티콘 처리 및 데이터 수집 전략이 모델 성능에 어떤 영향을 미치는가?

주요 결과

SS-LSTM이 모든 Baseline 대비 감정 클래스의 평균 F1에서 최고를 달성합니다.
SS-LSTM이 CNN 기반 접근법 및 다른 LSTM/임베딩 변형보다 통계적으로 유의미한 이득(p < 0.005)을 보이며 우수합니다.
GloVe 기반의 의미 임베딩과 SSWE 감정 임베딩의 조합은 강력한 성능을 보이며, SS-LSTM에서의 사용을 정당화합니다.
딥 러닝 모델은 평가 데이터셋에서 일반적으로 전통 ML 기법(NB, SVM, GBDT)보다 성능이 우수합니다.
제안된 데이터 수집 및 이모티콘 정규화 전략이 대화 맥락에서의 감정 탐지를 위한 고품질 학습 데이터에 기여합니다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.