QUICK REVIEW

[논문 리뷰] Deep-Sentiment: Sentiment Analysis Using Ensemble of CNN and Bi-LSTM Models

Shervin Minaee, Elham Azimi|arXiv (Cornell University)|2019. 04. 08.

Sentiment Analysis and Opinion Mining참고 문헌 34인용 수 36

한 줄 요약

본 논문은 CNN과 Bi-LSTM을 Glove 임베딩과 결합한 앙상블 감정 분석 모델을 제안하며, IMDB 및 SST2 데이터셋에서 개별 모델에 비해 정확도가 향상됨을 보인다.

ABSTRACT

With the popularity of social networks, and e-commerce websites, sentiment analysis has become a more active area of research in the past few years. On a high level, sentiment analysis tries to understand the public opinion about a specific product or topic, or trends from reviews or tweets. Sentiment analysis plays an important role in better understanding customer/user opinion, and also extracting social/political trends. There has been a lot of previous works for sentiment analysis, some based on hand-engineering relevant textual features, and others based on different neural network architectures. In this work, we present a model based on an ensemble of long-short-term-memory (LSTM), and convolutional neural network (CNN), one to capture the temporal information of the data, and the other one to extract the local structure thereof. Through experimental results, we show that using this ensemble model we can outperform both individual models. We are also able to achieve a very high accuracy rate compared to the previous works.

연구 동기 및 목표

모델 앙상블을 통해 감정 분석 정확도 향상을 동기 부여한다.
로컬 텍스트 구조를 포착하기 위해 CNN을 활용하고, 시간적 의존성을 포착하기 위해 Bi-LSTM을 활용한다.
표준 데이터셋에서 앙상블 예측의 평균이 개별 모델보다 개선된다는 것을 보여준다.

제안 방법

Glove 단어 임베딩을 CNN과 2층 Bi-LSTM 아키텍처의 입력으로 사용한다.
CNN은 각각 100개의 피처 맵을 갖는 네 가지 필터 사이즈(1, 2, 3, 4)를 사용하고, 그 다음 두 개의 완전 연결 계층과 소프트맥스를 따른다.
Bi-LSTM은 Glove 임베딩을 사용한 두 층 Bi-LSTM으로 감정을 예측한다.
CNN과 Bi-LSTM의 예측 확률 점수를 평균하여 앙상블한다.

실험 결과

연구 질문

RQ1CNN과 Bi-LSTM의 앙상블이 각 모델 단독보다 감정 분류 정확도를 향상시키는가?
RQ2IMDB 및 SST2 데이터셋에서의 앙상블 성능은 기존 연구와 어떻게 비교되는가?
RQ3앙상블 성능에 영향을 미치는 학습 설정과 데이터 특성은 무엇인가?

주요 결과

방법	정확도 비율
Proposed LSTM Model	89%
Proposed CNN Model	89.3%
Proposed Ensemble of LSTM and CNN	90%
Proposed LSTM Model (SST2)	80%
Proposed CNN Model (SST2)	80.2%
Proposed Ensemble of LSTM and CNN (SST2)	80.5%

CNN과 Bi-LSTM의 앙상블은 IMDB에서 단일 모델보다 더 높은 정확도를 보인다(앙상블 90% 대 CNN 89.3%, LSTM 89%).
SST2에서도 약간의 향상을 제공한다(앙상블 80.5% 대 CNN 80.2%, LSTM 80%).
IMDB의 선정된 기존 연구들과 비교할 때, 앙상블은 경쟁력 있는 정확도를 달성한다(앙상블 90% 대 기존 방법 최대 89.3%).
CNN은 LSTM보다 예측 점수를 1에 더 가깝게 혹은 0에 더 가깝게 더 일관되게 만드는 경향이 있어 분류에 도움이 된다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.