QUICK REVIEW

[논문 리뷰] Global Sentiment Analysis Of COVID-19 Tweets Over Time

Muvazima Mansoor, Kirthika Gurumurthy|arXiv (Cornell University)|2020. 10. 27.

Sentiment Analysis and Opinion Mining참고 문헌 8인용 수 38

한 줄 요약

이 논문은 VADER 라벨링과 ML 분류기(LSTM 및 ANN)를 사용하여 2020년 1월부터 2020년 6월까지 글로벌 COVID-19 트윗 감정을 분석하고, 재택근무(WFH) 및 Online Learning 감정과 사례 데이터를 함께 검토한다.

ABSTRACT

The Coronavirus pandemic has affected the normal course of life. People around the world have taken to social media to express their opinions and general emotions regarding this phenomenon that has taken over the world by storm. The social networking site, Twitter showed an unprecedented increase in tweets related to the novel Coronavirus in a very short span of time. This paper presents the global sentiment analysis of tweets related to Coronavirus and how the sentiment of people in different countries has changed over time. Furthermore, to determine the impact of Coronavirus on daily aspects of life, tweets related to Work From Home (WFH) and Online Learning were scraped and the change in sentiment over time was observed. In addition, various Machine Learning models such as Long Short Term Memory (LSTM) and Artificial Neural Networks (ANN) were implemented for sentiment classification and their accuracies were determined. Exploratory data analysis was also performed for a dataset providing information about the number of confirmed cases on a per-day basis in a few of the worst-hit countries to provide a comparison between the change in sentiment with the change in cases since the start of this pandemic till June 2020.

연구 동기 및 목표

Jan 2020 to Jun 2020의 Twitter 데이터를 사용하여 COVID-19에 대한 글로벌 대중 감정을 평가한다.
Lockdown 기간 동안 Work From Home과 Online Learning에 대한 감정 추세를 평가한다.
선정된 최악의 피해 국가들에서 일일 COVID-19 사례 추세와 감정 역학을 비교한다.
트윗 감정 분류를 위한 기계 학습 모델(LSTM 및 ANN)을 개발하고 비교한다.

제안 방법

세 가지 트윗 데이터셋: coronavirus, online learning, and work-from-home를 2020년 1월부터 2020년 6월까지 생성한다.
VADER로 감정을 라벨링하고 이진(positive/negative) 감정 분석 및 NRC EmoLex를 통한 감정(fear, trust) 탐지를 수행한다.
라벨링된 데이터를 대상으로 트윗 감정 분류를 위한 LSTM 및 ANN 분류기를 학습·평가하고 정확도를 보고한다.
전 세계 및 최악의 국가들에 대해 감정 및 감정 추세를 탐색적 데이터 분석 및 시각화한다.
감정/감정 추세를 Kaggle 데이터셋의 일일 COVID-19 사례 데이터와 상관 분석한다.

실험 결과

연구 질문

RQ1Jan 2020 to Jun 2020까지 COVID-19에 대한 글로벌 및 국가 수준 감정 추세는 무엇인가?
RQ2Lockdown 기간 동안 Work From Home과 Online Learning에 대한 감정은 어떻게 진화하는가?
RQ3두려움(fear)과 신뢰(trust) 감정은 시간에 따라 그리고 국가별로 어떻게 달라지며, 이것이 사례 추세와 어떤 관련이 있는가?
RQ4COVID-19 관련 트윗의 감정 분류 정확도에서 LSTM과 ANN은 어떻게 비교되는가?

주요 결과

LSTM은 coronavirus 트윗에서 84.5% 정확도를 달성했고; ANN은 76% 정확도를 달성했다.
양성(sentiment)과 음성(sentiment) 차이는 Feb–Mar에 가장 컸고, 이후 달들에 좁아졌다.
공포(fear)가 감정 점수에서 우세했고, 신뢰(trust)도 존재했다; 국가별로 감정 프로필이 달랐다(예: 일부 국가에서 더 높은 신뢰).
WFH 감정은 전반적으로 부정보다 긍정적이었고; Online Learning은 긍정적 경향이었지만 주목할 만한 부정도 있었다.
3월 이후 사례 추세는 기하급수적으로 증가했으며, 감정/정서 패턴은 lockdown 정책 및 사례 역학을 반영했다.
세 가지 데이터셋 (coronavirus, WFH, Online Learning)은 글로벌 사례 데이터와의 비교 시각화를 가능하게 했다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.