QUICK REVIEW

[논문 리뷰] Fake News Detection on Social Media: A Data Mining Perspective

Kai Shu, Amy Sliva|arXiv (Cornell University)|2017. 08. 07.

Misinformation and Its Impacts참고 문헌 71인용 수 599

한 줄 요약

본 고찰은 데이터 마이닝 관점에서 소셜 미디어의 가짜 뉴스 탐지를 다루며 정의, 특성(내용 및 사회적 맥락), 탐지 모델, 데이터세트, 평가 지표 및 향후 방향을 담고 있다.

ABSTRACT

Social media for news consumption is a double-edged sword. On the one hand, its low cost, easy access, and rapid dissemination of information lead people to seek out and consume news from social media. On the other hand, it enables the wide spread of "fake news", i.e., low quality news with intentionally false information. The extensive spread of fake news has the potential for extremely negative impacts on individuals and society. Therefore, fake news detection on social media has recently become an emerging research that is attracting tremendous attention. Fake news detection on social media presents unique characteristics and challenges that make existing detection algorithms from traditional news media ineffective or not applicable. First, fake news is intentionally written to mislead readers to believe false information, which makes it difficult and nontrivial to detect based on news content; therefore, we need to include auxiliary information, such as user social engagements on social media, to help make a determination. Second, exploiting this auxiliary information is challenging in and of itself as users' social engagements with fake news produce data that is big, incomplete, unstructured, and noisy. Because the issue of fake news detection on social media is both challenging and relevant, we conducted this survey to further facilitate research on the problem. In this survey, we present a comprehensive review of detecting fake news on social media, including fake news characterizations on psychology and social theories, existing algorithms from a data mining perspective, evaluation metrics and representative datasets. We also discuss related research areas, open problems, and future research directions for fake news detection on social media.

연구 동기 및 목표

전통 미디어와 사회적 미디어에서 가짜 뉴스의 정의와 특성을 명확히 한다.
소셜 미디어에서 가짜 뉴스를 탐지하기 위한 데이터 마이닝 접근법을 요약한다.
콘텐츠 기반과 사회 맥락 기반 탐지 방법을 비교한다.
데이터세트, 평가 지표 및 향후 연구를 안내하기 위한 개방 문제를 논의한다.

제안 방법

뉴스 콘텐츠 모델과 사회 맥락 모델로 가짜 뉴스 탐지 방법을 분류한다.
콘텐츠 특성(언어적 및 시각적)과 사회 맥 context 특성(사용자, 게시물 및 네트워크 기반)을 설명한다.
콘텐츠 모델에 대한 지식 기반 및 스타일 기반 탐지 방법을 검토한다.
가짜 뉴스 연구에서 사용되는 데이터세트 유형 및 평가 지표를 개관한다.
소셜 미디어에서의 탐지에 대한 개방 이슈 및 향후 방향에 대해 논의한다.

실험 결과

연구 질문

RQ1가짜 뉴스에 대해 어떤 정의가 사용되며 전통 미디어와 소셜 미디어에서 어떻게 다른가?
RQ2소셜 미디어의 뉴스 콘텐츠와 사회적 맥락에서 어떤 특징이 가짜 뉴스 탐지에 효과적인가?
RQ3어떤 탐지 모델이 존재하며 입력 소스(콘텐츠 대 맥락)별로 어떻게 분류되는가?
RQ4이 연구 분야에서 일반적인 데이터세트와 평가 지표는 무엇인가?
RQ5향후 연구를 위한 개방 도전 과제와 유망한 방향은 무엇인가?

주요 결과

소셜 미디어의 가짜 뉴스는 뉴스 콘텐츠의 의도적 조작으로 인해 콘텐츠 외의 보조 정보가 필요하다.
사회적 맥 context 특성(사용자, 게시물, 네트워크)은 가짜 뉴스 탐지에 중요한 신호를 제공한다.
탐지 방법은 지식 기반 및 스타일 기반의 콘텐츠 모델로 구분되며, 콘텐츠에 의존하지 않는 사회적 맥락 모델도 있다.
전통적인 가짜 뉴스와 소셜 미디어 특정 패턴(봇, 트롤, 에코 체임버) 간 구분이 있다.
이 분야는 초기 단계에 있으며 몇 가지 개방 이슈와 향후 연구 기회가 있다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.