QUICK REVIEW

[논문 리뷰] Beyond News Contents: The Role of Social Context for Fake News Detection

Kai Shu, Suhang Wang|arXiv (Cornell University)|2017. 12. 20.

Misinformation and Its Impacts참고 문헌 34인용 수 106

한 줄 요약

TriFN은 발행자-뉴스 관계 및 사용자 참여를 활용하여 가짜 뉴스 탐지에 있어 FakeNewsNet(BuzzFeed와 PolitiFact)에서 베이스라인보다 우수합니다.

ABSTRACT

Social media is becoming popular for news consumption due to its fast dissemination, easy access, and low cost. However, it also enables the wide propagation of fake news, i.e., news with intentionally false information. Detecting fake news is an important task, which not only ensures users to receive authentic information but also help maintain a trustworthy news ecosystem. The majority of existing detection algorithms focus on finding clues from news contents, which are generally not effective because fake news is often intentionally written to mislead users by mimicking true news. Therefore, we need to explore auxiliary information to improve detection. The social context during news dissemination process on social media forms the inherent tri-relationship, the relationship among publishers, news pieces, and users, which has potential to improve fake news detection. For example, partisan-biased publishers are more likely to publish fake news, and low-credible users are more likely to share fake news. In this paper, we study the novel problem of exploiting social context for fake news detection. We propose a tri-relationship embedding framework TriFN, which models publisher-news relations and user-news interactions simultaneously for fake news classification. We conduct experiments on two real-world datasets, which demonstrate that the proposed approach significantly outperforms other baseline methods for fake news detection.

연구 동기 및 목표

콘텐츠만으로는 충분하지 않은 소셜 미디어 era에서 가짜 뉴스 탐지의 동기를 제시한다.
전파 역학을 모델링하기 위한 삼중 관계(publishers, news, users)를 정의한다.
콘텐츠, 사회적 맥락 및 상호 작용으로부터의 공동 표현을 학습하기 위해 TriFN을 개발한다.
실제 데이터셋 실험을 통해 효과를 입증한다.

제안 방법

뉴스 콘텐츠 임베딩, 사용자 임베딩, 사용자-뉴스 상호작용 임베딩, 발행인-뉴스 관계 임베딩, 그리고 반지도 학습 분류의 다섯 구성요소를 결합한 삼중 관계 임베딩 프레임워크 TriFN를 제안한다.
비음수 행렬 분해(NMF)를 사용하여 뉴스 콘텐츠와 사용자를 각각 (D,V) 및 (U,T)로 임베딩한다.
가짜/진짜 라벨에 대한 사용자 신뢰도 점수를 활용하여 사용자-뉴스 상호작용 학습을 가이드하는 가중 거리 항을 통해 잠재 특징을 연결한다.
발행인 편향을 보유한 라벨이 지정된 발행인을 이용한 발행인-뉴스 관계 임베딩으로 발행인 특성과 편향 점수를 정렬한다.
뉴스 잠재 특징에서 라벨로의 선형 매핑을 갖춘 반지도 분류기를 구현하며, D,U,V,T,p,q를 교대로 최소제곱법으로 최적화한다.

실험 결과

연구 질문

RQ1TriFN이 발행인 편향과 사용자 참여를 공동으로 모델링하여 가짜 뉴스 분류를 개선할 수 있는가?
RQ2발행인 편향 모델링과 사용자 참여 학습이 TriFN의 성능에 어떻게 기여하는가?
RQ3사용자 참여가 제한된 상황에서 TriFN이 조기에 가짜 뉴스 탐지를 가능하게 하는가?

주요 결과

데이터세트	방법	정확도	정밀도	재현율	F1
BuzzFeed	TriFN	0.864±0.026	0.849±0.040	0.893±0.013	0.870±0.019
PolitiFact	TriFN	0.878±0.017	0.867±0.034	0.893±0.023	0.880±0.015

TriFN은 두 개의 실제 데이터 세트(BuzzFeed 및 PolitiFact)에서 F1 점수 및其他 지표에서 베이스라인보다 현저히 우수하다.
BuzzFeed에서 TriFN은 F1 0.870을 달성하며 RST(0.633) 및 LIWC+Castillo(0.822)를 능가한다.
PolitiFact에서 TriFN은 F1 0.880을 달성하며 RST(0.615) 및 LIWC+Castillo(0.843)를 능가한다.
이 프레임워크는 발행인 편향과 사용자 신뢰도를 활용하여 뉴스를 표현하고 가짜 뉴스 예측을 개선한다.
실험은 FakeNewsNet에서 80-20 트레인-테스트 분할로 수행되었으며 10회 반복 평균으로 보고된다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.