QUICK REVIEW

[论文解读] "Like Sheep Among Wolves": Characterizing Hateful Users on Twitter

Manoel Horta Ribeiro, Pedro Calais|arXiv (Cornell University)|Dec 31, 2017

Hate Speech and Cyberbullying Detection参考文献 30被引用 37

一句话总结

本文提出一种以用户为中心的方法，通过分析其活动模式、网络中心性及语言内容，对推特上的仇恨用户进行表征。研究使用基于随机游走的爬虫采样了100,386名用户，并对4,972名用户（其中544名为仇恨用户）进行了众包标注。研究发现，仇恨用户创建时间更近、活动更频繁、在转发网络中更中心化，且使用的仇恨相关词汇更少，尽管其情绪更负面、使用更多粗俗用语——这一发现挑战了‘独狼’型仇恨者的刻板印象。

ABSTRACT

Hateful speech in Online Social Networks (OSNs) is a key challenge for companies and governments, as it impacts users and advertisers, and as several countries have strict legislation against the practice. This has motivated work on detecting and characterizing the phenomenon in tweets, social media posts and comments. However, these approaches face several shortcomings due to the noisiness of OSN data, the sparsity of the phenomenon, and the subjectivity of the definition of hate speech. This works presents a user-centric view of hate speech, paving the way for better detection methods and understanding. We collect a Twitter dataset of $100,386$ users along with up to $200$ tweets from their timelines with a random-walk-based crawler on the retweet graph, and select a subsample of $4,972$ to be manually annotated as hateful or not through crowdsourcing. We examine the difference between user activity patterns, the content disseminated between hateful and normal users, and network centrality measurements in the sampled graph. Our results show that hateful users have more recent account creation dates, and more statuses, and followees per day. Additionally, they favorite more tweets, tweet in shorter intervals and are more central in the retweet network, contradicting the "lone wolf" stereotype often associated with such behavior. Hateful users are more negative, more profane, and use less words associated with topics such as hate, terrorism, violence and anger. We also identify similarities between hateful/normal users and their 1-neighborhood, suggesting strong homophily.

研究动机与目标

为解决基于内容的仇恨言论检测方法在在线社交网络数据中面临的噪声、主观性和上下文理解等局限性。
探讨是否通过表征仇恨用户而非单条推文，能够实现更优的仇恨言论检测与理解。
研究仇恨用户与正常用户在推特上的活动行为、语言内容及网络位置方面的差异。
通过分析仇恨用户与正常用户的一阶邻域（1-neighborhood），探究同质性效应。

提出的方法

采用基于随机游走的爬虫，从转发网络中采样了100,386名推特用户构成的子图。
根据与仇恨相关词汇词典的词汇匹配结果，将用户识别为潜在仇恨用户。
采用分层抽样策略，选取与仇恨词汇用户保持不同距离的用户，以确保标注样本的多样性。
使用CrowdFlower平台对4,972名用户进行人工标注，标注人员可查看用户完整个人资料以获得上下文理解。
利用情感分析与词汇分析工具对用户时间线内容进行语言分析，评估其负面情绪、粗俗用语及主题相关性。
计算网络中心性指标（如介数中心性与度中心性），以评估仇恨用户在转发图中的全局与局部位置。

实验结果

研究问题

RQ1与正常用户相比，仇恨用户相关的属性与内容是否存在显著差异？
RQ2从全局中心性与局部邻域结构来看，仇恨用户在网络中的位置如何？
RQ3仇恨用户及其一阶邻域在行为与内容方面表现出多大程度的同质性？
RQ4仇恨用户的语言特征与正常用户相比如何，特别是在仇恨相关词汇与情感基调方面？

主要发现

仇恨用户的账号创建时间显著更近，其账号年龄中位数远低于正常用户。
仇恨用户每天发布的推文更多、每天关注的用户更多、点赞的推文也更多，表明其参与度更高。
仇恨用户在转发图中表现出更高的网络中心性，与‘孤立仇恨行为者’的‘独狼’刻板印象相矛盾。
尽管负面情绪与粗俗用语更多，仇恨用户使用的与仇恨、恐怖主义、暴力及愤怒相关的词汇却少于正常用户。
仇恨用户与正常用户的一阶邻域均表现出强烈的同质性，其直接社交圈中行为与语言模式高度相似。
仇恨用户的中位数在转发网络中更具中心性，介数中心性更高，表明其在信息传播中具有更大影响力。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。