QUICK REVIEW

[论文解读] Understanding and Measuring Psychological Stress using Social Media

Sharath Chandra Guntuku, Anneke Buffone|arXiv (Cornell University)|Nov 19, 2018

Mental Health via Writing被引用 30

一句话总结

本研究开发并验证了语言模型，用于从社交媒体数据中预测心理压力，基于601名用户的Facebook和Twitter数据及真实压力感知量表（PSS）评分。结果表明，Facebook语言比Twitter语言更具压力预测能力，并成功通过领域自适应技术将用户级模型扩展至县一级预测，其表现优于社会人口学变量，在个体和区域压力估计中均表现更优。

ABSTRACT

A body of literature has demonstrated that users' mental health conditions, such as depression and anxiety, can be predicted from their social media language. There is still a gap in the scientific understanding of how psychological stress is expressed on social media. Stress is one of the primary underlying causes and correlates of chronic physical illnesses and mental health conditions. In this paper, we explore the language of psychological stress with a dataset of 601 social media users, who answered the Perceived Stress Scale questionnaire and also consented to share their Facebook and Twitter data. Firstly, we find that stressed users post about exhaustion, losing control, increased self-focus and physical pain as compared to posts about breakfast, family-time, and travel by users who are not stressed. Secondly, we find that Facebook language is more predictive of stress than Twitter language. Thirdly, we demonstrate how the language based models thus developed can be adapted and be scaled to measure county-level trends. Since county-level language is easily available on Twitter using the Streaming API, we explore multiple domain adaptation algorithms to adapt user-level Facebook models to Twitter language. We find that domain-adapted and scaled social media-based measurements of stress outperform sociodemographic variables (age, gender, race, education, and income), against ground-truth survey-based stress measurements, both at the user- and the county-level in the U.S. Twitter language that scores higher in stress is also predictive of poorer health, less access to facilities and lower socioeconomic status in counties. We conclude with a discussion of the implications of using social media as a new tool for monitoring stress levels of both individuals and counties.

研究动机与目标

理解心理压力在社交媒体帖子中的语言表达方式，特别是区分状态性压力与特质性压力。
基于标准化心理调查（PSS）数据，开发并验证基于Facebook和Twitter数据的用户级语言模型以预测压力。
解决将个体级压力模型扩展至县一级预测的挑战，采用领域自适应技术。
在用户和地理层面，评估社交媒体语言对压力的预测能力，与社会人口学变量及真实调查数据进行比较。
探索社交媒体语言作为实时、非侵入性工具在大规模监测压力和指导公共卫生干预中的实用性。

提出的方法

从601名用户收集社交媒体数据（Facebook和Twitter），这些用户也完成了压力感知量表（PSS）问卷。
使用语言特征（如LIWC词典）训练有监督语言模型，从用户帖子中预测压力水平。
应用迁移学习和领域自适应技术，将基于Facebook的压力模型迁移至Twitter语言，以应对平台间自我披露差异。
通过流媒体API对Twitter数据进行县一级聚合，将预测能力扩展至地理区域。
采用加权和缩放方法，避免从个体到县一级估计时出现生态谬误。
在用户和县一级对比真实PSS评分验证模型性能，并与社会人口学预测变量（年龄、性别、种族、教育、收入）进行比较。

实验结果

研究问题

RQ1心理压力在社交媒体帖子中如何通过语言表达，特别是与非压力相关话题相比有何差异？
RQ2Facebook语言相较于Twitter语言在多大程度上更具压力预测能力，原因是什么？
RQ3能否有效将基于Facebook数据训练的用户级压力模型，通过Twitter数据适应以预测县一级压力？
RQ4在个体和县一级，基于语言的压力模型与社会人口学变量相比，其预测表现如何？
RQ5基于社交媒体的县一级压力评分与真实世界指标（如健康结果和经济社会状况）之间存在何种关联？

主要发现

压力较大的用户更常提及疲惫、失控感、自我关注和身体疼痛，而非压力用户则更多讨论早餐、家庭时间及旅行。
Facebook语言显著比Twitter语言更具压力预测能力，可能由于自我披露程度和帖子长度的差异。
基于Facebook数据训练并经领域自适应处理的语言模型，在应用于Twitter数据时，成功预测了县一级压力水平，且表现优于社会人口学变量。
基于Twitter语言推断的县一级压力估计与较差的健康状况、设施可及性较低以及较低的社会经济地位相关。
LIWC等语言词典在压力预测中表现优于基于参与度的特征（如发帖频率），凸显内容特征的重要性高于行为特征。
本研究证明，社交媒体语言可作为个体和地理层面心理压力的有效、可扩展且实时的替代指标。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。