Forecasting the onset and course of mental illness with Twitter data

https://goo.gl/sHNxra

We developed computational models to predict the emergence of depression and Post-Traumatic Stress Disorder in Twitter users. 

Twitter data and details of depression history were collected from 204 individuals (105 depressed, 99 healthy). We extracted predictive features measuring affect, linguistic style, and context from participant tweets (N = 279,951) and built models using these features with supervised learning algorithms. 

Resulting models successfully discriminated between depressed and healthy content, and compared favorably to general practitioners’ average success rates in diagnosing depression, albeit in a separate population. Results held even when the analysis was restricted to content posted before first depression diagnosis.