Abstract
The growing need to identify mental health conditions has paved the way for automated computational methods for mental health surveillance on social media. However, inferring the accurate state of a user’s mind requires understanding the history of the user’s mental health condition, which is critical for identifying the mental health landscape of at-risk users. Recent methods have offered performance improvement to address this challenging area; however, they assume that the intervals between all historical social media posts are equally important, making them unable to capture the dynamic temporal patterns of historical posts. In this work, we address this gap and incorporate a time-aware framework that jointly learns the context of the user’s historical posts and temporal posting irregularities by disentangling representations of different time intervals in the users’ historical posts and adaptively selecting the most important interval for each sample at each time step. First, the hidden state of the RNN is disentangled into multiple independently updated small hidden states to model users’ historical posting information. Then, at each time step, the temporal context information is used to modulate the features of different posts, selecting the most important interval within the historical posts. Experimental results on two mental health (i.e., depression and self-harm) Reddit datasets show that our method outperforms state-of-the-art methods for mental health surveillance on social media.
Similar content being viewed by others
References
Adhikari S, Thapa S, Singh P, Huo H, Bharathy G, Prasad M (2021) A comparative study of machine learning and nlp techniques for uses of stop words by patients in diagnosis of Alzheimer’s disease. In: 2021 International joint conference on neural networks (IJCNN). IEEE, pp 1–8
Adhikari S, Thapa S, Naseem U, Singh P, Huo H, Bharathy G, Prasad M (2022) Exploiting linguistic information from Nepali transcripts for early detection of Alzheimer’s disease using natural language processing and machine learning techniques. Int J Hum–Comput Stud 160:102761
Aduragba OT, Yu J, Cristea AI, Shi L (2021) Detecting fine-grained emotions on social media during major disease outbreaks: health and well-being before and during the Covid-19 pandemic. In: AMIA annual symposium proceedings, vol 2021. American Medical Informatics Association, p 187
Aguilera J, Farías DIH, Montes-y-Gómez M, González LC (2021) Detecting traces of self-harm in social media: a simple and interpretable approach. In: Mexican international conference on artificial intelligence. Springer, Berlin, pp 196–207
Aragón ME, López-Monroy AP, Montes-y-Gómez M (2019) Inaoe-cimat at erisk 2019: detecting signs of anorexia using fine-grained emotions. In: CLEF (working notes)
Barros L, Trifan A, Oliveira JL (2021) Vader meets BERT: sentiment analysis for early detection of signs of self-harm through social mining. In: CLEF (working notes), pp 897–907
Baytas IM, Xiao C, Zhang X, Wang F, Jain AK, Zhou J (2017) Patient subtyping via time-aware lstm networks. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, pp 65–74
Beltagy I, Peters ME, Cohan A (2020) Longformer: the long-document transformer. arXiv preprint arXiv:2004.05150
Bucci S, Schwannauer M, Berry N (2019) The digital revolution and its impact on mental health care. Psychol Psychother Theory Res Pract 92(2):277–297
Burke M, Marlow C, Lento T (2010) Social network activity and social well-being. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1909–1912
Campillo-Ageitos E, Fabregat H, Araujo L, Martinez-Romo J (2021) Nlp-uned at erisk 2021: self-harm early risk detection with tf-idf and linguistic features. In: Working notes of CLEF, pp 21–24
Cao L, Zhang H, Feng L, Wei Z, Wang X, Li N, He X (2019) Latent suicide risk detection on microblog via suicide-oriented word embeddings and layered attention. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 1718–1728
Chen Z, Ma Q, Lin Z (2021) Time-aware multi-scale rnns for time series modeling. In: IJCAI, pp 2285–2291
Cho K, Van Merriënboer B, Bahdanau D, Bengio Y (2014) On the properties of neural machine translation: encoder–decoder approaches. arXiv preprint arXiv:1409.1259
Cui Y, Jia M, Lin T-Y, Song Y, Belongie S (2019) Class-balanced loss based on effective number of samples. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9268–9277
De Choudhury M, Gamon M, Counts S, Horvitz E (2013) Predicting depression via social media. In: Proceedings of the international AAAI conference on web and social media, vol 7
Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the Association for Computational Linguistics: human language technologies, Volume 1 (long and short papers), pp 4171–4186. Association for Computational Linguistics, Minneapolis, MN. https://doi.org/10.18653/v1/N19-1423. https://aclanthology.org/N19-1423
Gaur M, Alambo A, Sain JP, Kursuncu U, Thirunarayan K, Kavuluru R, Sheth A, Welton R, Pathak J (2019) Knowledge-aware assessment of severity of suicide risk for early intervention. In: The World Wide Web conference, pp 514–525
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Kang X, Dou R, Yu H (2022) Tua1 at erisk 2022: exploring affective memories for early detection of depression
Keymanesh M, Gurukar S, Boettner B, Browning C, Calder C, Parthasarathy S (2020) Twitter watch: leveraging social media to monitor and predict collective-efficacy of neighborhoods. In: Complex networks XI: proceedings of the 11th conference on complex networks CompleNet 2020. Springer, Berlin, pp 197–211
Lin T-Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
Maneriker P, He Y, Parthasarathy S (2021) Sysml: stylometry with structure and multitask learning: implications for darknet forum migrant analysis. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 6844–6857
Manolache A, Brad F, Barbalau A, Ionescu RT, Popescu M (2022) Veridark: a large-scale benchmark for authorship verification on the dark web. Adv Neural Inf Process Syst 35:15574–15588
Martínez-Castaño R, Htait A, Azzopardi L, Moshfeghi Y (2020) Early risk detection of self-harm and depression severity using BERT-based transformers: ilab at clef erisk 2020
Martínez-Castaño R, Htait A, Azzopardi L, Moshfeghi Y (2021) BERT-based transformers for early detection of mental health illnesses. In: International conference of the cross-language evaluation forum for european languages. Springer, pp 189–200
Matero M, Idnani A, Son Y, Giorgi S, Vu H, Zamani M, Limbachiya P, Guntuku SC, Schwartz HA (2019) Suicide risk assessment with multi-level dual-context language and BERT. In: Proceedings of the sixth workshop on computational linguistics and clinical psychology, pp 39–44
Maupomé D, Armstrong MD, Rancourt F, Soulas T, Meurs M-J (2021) Early detection of signs of pathological gambling, self-harm and depression through topic extraction and neural networks. In: CLEF (working notes), pp 1031–1045
Naseem U, Dunn AG, Kim J, Khushi M (2022a) Early identification of depression severity levels on reddit using ordinal classification. In: Proceedings of the ACM web conference 2022, pp 2563–2572
Naseem U, Khushi M, Kim J, Dunn AG (2022) Hybrid text representation for explainable suicide risk identification on social media. IEEE Trans Comput Soc Syst 6:66
Naseem U, Kim J, Khushi M, Dunn AG (2022c) Identification of disease or symptom terms in reddit to improve health mention classification. In: Proceedings of the ACM web conference 2022, pp 2573–2581
Naseem U, Lee BC, Khushi M, Kim J, Dunn AG (2022d) Benchmarking for public health surveillance tasks on social media with a domain-specific pretrained language model. arXiv preprint arXiv:2204.04521
Naseem U, Kim J, Khushi M, Dunn A (2023) Graph-based hierarchical attention network for suicide risk detection on social media. In: Companion proceedings of the ACM web conference 2023, pp 995–1003
Nasim M, Weber D, South T, Tuke J, Bean N, Falzon L, Mitchell L (2022) Are we always in strife? A longitudinal study of the echo chamber effect in the Australian twittersphere. arXiv preprint arXiv:2201.09161
Organization WH et al (2022) World mental health report: transforming mental health for all
Ragheb W, Azé J, Bringay S, Servajean M (2019) Attentive multi-stage learning for early risk detection of signs of anorexia and self-harm on social media. In: CLEF 2019 working notes-conference and labs of the evaluation forum, vol 2380, p 126
Ríssola EA, Losada DE, Crestani F (2021) A survey of computational methods for online mental state assessment on social media. ACM Trans Comput Healthc 2(2):1–31
Sawhney R, Manchanda P, Mathur P, Shah R, Singh R (2018a) Exploring and learning suicidal ideation connotations on social media with deep learning. In: Proceedings of the 9th workshop on computational approaches to subjectivity, sentiment and social media analysis, pp 167–175
Sawhney R, Manchanda P, Singh R, Aggarwal S (2018b) A computational approach to feature extraction for identification of suicidal ideation in tweets. In: Proceedings of ACL 2018, student research workshop, pp 91–98
Sawhney R, Joshi H, Gandhi S, Shah R (2020) A time-aware transformer based model for suicide ideation detection on social media. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 7685–7697
Sawhney R, Agarwal S, Neerkaje AT, Aletras N, Nakov P, Flek L (2022) Towards suicide ideation detection through online conversational context. In: Proceedings of the 45th international ACM SIGIR conference on research and development in information retrieval, pp 1716–1727
Thapa S, Adhikari S, Naseem U, Singh P, Bharathy G, Prasad M (2020) Detecting Alzheimer’s disease by exploiting linguistic information from Nepali transcript. In: Neural information processing: 27th international conference, ICONIP 2020, Bangkok, Thailand, November 18–22, 2020, Proceedings, Part IV 27. Springer, Berlin, pp 176–184
Thapa S, Ghimire A, Adhikari S, Bhoi AK, Barsocchi P (2022) Cognitive internet of things (iot) and computational intelligence for mental well-being. In: Cognitive and soft computing techniques for the analysis of healthcare data. Elsevier, Berlin, pp 59–77
Tsakalidis A, Nanni F, Hills A, Chim J, Song J, Liakata M (2022) Identifying moments of change from longitudinal user text. arXiv preprint arXiv:2205.05593
Un Nisa Q, Muhammad R (2021) Towards transfer learning using BERT for early detection of self-harm of social media users. In: Proceedings of the working notes of CLEF, pp 21–24
Wang X, Brown DE, Gerber MS (2012) Spatio-temporal modeling of criminal incidents using geographic, demographic, and twitter-derived information. In: 2012 IEEE international conference on intelligence and security informatics. IEEE, pp 36–41
Weber D, Nasim M, Mitchell L, Falzon L (2020) A method to evaluate the reliability of social media data for social network analysis. In: 2020 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM). IEEE, pp 317–321
Williams ML, Burnap P, Sloan L (2017) Crime sensing with big data: the affordances and limitations of using open-source communications to estimate crime patterns. Brit J Criminol 57(2):320–340
Zhang T, Yang K, Ji S, Ananiadou S (2023) Emotion fusion for mental illness detection from social media: a survey. Inf Fusion 92:231–246
Zhou X, Coiera E, Tsafnat G, Arachi D, Ong M-S, Dunn AG et al (2015) Using social connection information to improve opinion mining: identifying negative sentiment about hpv vaccines on Twitter
Zogan H, Razzak I, Jameel S, Xu G (2021) Depressionnet: a novel summarization boosted deep framework for depression detection on social media. arXiv preprint arXiv:2105.10878
Funding
None.
Author information
Authors and Affiliations
Contributions
UN conceptualized the project, carried out the experiments, and wrote the first draft collaboratively with ST. ST analyzed the results and wrote the draft. QZ, LH, and JR analyzed the results and provided technical interpretations along with reviewing and editing the draft. MN supervised the project and contributed to reviewing and editing the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Naseem, U., Thapa, S., Zhang, Q. et al. Incorporating historical information by disentangling hidden representations for mental health surveillance on social media. Soc. Netw. Anal. Min. 14, 9 (2024). https://doi.org/10.1007/s13278-023-01167-9
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-023-01167-9