Speech-Based Automatic Recognition Technology for Major Depression Disorder

  • Conference paper
Human Centered Computing (HCC 2019)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11956)

Abstract

Depression is one of the most common mental illnesses today. It can greatly harm patients' physical and mental health and cause huge losses to individuals, families and society. Owing to a lack of hardware and to social prejudice against depression, misdiagnoses and missed diagnoses are common in hospitals, so an objective and efficient way to help identify depression is needed. Previous studies have demonstrated the potential value of speech in this area: speech-based models can, to a large extent, distinguish patients from people without depression. On this basis, we aim to go further and predict the severity of depression from speech. In this paper, a total of 240 subjects were recruited to participate in the experiment. Their depression scores were measured with the PHQ-9 scale, and their speech was recorded while they gave a self-introduction. Effective voice features were then extracted, and principal component analysis (PCA) was applied to reduce the feature dimensionality. Finally, depression severity classification models were built using several classical machine learning methods. This study is an attempt at interdisciplinary research spanning psychology and computer science, and we hope it will provide new ideas for related work on mental health monitoring.
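The pipeline outlined in the abstract (PHQ-9 scores as labels, voice features, PCA, then a classical classifier) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. The abstract does not name the voice features, the number of retained components, or the classifiers, so everything here is an assumption: the features are random placeholders, 10 components are kept arbitrarily, and a nearest-centroid classifier stands in for "classical machine learning methods". The severity bands are the standard PHQ-9 cut points; whether the paper used those exact classes is also an assumption. All function names are hypothetical.

```python
import numpy as np

# Standard PHQ-9 severity bands for the total score (0-27).
# Assumption: the paper may have grouped scores differently.
PHQ9_BANDS = [(4, "minimal"), (9, "mild"), (14, "moderate"),
              (19, "moderately severe"), (27, "severe")]


def phq9_severity(score: int) -> str:
    """Map a PHQ-9 total score to its severity band."""
    if not 0 <= score <= 27:
        raise ValueError("PHQ-9 total score must be between 0 and 27")
    for upper, label in PHQ9_BANDS:
        if score <= upper:
            return label
    raise AssertionError("unreachable")


def fit_pca(X, n_components):
    """Return (mean, components) from an SVD of the centred feature matrix."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def transform(X, mean, components):
    """Project features onto the retained principal components."""
    return (X - mean) @ components.T


def fit_centroids(Z, labels):
    """Nearest-centroid 'training': one mean vector per severity class."""
    labels = np.asarray(labels)
    return {c: Z[labels == c].mean(axis=0) for c in sorted(set(labels.tolist()))}


def predict(Z, centroids):
    """Assign each row of Z to the class with the closest centroid."""
    classes = list(centroids)
    dists = np.stack(
        [np.linalg.norm(Z - centroids[c], axis=1) for c in classes], axis=1
    )
    return [classes[i] for i in dists.argmin(axis=1)]


# Placeholder data: 240 subjects (as in the abstract), 40 made-up features.
rng = np.random.default_rng(0)
n_subjects, n_features, n_train = 240, 40, 180
scores = rng.integers(0, 28, size=n_subjects)
labels = [phq9_severity(int(s)) for s in scores]
X = rng.normal(size=(n_subjects, n_features))

# Fit PCA on the training split only, then reuse it for the held-out split.
mean, comps = fit_pca(X[:n_train], n_components=10)
Z_train = transform(X[:n_train], mean, comps)
Z_test = transform(X[n_train:], mean, comps)

centroids = fit_centroids(Z_train, labels[:n_train])
predicted = predict(Z_test, centroids)
accuracy = float(np.mean([p == t for p, t in zip(predicted, labels[n_train:])]))
```

Because the placeholder features are pure noise, the accuracy here should sit near chance; the sketch only shows how the stages connect. Fitting PCA on the training split alone keeps information from the held-out subjects out of the projection.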

Acknowledgements

This work was supported financially by the China Southern Power Grid (Grant No. GDKJXM20180673).

Author information

Corresponding author

Correspondence to Zhixin Yang.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Yang, Z., Li, H., Li, L., Zhang, K., Xiong, C., Liu, Y. (2019). Speech-Based Automatic Recognition Technology for Major Depression Disorder. In: Milošević, D., Tang, Y., Zu, Q. (eds) Human Centered Computing. HCC 2019. Lecture Notes in Computer Science, vol 11956. Springer, Cham. https://doi.org/10.1007/978-3-030-37429-7_55

  • DOI: https://doi.org/10.1007/978-3-030-37429-7_55

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-37428-0

  • Online ISBN: 978-3-030-37429-7

  • eBook Packages: Computer Science, Computer Science (R0)
