Using Machine Learning-Based Approaches for the Detection and Classification of Human Papillomavirus Vaccine Misinformation: Infodemiology Study of Reddit Discussions
- PMID: 34383667
- PMCID: PMC8380585
- DOI: 10.2196/26478
Using Machine Learning-Based Approaches for the Detection and Classification of Human Papillomavirus Vaccine Misinformation: Infodemiology Study of Reddit Discussions
Abstract
Background: The rapid growth of social media as an information channel has made it possible to quickly spread inaccurate or false vaccine information, thus creating obstacles for vaccine promotion.
Objective: The aim of this study is to develop and evaluate an intelligent automated protocol for identifying and classifying human papillomavirus (HPV) vaccine misinformation on social media using machine learning (ML)-based methods.
Methods: Reddit posts (from 2007 to 2017, N=28,121) that contained keywords related to HPV vaccination were compiled. A random subset (2200/28,121, 7.82%) was manually labeled for misinformation and served as the gold standard corpus for evaluation. A total of 5 ML-based algorithms, including a support vector machine, logistic regression, extremely randomized trees, a convolutional neural network, and a recurrent neural network designed to identify vaccine misinformation, were evaluated for identification performance. Topic modeling was applied to identify the major categories associated with HPV vaccine misinformation.
Results: A convolutional neural network model achieved the highest area under the receiver operating characteristic curve of 0.7943. Of the 28,121 Reddit posts, 7207 (25.63%) were classified as vaccine misinformation, with discussions about general safety issues identified as the leading type of misinformed posts (2666/7207, 36.99%).
Conclusions: ML-based approaches are effective in the identification and classification of HPV vaccine misinformation on Reddit and may be generalizable to other social media platforms. ML-based methods may provide the capacity and utility to meet the challenge involved in intelligent automated monitoring and classification of public health misinformation on social media platforms. The timely identification of vaccine misinformation on the internet is the first step in misinformation correction and vaccine promotion.
Keywords: HPV vaccine; Reddit; deep learning; infodemiology; infoveillance; machine learning; misinformation; social media.
©Jingcheng Du, Sharice Preston, Hanxiao Sun, Ross Shegog, Rachel Cunningham, Julie Boom, Lara Savas, Muhammad Amith, Cui Tao. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 05.08.2021.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
Similar articles
-
Characterizing and Identifying the Prevalence of Web-Based Misinformation Relating to Medication for Opioid Use Disorder: Machine Learning Approach.J Med Internet Res. 2021 Dec 22;23(12):e30753. doi: 10.2196/30753. J Med Internet Res. 2021. PMID: 34941555 Free PMC article.
-
Dimensions of Misinformation About the HPV Vaccine on Instagram: Content and Network Analysis of Social Media Characteristics.J Med Internet Res. 2020 Dec 3;22(12):e21451. doi: 10.2196/21451. J Med Internet Res. 2020. PMID: 33270038 Free PMC article.
-
Characterizing the Prevalence of Obesity Misinformation, Factual Content, Stigma, and Positivity on the Social Media Platform Reddit Between 2011 and 2019: Infodemiology Study.J Med Internet Res. 2022 Dec 30;24(12):e36729. doi: 10.2196/36729. J Med Internet Res. 2022. PMID: 36583929 Free PMC article.
-
The Use of Natural Language Processing Methods in Reddit to Investigate Opioid Use: Scoping Review.JMIR Infodemiology. 2024 Sep 13;4:e51156. doi: 10.2196/51156. JMIR Infodemiology. 2024. PMID: 39269743 Free PMC article. Review.
-
Facilitators and Barriers of COVID-19 Vaccine Promotion on Social Media in the United States: A Systematic Review.Healthcare (Basel). 2022 Feb 8;10(2):321. doi: 10.3390/healthcare10020321. Healthcare (Basel). 2022. PMID: 35206935 Free PMC article. Review.
Cited by
-
Disparities in awareness of the HPV vaccine and HPV-associated cancers among racial/ethnic minority populations: 2018 HINTS.Ethn Health. 2023 May;28(4):586-600. doi: 10.1080/13557858.2022.2116630. Epub 2022 Aug 31. Ethn Health. 2023. PMID: 36045478 Free PMC article.
-
AVPCancerFree: Impact of a digital behavior change intervention on parental HPV vaccine -related perceptions and behaviors.Hum Vaccin Immunother. 2022 Nov 30;18(5):2087430. doi: 10.1080/21645515.2022.2087430. Epub 2022 Jun 14. Hum Vaccin Immunother. 2022. PMID: 35699953 Free PMC article. Clinical Trial.
-
Detecting and monitoring concerns against HPV vaccination on social media using large language models.Sci Rep. 2024 Jun 21;14(1):14362. doi: 10.1038/s41598-024-64703-3. Sci Rep. 2024. PMID: 38906941 Free PMC article.
-
Automatic detection of health misinformation: a systematic review.J Ambient Intell Humaniz Comput. 2023 May 27:1-13. doi: 10.1007/s12652-023-04619-4. Online ahead of print. J Ambient Intell Humaniz Comput. 2023. PMID: 37360776 Free PMC article.
References
-
- Human Papillomavirus (HPV) - reasons to get vaccinated. Centers for Disease Control and Prevention. 2019. [2021-07-02]. https://www.cdc.gov/hpv/parents/vaccine/six-reasons.html.
-
- Saraiya M, Unger E, Thompson T, Lynch CF, Hernandez BY, Lyu CW, Steinau M, Watson M, Wilkinson EJ, Hopenhayn C, Copeland G, Cozen W, Peters ES, Huang Y, Saber MS, Altekruse S, Goodman MT, HPV Typing of Cancers Workgroup US assessment of HPV types in cancers: implications for current and 9-valent HPV vaccines. J Natl Cancer Inst. 2015 Jun;107(6):djv086. doi: 10.1093/jnci/djv086. http://europepmc.org/abstract/MED/25925419 - DOI - PMC - PubMed
-
- HPV vaccine: who needs it, how it works. Mayo Clinic. 2019. [2020-02-09]. https://www.mayoclinic.org/diseases-conditions/hpv-infection/in-depth/hp....
-
- Zimet GD, Rosberger Z, Fisher WA, Perez S, Stupiansky NW. Beliefs, behaviors and HPV vaccine: correcting the myths and the misinformation. Prev Med. 2013 Nov;57(5):414–8. doi: 10.1016/j.ypmed.2013.05.013. https://linkinghub.elsevier.com/retrieve/pii/S0091-7435(13)00176-X - DOI - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources