Research Studies
Study Tracker
Combating Misinformation in the Digital Age: A Machine Learning Approach to Protect Community Water Fluoridation and Promote Oral Health Equity.Abstract
Original abstract online at
https://onlinelibrary.wiley.com/doi/10.1111/jphd.70020
Objectives
Community water fluoridation (CWF) is an effective public health intervention for preventing dental caries. However, the widespread dissemination of misinformation on social media platforms, such as Twitter (now “X”), threatens public acceptance and may exacerbate oral health inequities. This study aimed to develop, evaluate, and deploy machine learning (ML) and deep learning (DL) models to identify misinformation about CWF on Twitter and assess implications for public health communication and surveillance.
Methods
We collected 19,960 English-language tweets about CWF posted between 2014 and 2024 using keyword-based queries. Tweets originated globally; however, only US-based geotagged tweets were used for sociodemographic analysis because reliable demographic and oral health surveillance data (e.g., US Census, BRFSS, NHANES) were available for linkage. Veracity was determined using authoritative public health criteria from the CDC, WHO, and ADA, with a subset of tweets manually annotated as factually correct or misinformation. Six machine learning and deep learning models were trained and evaluated. Additional analyses included sentiment scoring, thematic content coding, and geospatial-demographic comparisons.
Results
The Support Vector Classifier achieved the highest accuracy (91.6%). A hybrid BERT + XGBoost model (89.9% accuracy) was selected for deployment due to its strong performance and interpretability. Overall, 78.8% of tweets were classified as misinformation, with dominant themes including fluoride toxicity, distrust of government, and individual autonomy. Misinformation tweets were shorter, more engaging, and concentrated in socioeconomically disadvantaged areas with a high prevalence of misinformation tweets, where poverty rates and untreated dental caries were also greater. Sentiment analysis showed pro-CWF tweets were, on average, more positive.
Conclusions
ML and DL models can effectively detect CWF-related misinformation on social media. Integrated with equity-focused communication strategies, these tools may help sustain public trust in CWF and reduce misinformation-related oral health inequities.
Conflicts of Interest
The authors declare no conflicts of interest.
Authors Affiliations
- 1 The School of Dentistry, The University of Queensland, Brisbane, Australia.
- 2 QIMR Berghofer, Brisbane, Queensland, Australia.
Data Availability Statement
The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.
Supporting Information
| Filename | Description |
|---|---|
| jphd70020-sup-0001-Supplementary1.docxWord 2007 document , 258.5 KB | Data S1: Supporting Information. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
Funding: This work was supported by University of Queensland (#2024439).
Note from Fluoride Action Network:
This issue of the journal was a supplement supported by CareQuest, Institute for Oral Health, a group that advocates to protect Community Water Fluoridation.
- 1 Centers for Disease Control and Prevention, “Ten Great Public Health Achievements–United States, 1900–1999,” MMWR. Morbidity and Mortality Weekly Report 48, no. 12 (1999): 241–243.
- 2 and , “Does Cessation of Community Water Fluoridation Lead to an Increase in Tooth Decay? A Systematic Review,” Community Dentistry and Oral Epidemiology 44, no. 1 (2016): 20–31, https://doi.org/10.1111/cdoe.12151.
- 3 , , and , “The Effect of Water Fluoridation and Social Inequalities on Dental Caries in 5-Year-Old Children,” International Journal of Epidemiology 28, no. 2 (1999): 300–305, https://doi.org/10.1093/ije/28.2.300.
- 4 and , “Fluorides in Dental Public Health Programs,” Dental Clinics of North America 52, no. 2 (2008): 387–401, https://doi.org/10.1016/j.cden.2007.12.002.
- 5 , , and , “The Association Between Community Water Fluoridation and Adult Tooth Loss,” American Journal of Public Health 100, no. 10 (2010): 1980–1985, https://doi.org/10.2105/AJPH.2009.180778.
- 6 , “When Public Action Undermines Public Health: A Critical Examination of Antifluoridationist Literature,” Australia and New Zealand Health Policy 4, no. 1 (2007): 25, https://doi.org/10.1186/1743-8462-4-25.
- 7 and , “Community Water Fluoridation and Health Misinformation: Lessons Learned From Calgary,” Journal of the Canadian Dental Association 80 (2014): e85.
- 8 , , and , “Addressing Health-Related Misinformation on Social Media,” JAMA 320, no. 23 (2018): 2417–2418, https://doi.org/10.1001/jama.2018.16865.
- 9 , , and , “The Persistence and Peril of Misinformation,” American Scientist 105, no. 6 (2017): 372–375, https://doi.org/10.1511/2017.105.6.372rti.org+1Scholars@Duke+1.
- 10 , , and Drinking Water Fluoridation in Calgary Working Group, “Drinking Water Fluoridation and Oral Health Inequities in Canadian Children,” Canadian Journal of Public Health 106, no. 5 (2015): e308–e314, https://doi.org/10.17269/cjph.106.5042.
- 11 Queensland Health, Public Water Fluoridation in Queensland, 2016, https://www.health.qld.gov.au/__data/assets/pdf_file/0030/444810/water-fluoridation-report.pdf.
- 12 , , , and , “Systematic Literature Review on the Spread of Health-Related Misinformation on Social Media,” Social Science & Medicine 240 (2019): 112552, https://doi.org/10.1016/j.socscimed.2019.112552.
- 13 , , , , and , “Fake News Detection on Social Media: A Data Mining Perspective,” ACM Sigkdd Explorations Newsletter 19, no. 1 (2017): 22–36, https://doi.org/10.1145/3137597.3137600.
- 14 , , , et al., “Coronavirus Goes Viral: Quantifying the COVID-19 Misinformation Epidemic on Twitter,” Cureus 12, no. 3 (2020): e7255, https://doi.org/10.7759/cureus.7255.
- 15 , , , et al., “Susceptibility to Misinformation About COVID-19 Around the World,” Royal Society Open Science 7, no. 10 (2020): 201199, https://doi.org/10.1098/rsos.201199.
- 16 Twitter Developer Platform, API Documentation, 2024, https://developer.twitter.com/en/docs/twitter-api.
- 17 , , and , “You Are What You Tweet: Connecting the Geographic Variation in America’s Obesity Rate to Twitter Content,” PLoS One 10, no. 9 (2015): e0133505, https://doi.org/10.1371/journal.pone.0133505.
- 18 American Dental Association, Fluoridation Policy, 2022, https://www.ada.org/en/public-programs/advocating-for-the-public/fluoride-and-fluoridation.
- 19 , , and , Natural Language Processing With Python (O’Reilly Media, 2009).
- 20 , “Interrater Reliability: The Kappa Statistic,” Biochemia Medica 22, no. 3 (2012): 276–282.
- 21 , , , et al., “ Transformers: State-of-the-Art Natural Language Processing,” in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (Association for Computational Linguistics, 2020), 38–45, https://doi.org/10.18653/v1/2020.emnlp-demos.6.
- 22 and , “VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text,” ICWSM 8 (2014): 216–225.
- 23 , , and , “ GloVe: Global Vectors for Word Representation,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Association for Computational Linguistics, 2014), 1532–1543, https://aclanthology.org/D14-1162/.
- 24 , , , and , “SMOTE: Synthetic Minority Over-Sampling Technique,” Journal of Artificial Intelligence Research 16 (2002): 321–357, https://doi.org/10.1613/jair.953.
- 25 and , “The Class Imbalance Problem: A Systematic Study,” Intelligent Data Analysis 6, no. 5 (2002): 429–449.
- 26 and , “The Precision-Recall Plot Is More Informative Than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets,” PLoS One 10, no. 3 (2015): e0118432, https://doi.org/10.1371/journal.pone.0118432.
- 27 and , “A Unified Approach to Interpreting Model Predictions,” Advances in Neural Information Processing Systems 30 (2017): 4765–4774.
- 28 Flask Documentation, 2024, https://flask.palletsprojects.com/.
- 29 , Flask Web Development: Developing Web Applications With Python (O’Reilly Media, 2018).
- 30 and , “A Comprehensive Analysis of COVID-19 Misinformation, Public Health Impacts, and Communication Strategies: Scoping Review,” Journal of Medical Internet Research 26 (2024): e56931, https://doi.org/10.2196/56931.
- 31 , , and , “The COVID-19 Misinformation Challenge in Public Health,” JAMA Health Forum 2, no. 4 (2021): e210338, https://doi.org/10.1001/jamahealthforum.2021.0338.
- 32 , , and , “The Spread of Medical Fake News in Social Media – The Pilot Quantitative Study,” Health Policy and Technology 7, no. 2 (2018): 115–118, https://doi.org/10.1016/j.hlpt.2018.03.002.
- 33 , , , et al., “COVID-19-Related Infodemic and Its Impact on Public Health: A Global Social Media Analysis,” American Journal of Tropical Medicine and Hygiene 103, no. 4 (2020): 1621–1629, https://doi.org/10.4269/ajtmh.20-0812.
- 34 , , , et al., “Health Department Use of Social Media to Identify Foodborne Illness,” MMWR. Morbidity and Mortality Weekly Report 63, no. 32 (2014): 681–685.
- 35 , , and , “Social Media Analysis for Public Health: Use of Twitter for Population-Level Health Research,” JMIR Public Health and Surveillance 2, no. 1 (2016): e27, https://doi.org/10.2196/publichealth.5296.
- 36 , , , et al., “Building a National Neighborhood Dataset From Geotagged Twitter Data for Indicators of Happiness, Diet, and Physical Activity,” JMIR Public Health and Surveillance 3, no. 4 (2017): e64, https://doi.org/10.2196/publichealth.7293.
- 37 U.S. Census Bureau, American Community Survey (ACS), 2023, https://www.census.gov/programs-surveys/acs.
- 38 Centers for Disease Control and Prevention (CDC), Behavioral Risk Factor Surveillance System (BRFSS) , 2022, https://www.cdc.gov/brfss/.
- 39 National Center for Health Statistics (NCHS), National Health and Nutrition Examination Survey (NHANES), 2023, https://www.cdc.gov/nchs/nhanes/.
- 40 , , , , and , What is Health Equity? (Robert Wood Johnson Foundation, 2018), https://www.rwjf.org/en/library/research/2017/05/what-is-health-equity-.html.
- 41 , , and , “The Spread of True and False News Online,” Science 359, no. 6380 (2018): 1146–1151, https://doi.org/10.1126/science.aap9559.
- 42 , , , , and , “Emotion Shapes the Diffusion of Moralized Content in Social Networks,” Proceedings of the National Academy of Sciences 114, no. 28 (2017): 7313–7318, https://doi.org/10.1073/pnas.1618923114.
- 43 , , , et al., “Health Misinformation and the Power of Narratives,” American Journal of Bioethics 19, no. 3 (2019): 5–7, https://doi.org/10.1080/15265161.2019.1572802.
- 44 , “The Role of Health Literacy in Addressing COVID-19 Misinformation in Underserved Populations,” Journal of Health Communication 25, no. 10 (2020): 843–846, https://doi.org/10.1080/10810730.2020.1864520.
- 45 , , and , “Beyond Misinformation: Understanding and Coping With the “Post-Truth” Era,” Journal of Applied Research in Memory and Cognition 6, no. 4 (2017): 353–369, https://doi.org/10.1016/j.jarmac.2017.07.008.
- 46 , “Why Debunking Health Misinformation Often Fails,” Nature 583, no. 7817 (2020): 353, https://doi.org/10.1038/d41586-020-01834-3.
