Leveraging Big Data Analytics to Identify and Address Health Insurance Enrollment Disparities Among Vulnerable Populations: A Machine Learning Framework
DOI:
https://doi.org/10.47941/ijce.3026Keywords:
Health Insurance Disparities, Machine Learning, Social Determinants of Health, Medicaid Enrollment, Predictive AnalyticsAbstract
Despite ongoing healthcare reforms and Medicaid expansion, significant disparities persist in health insurance enrollment among underserved populations. This article presents a big data–driven framework that identifies and addresses enrollment gaps by integrating multiple datasets—healthcare claims, U.S. Census demographics, and social determinants of health (SDOH)—to produce actionable insights at granular geographic levels. Using unsupervised machine learning techniques, including k-means and hierarchical clustering, the framework uncovers hidden patterns of under-enrollment across ZIP codes in Medicaid expansion states. Factors such as limited digital access, language barriers, and low health literacy demonstrate statistical correlations with reduced insurance uptake. The framework employs predictive modeling to forecast communities with the highest risk of continued under-enrollment based on historical and demographic trends. Data-informed interventions are proposed, including multilingual outreach programs, mobile enrollment units, and culturally competent assistance initiatives, with potential impact evaluated through simulation models and ROI forecasting under policy-adjusted scenarios. Built on scalable, privacy-preserving architecture incorporating de-identification standards and role-based access controls, the framework integrates with cloud-native platforms such as Databricks and AWS for real-time data processing and visualization. This work demonstrates how AI and big data analytics can drive policy innovation, resource optimization, and health equity, offering public health officials, Medicaid administrators, and data strategists’ evidence-based solutions for improving healthcare access across socioeconomically disadvantaged populations.
Downloads
References
Benjamin D. Sommers, et al. "Closing Gaps or Holding Steady? The Affordable Care Act, Medicaid Expansion, and Racial Disparities in Coverage, 2010–2021." Journal of Health Politics, Policy and Law, vol. 50, no. 2, April 01, 2025, pp. 253-290. https://read.dukeupress.edu/jhppl/article/50/2/253/391182/Closing-Gaps-or-Holding-Steady-The-Affordable-Care
Yilu Lin, et al. "Effects of Medicaid Expansion on Poverty Disparities in Health Insurance Coverage." International Journal for Equity in Health, vol. 20, no. 17, July 26, 2021. https://equityhealthj.biomedcentral.com/articles/10.1186/s12939-021-01486-3
Agung Dwi Laksono, et al. "Regional Inequalities of National Health Insurance Enrollment in Indonesia." Journal of Public Health and Development, vol. 23, no. 2, April 30, 2025. https://he01.tci-thaijo.org/index.php/AIHD-MU/article/view/272460
Sohail Imran, et al. "Big Data Analytics in Healthcare — A Systematic Literature Review and Roadmap for Practical Implementation." IEEE/CAA Journal of Automatica Sinica, vol. 8, no. 1, January 2021, pp. 1-22. https://www.ieee-jas.net/article/doi/10.1109/JAS.2020.1003384
Sandro Fiore, et al. "A Big Data Analytics Framework for Scientific Data Management." 2013 IEEE International Conference on Big Data, December 23, 2013, pp. 1-8. https://ieeexplore.ieee.org/document/6691720
Yuuki Tachioka. "Privacy Preservation Satisfying Utility Requirements Based on Multi-Objective Optimization." 2022 Joint 12th International Conference on Soft Computing and Intelligent Systems, November 29 - December 02, 2022. https://ieeexplore.ieee.org/document/10002081
Ketan Rajshekhar Shahapure; Charles Nicholas. "Cluster Quality Analysis Using Silhouette Score." 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), 20 November 2020. https://ieeexplore.ieee.org/document/9260048
Jason Starr; Morgan Kain. "Agent-Based Simulation of Social Determinants of Health for Equitable COVID-19 Intervention." 2022 7th International Conference on Intelligent Informatics and Biomedical Science (ICIIBMS), 12 December 2022. https://ieeexplore.ieee.org/document/9971638/references#references
Rohrer. "Maximizing Simulation ROI with AutoMod." Proceedings of the 2003 Winter Simulation Conference, January 30, 2004. https://ieeexplore.ieee.org/document/1261425/citations#citations
Sri Mulyati, et al. "Stunting Incidence Segmentation: A Cluster Analysis Approach and Targeted Intervention Strategies." IIETA Journal of Computational Methods in Engineering, March 16, 2025. https://iieta.org/journals/ijcmem/paper/10.18280/ijcmem.130113
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Narendra Reddy Mudiyala

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution (CC-BY) 4.0 License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.