Application of the K-Means Method in the Analysis of Epidemiological Coefficients of COVID-19 in a Macroregion of Southern Bahia
Keywords:
K-means, Covid-19, Epidemiological coefficientsAbstract
The COVID-19 pandemic, caused by the SARS-CoV-2 coronavirus, reached Brazil in 2020, with unique impacts in each region. In the South Regional Health Center (NRS South) of Bahia, cities experienced the pandemic in a distinct manner, which prompted a cluster analysis to group these locations based on similarities in epidemiological coefficients, such as incidence, prevalence, and lethality. Data were obtained from the Health Department of the State of Bahia (SESAB) and segmented into three distinct chronological periods. The cluster analysis revealed the stratification of cities, highlighting specific patterns of the pandemic's behavior in each group. This innovative approach provides valuable insights to guide strategic decisions, identifying both the locations that required the implementation of specific measures and those where such actions have already proven effective. Thus, the stratification of incidence, prevalence, and lethality by city in the NRS Sul of Bahia emerges as an additional tool for understanding and combating COVID-19 in this region.
Downloads
References
ALVES, H. J. P.; FERNANDES, F. A.; LIMA, K. P.; BATISTA, B. D. O.; FERNANDES, T. J. The COVID-19 pandemic in Brazil: an application of the k-means clustering method. Research, Society and Development, [S. l.], v. 9, n. 10, p. e5829109059, 2020. DOI: https://doi.org/10.33448/rsd-v9i10.9059.
DE SOUZA, W.M., BUSS, L.F., CANDIDO, D.D.S., et al. Epidemiological and clinical characteristics of the COVID-19 epidemic in Brazil. Nature Human Behaviour, Londres, v. 4, n. 8, p. 856-865, 2020. DOI: https://doi.org/10.1038/s41562-020-0928-4
FÁVERO, L. P.; BELFIORE, P. Data Science for Business and Decision Making. Cambridge: Academic Press, 2019.
FERREIRA, D. F. Estatística multivariada 3. ed. Lavras: Ed. UFLA, 2018.
FONTANA, A.; NALDI, M. C. Estudo de comparação de métodos para estimação de números de grupos em problemas de agrupamento de dados. São Paulo: Universidade de São Paulo, 2009. (Relatório Técnico, n. 340).
GOHARI, K., KAZEMNEJAD, A., SHEIDAEI, A. et al. Clustering of countries according to the COVID-19 incidence and mortality rates. BMC Public Health, Londres, v. 22, n. 632, 2022. DOI: https://doi.org/10.1186/s12889-022-13086-z.
HASTIE, T.; TIBSHIRANI, R.;FRIEDMAN J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2. ed. Nova York: Springer, 2009. DOI: https://doi.org/10.1007/978-0-387-84858-7
MAKIH, A.; TARIGAN, A. Analysis of Covid-19 Case Data in Classification of the Distribution in DKI Jakarta Using Algorithm Method K-Means Clustering. International Research Journal of Advanced Engineering and Science, [S. l.], v. 6, n. 2, p. 154-157, 2021.
MENEZES, A. M. B. Noções básicas de epidemiologia. Porto Alegre: Artmed, 2001. v. 1. p. 15-30.
MUHTASIM; MASUD, M. A. Clustering Countries on COVID-19 Data among Different Waves Using K-Means Clustering. Journal of Computer and Communications, [S. l.], v. 11, p. 1-14, 2023. DOI: https://doi.org/10.4236/jcc.2023.117001.
OLIVEIRA, R. R. et al. Evolução da COVID-19 no Brasil e suas implicações para políticas públicas de saúde. Revista Panamericana de Salud Pública, Washington, v. 45, p. e47, 2021. DOI: 10.26633/RPSP.2021.47. Disponível em: https://iris.paho.org/handle/10665.2/53918. Acesso em: 12 maio 2025.
ORGANIZAÇÃO MUNDIAL DA SAÚDE. Relatório sobre a situação global da COVID-19. Genebra: OMS, 2020. Disponível em: https://www.who.int/emergencies/diseases/novel-coronavirus-2019. Acesso em: 13 maio 2025.
ROTHMAN, K. J.; GREENLAND, S.; LASH, T. L. Epidemiologia moderna. 3. ed. Porto Alegre: Artmed, 2008.
SANTOS, J. A. et al. Challenges and Strategies for COVID-19 Control in Bahia, Brazil. Journal of Public Health Research, [S. l.], v. 11, n. 4, p. 2430, 2022.
