|
|
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/42000
|
| Title: | Predicting Asthma Hospitalizations from Climate and Air Pollution Data: A Machine Learning-Based Approach |
| Authors: | Reis, Jean Souza dos Duarte, Edicle de Souza Fernandes Costa, Rafaela Lisboa Silva, Fabricio Daniel dos Santos Cortes, Taisa Rodrigues Coelho, Rachel Helena Velasco, Sofia Rafaela Maito Neves, Danielson Jorge Delgado Filho, José Firmino Sousa Barreto, Cairo Eduardo Carvalho Cabral Júnior, Jório Bezerra Reis, Herald Souza dos Mendes, Keila Rêgo Lins, Mayara Christine Correia Ferreira, Thomás Rocha Vanderlei, Mário Henrique Guilherme dos Santos Alonso, Marcelo Felix Mariano, Glauber Lopes Gomes, Heliofábio Barros Gomes, Helber Barros |
| Keywords: | health data analysis epidemiology respiratory diseases predictive modeling |
| Issue Date: | 24-Jan-2025 |
| Publisher: | MPDI |
| Citation: | Reis, J.S.d.; Costa, R.L.; Silva, F.D.d.S.; de Souza, E.D.F.; Cortes, T.R.; Coelho, R.H.; Velasco, S.R.M.; Neves, D.J.D.; Sousa Filho, J.F.; Barreto, C.E.C.; et al. Predicting Asthma Hospitalizations from Climate and Air Pollution Data: A Machine Learning-Based Approach. Climate 2025, 13, 23. https://doi.org/10.3390/cli13020023 |
| Abstract: | This study explores the predictability of monthly asthma notifications using models built from different machine learning techniques in Maceió, a municipality with a tropical climate located in the northeast of Brazil. Two sets of predictors were combined and tested, the first containing meteorological variables and pollutants, called exp1, and the second only meteorological variables, called exp2. For both experiments, tests were also carried out incorporating lagged information from the time series of asthma records. The models were trained on 80% of the data and validated on the remaining 20%. Among the five methods evaluated—random forest (RF), eXtreme Gradient Boosting (XGBoost), Multiple Linear Regression (MLR), support vector machine (SVM), and K-nearest neighbors (KNN)—the RF models showed superior performance, notably those of exp1 when incorporating lagged asthma notifications as an additional predictor. Minimum temperature and sulfur dioxide emerged as key variables, probably due to their associations with respiratory health and pollution levels, emphasizing their role in asthma exacerbation. The autocorrelation of the residuals was assessed due to the inclusion of lagged variables in some experiments. The results highlight the importance of pollutant and meteorological factors in predicting asthma cases, with implications for public health monitoring. Despite the limitations presented and discussed, this study demonstrates that forecast accuracy improves when a wider range of lagged variables are used, and indicates the suitability of RF for health datasets with complex time series. |
| URI: | https://www.mdpi.com/2225-1154/13/2/23 http://hdl.handle.net/10174/42000 |
| Type: | article |
| Appears in Collections: | CREATE - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|