Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/42000

Title: Predicting Asthma Hospitalizations from Climate and Air Pollution Data: A Machine Learning-Based Approach
Authors: Reis, Jean Souza dos
Duarte, Edicle de Souza Fernandes
Costa, Rafaela Lisboa
Silva, Fabricio Daniel dos Santos
Cortes, Taisa Rodrigues
Coelho, Rachel Helena
Velasco, Sofia Rafaela Maito
Neves, Danielson Jorge Delgado
Filho, José Firmino Sousa
Barreto, Cairo Eduardo Carvalho
Cabral Júnior, Jório Bezerra
Reis, Herald Souza dos
Mendes, Keila Rêgo
Lins, Mayara Christine Correia
Ferreira, Thomás Rocha
Vanderlei, Mário Henrique Guilherme dos Santos
Alonso, Marcelo Felix
Mariano, Glauber Lopes
Gomes, Heliofábio Barros
Gomes, Helber Barros
Keywords: health data analysis
epidemiology
respiratory diseases
predictive modeling
Issue Date: 24-Jan-2025
Publisher: MPDI
Citation: Reis, J.S.d.; Costa, R.L.; Silva, F.D.d.S.; de Souza, E.D.F.; Cortes, T.R.; Coelho, R.H.; Velasco, S.R.M.; Neves, D.J.D.; Sousa Filho, J.F.; Barreto, C.E.C.; et al. Predicting Asthma Hospitalizations from Climate and Air Pollution Data: A Machine Learning-Based Approach. Climate 2025, 13, 23. https://doi.org/10.3390/cli13020023
Abstract: This study explores the predictability of monthly asthma notifications using models built from different machine learning techniques in Maceió, a municipality with a tropical climate located in the northeast of Brazil. Two sets of predictors were combined and tested, the first containing meteorological variables and pollutants, called exp1, and the second only meteorological variables, called exp2. For both experiments, tests were also carried out incorporating lagged information from the time series of asthma records. The models were trained on 80% of the data and validated on the remaining 20%. Among the five methods evaluated—random forest (RF), eXtreme Gradient Boosting (XGBoost), Multiple Linear Regression (MLR), support vector machine (SVM), and K-nearest neighbors (KNN)—the RF models showed superior performance, notably those of exp1 when incorporating lagged asthma notifications as an additional predictor. Minimum temperature and sulfur dioxide emerged as key variables, probably due to their associations with respiratory health and pollution levels, emphasizing their role in asthma exacerbation. The autocorrelation of the residuals was assessed due to the inclusion of lagged variables in some experiments. The results highlight the importance of pollutant and meteorological factors in predicting asthma cases, with implications for public health monitoring. Despite the limitations presented and discussed, this study demonstrates that forecast accuracy improves when a wider range of lagged variables are used, and indicates the suitability of RF for health datasets with complex time series.
URI: https://www.mdpi.com/2225-1154/13/2/23
http://hdl.handle.net/10174/42000
Type: article
Appears in Collections:CREATE - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica

Files in This Item:

File Description SizeFormat
climate-13-00023-v4.pdf2.37 MBAdobe PDFView/Open
FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Dspace Dspace
DSpace Software, version 1.6.2 Copyright © 2002-2008 MIT and Hewlett-Packard - Feedback
UEvora B-On Curriculum DeGois