|
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/32472
|
Title: | Analyzing the Performance of Feature Selection on Regression Problems: A Case Study on Older Adults’ Functional Profile |
Authors: | Rojo, Javier Pinho, Lara Fonseca, César Lopes, Manuel Helal, Sumi Hernández, Juan Garcia-Alonso, Jose Murillo, Juan Manuel |
Keywords: | aging informatics ehealth Feature selection regression healthcare data analytic machine learning |
Issue Date: | 17-Jun-2022 |
Publisher: | IEEE Transactions on Emerging Topics in Computing |
Citation: | Rojo, J., Pinho, L.G., Fonseca, C., Lopes, M.J., Helal, A., Hernández, J., Murillo, J., Garcia-Alonso, J. (2022). Analyzing the Performance of Feature Selection on Regression Problems: a Case Study on Older Adults' Functional Profile. IEEE Transactions on Emerging Topics in Computing. https://doi.org/10.1109/TETC.2022.3181679 |
Abstract: | Healthcare systems are capable of collecting a significant number of patient health-related parameters. Analyzing them to find the reasons that cause a given disease is challenging. Feature Selection techniques have been used to address this issue—reducing these parameters to a smaller set with the most ”determinant” information. However, existing proposals usually focus on classification problems—aimed to detect whether a person is or is not suffering from an illness or from a finite set of illnesses. However, there are many situations in which health professionals need a numerical assessment to quantify the severity of an illness, thus dealing with a regression problem instead. Proposals using Feature Selection here are very limited. This paper examines several Feature Selection techniques to gauge their applicability to the regression-type problems, comparing these techniques by applying them to a real-life scenario on the functional profiles of older adults. Data from 829 functional profiles assessments in 49 residential homes were used in this study. The number of features was reduced from 31 to 25—with a correlation between inputs and outputs of 0.99 according to the R2 score and a Mean Square Error (MSE) of 0.11—or to 14 features—with a correlation of 0.98 and MSE of 5.73. |
URI: | 10.1109/TETC.2022.3181679 http://hdl.handle.net/10174/32472 |
Type: | article |
Appears in Collections: | CHRC - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|