@Article{info:doi/10.2196/65001, author="Huang, Tracy and Ngan, Chun-Kit and Cheung, Yin Ting and Marcotte, Madelyn and Cabrera, Benjamin", title="A Hybrid Deep Learning--Based Feature Selection Approach for Supporting Early Detection of Long-Term Behavioral Outcomes in Survivors of Cancer: Cross-Sectional Study", journal="JMIR Bioinform Biotech", year="2025", month="Mar", day="13", volume="6", pages="e65001", keywords="machine learning; data driven; clinical domain--guided framework; survivors of cancer; cancer; oncology; behavioral outcome predictions; behavioral study; behavioral outcomes; feature selection; deep learning; neural network; hybrid; prediction; predictive modeling; patients with cancer; deep learning models; leukemia; computational study; computational biology", abstract="Background: The number of survivors of cancer is growing, and they often experience negative long-term behavioral outcomes due to cancer treatments. There is a need for better computational methods to handle and predict these outcomes so that physicians and health care providers can implement preventive treatments. Objective: This study aimed to create a new feature selection algorithm to improve the performance of machine learning classifiers to predict negative long-term behavioral outcomes in survivors of cancer. Methods: We devised a hybrid deep learning--based feature selection approach to support early detection of negative long-term behavioral outcomes in survivors of cancer. Within a data-driven, clinical domain--guided framework to select the best set of features among cancer treatments, chronic health conditions, and socioenvironmental factors, we developed a 2-stage feature selection algorithm, that is, a multimetric, majority-voting filter and a deep dropout neural network, to dynamically and automatically select the best set of features for each behavioral outcome. We also conducted an experimental case study on existing study data with 102 survivors of acute lymphoblastic leukemia (aged 15-39 years at evaluation and >5 years postcancer diagnosis) who were treated in a public hospital in Hong Kong. Finally, we designed and implemented radial charts to illustrate the significance of the selected features on each behavioral outcome to support clinical professionals' future treatment and diagnoses. Results: In this pilot study, we demonstrated that our approach outperforms the traditional statistical and computation methods, including linear and nonlinear feature selectors, for the addressed top-priority behavioral outcomes. Our approach holistically has higher F1, precision, and recall scores compared to existing feature selection methods. The models in this study select several significant clinical and socioenvironmental variables as risk factors associated with the development of behavioral problems in young survivors of acute lymphoblastic leukemia. Conclusions: Our novel feature selection algorithm has the potential to improve machine learning classifiers' capability to predict adverse long-term behavioral outcomes in survivors of cancer. ", issn="2563-3570", doi="10.2196/65001", url="https://bioinform.jmir.org/2025/1/e65001", url="https://doi.org/10.2196/65001", url="http://www.ncbi.nlm.nih.gov/pubmed/40080820" }