Pakistan Science Abstracts
Article details & metrics
No Detail Found!!
Advanced Hybrid Feature Selection Using Harvest Algorithm, Convolutional Neural Networks and Cfs
Author(s):
1. Preethi K: Department of Computer Applications, School of Information Technology, Madurai Kamaraj University, Madurai, Tamilnadu, India
2. Ramakrishnan M: Department of Computer Applications, School of Information Technology, Madurai Kamaraj University, Madurai, Tamilnadu, India.
Abstract:
Predictive modeling and data analysis are severely hampered by the enormous dimensionality and complexity of medical datasets. Enhancing model performance, interpretability, and computing efficiency all depend on careful feature selection. In order to extract the most pertinent characteristics from the Medical Information Mart for Intensive Care III (MIMIC-III) dataset, this paper offers a novel hybrid feature selection method that combines the Harvest Algorithm, Convolutional Neural Networks (CNN), and Correlation-Based Feature Selection (CFS). De-identified medical records, including diagnostic codes, vital signs, prescriptions, and other clinical observations, are all included in the MIMIC-III dataset. Our suggested approach makes use of CNN for deep feature extraction, the Harvest Algorithm for the first feature subset creation, and CFS for the final feature selection based on correlation measures. Test findings show that when compared to conventional feature selection techniques Random Forest with Information Gain Method (RF-IG) and SVM with Recursive Feature Elimination (SVM-RFE), our hybrid strategy greatly increases the accuracy and efficiency of prediction models. The chosen characteristics demonstrate the potential of our approach in clinical decision support and medical data analysis by offering significant insights into important variables influencing patient outcomes.
Page(s): 268-289
DOI: DOI not available
Published: Journal: International Journal of Communication Networks and Information Security, Volume: 16, Issue: S1, Year: 2024
Keywords:
Feature selection , convolutional neural networks CNN , MIMICIII Dataset , Hybrid Methods , Medical Data Analysis , CorrelationBased Feature Selection CFS , Harvest Algorithm
References:
[1] Liu H.,Motoda H.,Yu L.,Zhao Z.,Dash M. .2022 .Applications. IEEE Transactions on Systems, Man, and Cybernetics, 46(3) : 344-353.
[2] Wu L.,Duan Y.,Zhang Y.,Wang F. .2023 ."Feature selection with dynamic mutual information,". IEEE Transactions on Neural Networks and Learning Systems, 33(1) : 47-58.
[3] Johnson A. E. W.,Pollard T. J.,Shen L.,Lehman L.,Feng M.,Ghassemi M.,Mark R. G. .2016 .MIMIC-III, a freely accessible critical care database. Scientific Data, 3(1) : 1-9.
[4] Harutyunyan H.,Khachatrian H.,Kale D. C.,Ver Steeg G.,Galstyan A. .2019 .Multitask learning and benchmarking with clinical time series data. Scientific Data, 6(1) : 1-18.
[5] Hussain K.,Mohd Salleh,Shi S.,Y. S. .2019 .Metaheuristic research: a comprehensive survey. Artificial Intelligence Review, 52(4) : 2191-2233.
[6] Shahrzad M.,Moradi P. .2022 .A novel harvest-based optimization algorithm for feature selection in high-dimensional data. Journal of Information Science, 49(2) : 194-213.
[7] Gao P.,Zhang Q.,Wang F.,Xiao L.,Fujita H.,Zhang Y. .2020 .Siamese attentional keypoint network for high performance visual tracking. Knowledge-Based Systems, 193 : 105448.
[8] Abdelpakey M. H.,Shehata M. S. .2019 .DP-Siam: Dynamic Policy Siamese Network for robust object tracking. IEEE Transactions on Image Processing, 28(11) : 5596-5609.
[9] Wu X.,Kumar V.,Quinlan J. R.,Ghosh J.,Yang Q.,Motoda H.,Steinberg D. .2008 .Top 10 algorithms in data mining. Knowledge and Information Systems, 14(1) : 1-37.
[10] Hall M. A.,Smith L. A. .1998 .Practical feature subset selection for machine learning. Computer Science, 98(6) : 181-191.
[11] Guyon I.,Elisseeff A. .2003 .“An introduction to variable and feature selection”. Journal of Machine Learning Research, 3 : 1157-1182.
[12] Sun S.,Zhang C.,D. C. .2019 .“Review of hybrid methods for feature selection”. IEEE Transactions on Neural Networks and Learning Systems, 30(11) : 2903-2916.
[13] Fathian M.,Amiri B.,Maroosi A. .2007 .”Application of honey-bee mating optimization algorithm on clustering”. Applied Mathematics and Computation, 190(2) : 1502-1513.
[14] LeCun Y.,Bengio Y.,Hinton G. .2015 .Deep learning. Nature, 521(7553) : 436-444.
[15] Hall M. A. .1999 .Correlation-based feature selection for machine learning. Doctoral dissertation, : .
[16] Peng H.,Long F.,Ding C. .2005 .Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(8) : 1226-1238.
[17] Johnson A. E. W.,Ghassemi M. M.,Nemati S.,Niehaus K. E.,Clifton D. A.,Clifford G. D. .2016 .Machine learning and decision support in critical care. Proceedings of the IEEE, 104(2) : 444-466.
[18] Vieira S. M.,Mendonça L. F.,Farinha G. J.,Sousa J. M. C. .2013 .Modified binary PSO for feature selection using SVM applied to mortality prediction of septic patients. Applied Soft Computing, 13(8) : 3494-3504.
[19] Liaw A.,Wiener M. .2002 .Classification and regression by randomForest. R news, 2(3) : 18-22.
[20] Pal M. .2005 .Random forest classifier for remote sensing classification. International Journal of Remote Sensing, 26(1) : 217-222.
[21] Guyon I.,Weston J.,Barnhill S.,Vapnik V. .2002 .Gene selection for cancer classification using support vector machines. Machine Learning, 46(1-3) : 389-422.
[22] Duan K. B.,Rajapakse J. C.,Wang H.,Azuaje F. .2005 .Multiple SVM -RFE for gene selection in cancer classification with expression da ta. IEEE Transactions on Nanobioscience, 4(3) : 228-234.
[23] Rakotomamonjy A. .2003 .Variable selection using SVM-based criteria. Journal of Machine Learning Research, 3 : 1357-1370.
[24] Bennett K. P.,Campbell C. .2000 .Support vector machines: Hype or hallelujah?. ACM SIGKDD Explorations Newsletter, 2(2) : 1-13.
[25] Joachims T. .1998 .Text categorization with support vector machines: Learning with many relevant features. European Conference on Machine Learning, : 137-142.
[26] Johnson A. E. W.,Pollard T. J.,Shen L.,Lehman L. W.,Feng M.,Ghassemi M.,Mark R. G. .2016 .MIMIC-III, a freely accessible critical care database. Scientific Data, 3 : 160035.
Citations
Citations are not available for this document.
0

Citations

0

Downloads

6

Views