Abstract : Given the prevalence of extremely complex collections of data, feature selection—one of the essential challenges of machine learning—has received greater emphasis. It shows the vital aspects of a specific problem. Traditionally, it has been used to solve many different issues in fields such as banking (e.g., detecting fraud), surveillance systems, healthcare institutions (e.g., cancer detection), and biological data processing. It helps to get rid of all irrelevant variables and boosts classifier efficiency and precision. Initially, the model's construction should be made simpler by having fewer parameters, taking less time to train, improving applicability to prevent excessive filling, and staying away from the curse known as dimensionality. Filter methods have significance over feature selection since they can be used with any kind of machine-learning model and significantly shorten the time it takes for machine-learning algorithms to run. The assessments are intended to review the operation of various filter methods, evaluate how effectively they perform in terms of run time and estimated efficiency, and offer recommendations for usage.
Keywords : Feature selection, Filter method, Variance Threshold, SelectKBest, Information Gain, Select Percentile.
Cite : Kumar, A., & Gouri, M. H. (2023). An Approach To Using Feature Selection For Classification And Regression Dilemma Through Filter Method (1st ed., p. 57). Noble Science Press. https://doi.org/10.52458/9789388996747.nsp2023.eb.ch-08
References :
B. Venkatesh, J. Anuradha (2019), A Review of Feature Selection and Its Methods, Bulgarian Academy of Sciences, Cybernetics and Information Technologies, Volume 19.
Anna Karen Garate-Escamila, Amir Hajjam El Hassani, Emmanuel Andres (2020), Classification models for heart disease prediction using feature selection and PCA, Informatics in Medicine Unlocked, ScienceDirect,
Elsevier Xuelian Deng, Yuqing Li, Jian Weng, Jilian Zhang (2018), Feature selection for text classification: A review, Multimed Tools Appl, Springer.
Hongfang Zhou, Xiqian Wang, Rourou Zhu (2021), Feature selection based on mutual information with correlation coefficient, Applied Intelligence, Springer
Andrea Bommert, Xudong Sun, Bernd Bischl, Jörg Rahnenführer, Michel Lang, (2019), Benchmark for filter methods for feature selection in high-dimensional classification data, Computational Statistics and Data Analysis, ScienceDirect, Elsevier.
Haoyue Liu, Student Member, MengChu Zhou, Qing Gary Liu (2019), An Embedded Feature Selection Method for Imbalanced Data Classification, IEEE/CAA Journal of Automatica Sinica.
Beatriz Remeseiroa, Veronica Bolon-Canedob (2109), A review of feature selection methods in medical applications, Computers in Biology and Medicine, ScienceDirect, Elsevier.
Rung?Ching Chen, Christine Dewi, Su?Wen Huang, Rezzy Eko Caraka (2020), Selecting critical features for data classification based on machine learning methods, Journal of Big Data, Springer Open.
Rui Zhang, Feiping Nie, Xuelong Li, Xian Wei (2018), Feature Selection with Multiview Data: A Survey, Information Fusion, ScienceDirect, Elsevier.
Kamal A. ElDahshan, Abdullah A. AlHabshy and Luay Thamer Mohammed (2022), Filter and Embedded Feature Selection Methods to Meet Big Data Visualization Challenges, Computers, Materials & Continua.
Saul Solorio-Fernandez, J. Ariel Carrasco-Ochoa, Jose Fco. Martínez-Trinidad (2019), A review of unsupervised feature selection methods, Artificial Intelligence Review, Springer.
Veronica Bolon-Canedo, Amparo Alonso-Betanzos (2019), Ensembles for feature selection: A review and future trends, Information Fusion, ScienceDirect, Elsevier.
Utkarsh Mahadeo Khaire, R. Dhanalakshmi (2019), Stability of feature selection algorithm: A review, Journal of King Saud University –Computer and Information Sciences, ScienceDirect.
Gang Koua, Pei Yanga, Yi Pengb, Feng Xiaoa, Yang Chena, Fawaz E. Alsaadic (2019), Evaluation of feature selection methods for text classification with small datasets using multiple criteria decision-making methods, Applied Soft Computing Journal.
Mohammad Pourhomayoun, Mahdi Shakibi (2021), Predicting mortality risk in patients with COVID-19 using machine learning to help medical decision-making, Smart Health, ScienceDirect, Elsevier.