Abstract
In this article, a graph-theoretic method for supervised feature selection using matrix exponential of pairwise correlation value has been illustrated. In machine learning, high-dimensional data sets have a enormous number of redundant and irrelevant features. The sum of mean and standard deviation of exponential matrix has been set as the threshold for selecting relevant features. Principles of vertex cover and independent set have then been used to remove redundant features. In the next step, mutual information value has been used to select relevant features that were initially rejected. The results show that this method has performed better than the benchmark algorithms when experimented on multiple standard data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
E.R. Dougherty, J. Hua, W. Tembe, Performance of feature-selection methods in the classification of high-dimension data. Pattern Recogn. 42, 409–424 (2009)
M. Dash, H. Liu, Feature selection for classification. Intell. Data Anal. 1, 131156 (1997)
G.H. John, R. Kohavi, Wrappers for feature subset selection. Artif. Intell. 97, 273–324 (1997)
P.N. Koch, T.W. Simpson, J.K. Allen, F. Mistree, Statistical approximations for multidisciplinary design optimization: the problem of size. J. Aircr. 36(1), 275286 (1999)
G. Li, S.-W. Wang, C. Rosenthal, H. Rabitz, High dimensional model representations generated from low dimensional data samples. I. mp-Cut-HDMR. J. Math. Chem. 30(1), 130 (2001b)
G.H. John, R. Kohavi, K. Peger, Irrelevant features and the subset selection problem, in ICML (1994)
S.E. Schaeffer, Graph clustering. Comput. Sci. Rev. 1, 2764 (2007)
Z. Zhang, E.R. Hancock, A graph-based approach to feature selection, in GBRPR (2011)
S. Bandyopadhyay, T. Bhadra, P. Mitra, U. Maulik, Integration of dense subgraph nding with feature clustering for unsupervised feature selection. Pattern Recogn. Lett. 40, 104112 (2014)
Q. Song, J. Ni, G. Wang, A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans. Knowl. Data Eng. 25, 114 (2013)
P. Moradi, M. Rostami, A graph theoretic approach for unsupervised feature selection. Eng. Appl. Artif. Intell. 44, 3345 (2015a)
A. Strehl, J. Ghosh, Cluster ensembles a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3, 583617 (2002)
T.M. Cover, J.A. Thomas, Elements of Information Theory (Wiley, New York, USA, 2012)
M. Lichman, UCI machine learning repository (2013). http://archive.ics.uci.edu/ml
M.A. Hall, Correlation-based feature selection for machine learning
H. Peng, F. Long, C.H. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1226–1238 (2005)
A.K. Das, S. Goswami, B. Chakraborty, A. Chakrabarti, A graph-theoretic approach for visualization of data set feature association, in ACSS (2016)
M. Dash, H. Liu, Feature selection for clustering, in PAKDD (2000)
N. Deo, Graph Theory with Applications to Engineering and Computer Science (PHI, 1979)
S. Goswami, A. Chakrabarti, B. Chakraborty, An efficient feature selection technique for clustering based on a new measure of feature importance. J. Intell. Fuzzy Syst. 112
S. Goswami, A.K. Das, A. Chakrabarti, B. Chakraborty, A feature cluster taxonomy based feature selection technique. Expert Syst. Appl. 79, 7689 (2017)
P. Moradi, M. Rostami, Integration of graph clustering with ant colony optimization for feature selection. Knowl. Based Syst. 84, 144161 (2015b)
Z. Zhang, E.R. Hancock, Hypergraph based information-theoretic feature selection. Pattern Recogn. Lett. 33, 19911999 (2012)
E. Estrada, J.A. Rodguez-Velzquez, Subgraph centrality in complex networks. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 71(5 Pt 2), 056103 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, G., Jain, G., Panday, M., Das, A.K., Goswami, S. (2020). Graph-Based Supervised Feature Selection Using Correlation Exponential. In: Mandal, J., Bhattacharya, D. (eds) Emerging Technology in Modelling and Graphics. Advances in Intelligent Systems and Computing, vol 937. Springer, Singapore. https://doi.org/10.1007/978-981-13-7403-6_4
Download citation
DOI: https://doi.org/10.1007/978-981-13-7403-6_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-7402-9
Online ISBN: 978-981-13-7403-6
eBook Packages: EngineeringEngineering (R0)