Abstract
The goal of the study is to increase the computation efficiency of the face recognition that uses feature vectors to describe facial images on photos and videos. These high-dimensional feature vectors are nowadays produced by convolutional neural networks. The methods to aggregate the features generated for each video frame are used to process the video sequences. A novel hierarchical recognition algorithm is proposed. In contrast to known approaches our algorithm seeks the nearest neighbors only among reference images of most reliable classes selected at the preceding stage to carry out the sequential analysis of a more detailed description level (with a greater dimensionality of the feature vector). At each stage principal components are compared, the number of the components being chosen according to a given portion of explained variations. Datasets like Labeled Faces in the Wild, YouTubeFaces, IARPA Janus Benchmark C and different neural-net face descriptors are used to compare the algorithm with other methods. In contrast with the conventional nearest-neighbor method, the proposed approach is shown to allow a 2- to 16-times reduction of the classifier running time.
Similar content being viewed by others
REFERENCES
Savchenko, A.V., Deep neural networks and maximum likelihood search for approximate nearest neighbor in video-based image recognition, Opt. Mem. Neural Networks, 2017, vol. 26, no. 2, pp. 129–136.
Kharchevnikova, A.S. and Savchenko, A.V., Neural networks in video-based age and gender recognition on mobile platforms, Opt. Mem. Neural Networks, 2018, vol. 27, no. 4, pp. 246–259.
Goodfellow, I., Bengio, Y., and Courville, A., Deep Learning, MIT Press, 2016.
Goldberger, J., et al., Neighbourhood components analysis, Advances in Neural Information Processing Systems(NIPS), 2005, pp. 513–520.
Yao, Y., Granular computing and sequential three-way decisions, in Proceedings of International Conference on Rough Sets and Knowledge Technology, Springer, 2013, pp. 16–27.
Savchenko, A.V., Sequential three-way decisions in multi-category image recognition with deep features based on distance factor, Inf. Sci., 2019, vol. 489, pp. 18–36.
Karpathy, A., et al., Large-scale video classification with convolutional neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, pp. 1725–1732.
Rawat, W. and Wang, Z., Deep convolutional neural networks for image classification: A comprehensive review, Neural Comput., 2017, vol. 29, no. 9, pp. 2352–2449.
Chan, T.H., et al., PCANet: A simple deep learning baseline for image classification, IEEE Trans. Image Process., 2015, vol. 24, no. 12, pp. 5017–5032.
Jolliffe, I.T. and Cadima, J., Principal component analysis: A review and recent developments, Philos. Trans. R. Soc., A, 2016, vol. 374, no. 2065.
Muja, M. and Lowe, D.G., Fast approximate nearest neighbors with automatic algorithm configuration, VISAPP, 2009, vol. 2, no. 2, pp. 331–340.
Hajebi, K., et al., Fast approximate nearest-neighbor search with k-nearest neighbor graph, Proceedings of Twenty-Second International Joint Conference on Artificial Intelligence, 2011.
Malkov, Y.A. and Yashunin, D.A., Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs, IEEE Trans. Pattern Anal. Mach. Intell., 2018.
Chen, L., Yang, H., and Gao, Z., Joint attentive spatial-temporal feature aggregation for video-based person re-identification, IEEE Access, 2019, vol. 7, pp. 41230–41240.
Sokolova, A.D., Kharchevnikova, A.S., and Savchenko, A.V., Organizing multimedia data in video surveillance systems based on face verification with convolutional neural networks, Proceedings of International Conference on Analysis of Images, Social Networks and Texts (AIST), Cham: Springer, 2017, pp. 223–230.
Hara, K., Saito, D., and Shouno, H., Analysis of function of rectified linear unit used in deep learning, Proceedings of International Joint Conference on Neural Networks (IJCNN), IEEE, 2015, pp. 1–8.
Kullback, S., Information Theory and Statistics, Courier Corporation, 1997.
Savchenko, A.V., Belova, N.S., and Savchenko, L.V., Fuzzy analysis and deep convolution neural networks in still-to-video recognition, Opt. Mem. Neural Networks, 2018, vol. 27, no. 1, pp. 23–31.
Lowe, D.G., Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vision, 2004, vol. 60, no. 2, pp. 91–110.
Parkhi, O.M., Vedaldi, A., and Zisserman, A., Deep face recognition, Proc. Br. Mach. Vision Conf., 2015, vol. 1, no. 3, p. 6.
Wu, X., He, R., Sun, Z., and Tan, T., A light CNN for deep face representation with noisy labels, IEEE Trans. Inf. Forensics Secur., 2018, vol. 13, no. 11, pp. 2884–2896.
Cao, Q., Shen, L., **e, W., Parkhi, O.M., and Zisserman, A., VGGface2: A dataset for recognising faces across pose and age, Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2018, pp. 67–74.
Schroff, F., Kalenichenko, D., and Philbin, J., FaceNet: A unified embedding for face recognition and clustering, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 815–823.
Savchenko, A.V., Granular computing and sequential analysis of deep embeddings in fast still-to-video face recognition, Proceedings of the 12th International Symposium on Applied Computational Intelligence and Informatics (SACI), IEEE, 2018, pp. 515–520.
Huang, G.B., et al., Labeled faces in the wild: A database for studying face recognition in unconstrained environments, Proceedings of the Workshop on Faces in 'Real-Life’ Images: Detection, Alignment, and Recognition, 2008.
Wolf, L., Hassner, T., and Maoz, I., Face recognition in unconstrained videos with matched background similarity, Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2011, pp. 529–534.
Maze, B., et al., IARPA Janus Benchmark-C: Face dataset and protocol, Proceedings of the International Conference on Biometrics (ICB), IEEE, 2018, pp. 158–165.
Li, D. and Tian, Y., Survey and experimental study on metric learning methods, Neural Networks, 2018, vol. 105, pp. 447–462.
Savchenko, A.V., Probabilistic neural network with complex exponential activation functions in image recognition, IEEE Trans. Neural Networks Learn. Syst., 2020, vol. 31, no. 2, pp. 651–660.
Koziarski, M. and Cyganek, B., Impact of low resolution on image recognition with deep neural networks: An experimental study, Int. J. Appl. Math. Comput. Sci., 2018, vol. 28, no. 4, pp. 735–744.
Funding
This research was prepared within the framework of the Academic Fund Program at the National Research University Higher School of Economics (HSE University) in 2019 (grant no. 19-04-004) and within the framework of the Russian Academic Excellence Project “5-100”.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
The authors declare that they have no conflicts of interest.
About this article
Cite this article
Sokolova, A.D., Savchenko, A.V. Computation-Efficient Face Recognition Algorithm Using a Sequential Analysis of High Dimensional Neural-Net Features. Opt. Mem. Neural Networks 29, 19–29 (2020). https://doi.org/10.3103/S1060992X2001004X
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.3103/S1060992X2001004X