Search by item HOME > Access full text > Search by item

JBE, vol. 24, no. 4, pp.564-572, July, 2019

DOI: https://doi.org/10.5909/JBE.2019.24.4.564

Integral Regression Network for Facial Landmark Detection

Do Yeop Kim and Ju Yong Chang

C.A E-mail: jychang@kw.ac.kr

Abstract:

With the development of deep learning, the performance of facial landmark detection methods has been greatly improved. The heat map regression method, which is a representative facial landmark detection method, is widely used as an efficient and robust method. However, the landmark coordinates cannot be directly obtained through a single network, and the accuracy is reduced in determining the landmark coordinates from the heat map. To solve these problems, we propose to combine integral regression with the existing heat map regression method. Through experiments using various datasets, we show that the proposed integral regression network significantly improves the performance of facial landmark detection.



Keyword: Face Alignment, Facial Landmark Detection, Deep Learning

Reference:
[1] A. Bulat and G. Tzimiropoulos, “How far are we from solving the 2d & 3d face alignment problem? (and a dataset of 230,000 3d facial landmarks),” IEEE International Conference on Computer Vision, pp. 1021-1030, 2017.
[2] X. Sun, B. Xiao, F. Wei, S. Liang, and Y. Wei, “Integral human pose regression,” European Conference on Computer Vision, pp. 529-545, 2018.
[3] Y. Sun, X. Wang, and X. Tang, “Deep convolutional network cascade for facial point detection,” IEEE Conference on Computer Vision and Pattern Recognition, 2013.
[4] E. Zhou, H. Fan, Z. Cao, Y. Jiang, and Q. Yin, “Extensive facial landmark localization with coarse-to-fine convolutional network cascade,” IEEE International Conference on Computer Vision Workshops, 2013.
[5] Z. Zhang, P. Luo, C. C. Loy, and X. Tang, “Facial landmark detection by deep multi-task learning,” European Conference on Computer Vision, 2014.
[6] R. Ranjan, V. M. Patel, and R. Chellappa, “Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 1, pp. 121-135. 2019.
[7] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
[8] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.
[9] X. Zhu, Z. Lei, X. Liu, H. Shi, and S. Z. Li, “Face alignment across large poses: A 3d solution,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 146-155, 2016.
[10] C. Sagonas, E. Antonakos, G. Tzimiropoulos, S. Zafeiriou, and M. Pantic, “300 faces in-the-wild challenge: Database and results,” Image and Vision Computing, vol. 47, pp. 3-18, 2016.
[11] S. Zaferiou, “The menpo facial landmark localisation challenge,” IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017.
[12] J. Shen, S. Zafeiriou, G. G. Chrysos, J. Kossaifi, G. Tzimiropoulos, and M. Pantic, “The first facial landmark tracking in-the-wild challenge: Benchmark and results,” IEEE International Conference on Computer Vision Workshops, 2015.
[13] V. Jain and E. Learned-Miller, “Fddb: A benchmark for face detection in unconstrained settings,” UMass Amherst Technical Report, 2010.
[14] M. Köstinger, P. Wohlhart, P. M. Roth, and H. Bischof, “Annotated facial landmarks in the wild: A large-scale, real world database for facial landmark localization,” IEEE International Conference on Computer Vision Workshops, 2011.
[15] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” International Conference on Learning Representations, 2015.
[16] A. Paszke, S. Gross, S. Chintala, G. Chanan, E. Yang, Z. DeVito, Z. Lin, A. Desmaison, L. Antiga, and A. Lerer, “Automatic differentiation in pytorch,” Advances in Neural Information Processing Systems Workshops, 2017.
[17] A. Newell, K. Yang, and J. Deng, “Stacked hourglass network for human pose estimation,” European Conference on Computer Vision, pp. 483-499, 2016.

Comment


Editorial Office
1108, New building, 22, Teheran-ro 7-gil, Gangnam-gu, Seoul, Korea
Homepage: www.kibme.org TEL: +82-2-568-3556 FAX: +82-2-568-3557
Copyrightⓒ 2012 The Korean Institute of Broadcast and Media Engineers
All Rights Reserved