
JBE, vol. 23, no. 6, pp. 780-789, November 2018

DOI: https://doi.org/10.5909/JBE.2018.23.6.780

Armed person detection using Deep Learning

Geonuk Kim, Minhun Lee, Yoojin Huh, Gisu Hwang, and Seoung-Jun Oh

Corresponding author e-mail: sjoh@kw.ac.kr

Abstract:

Gun crimes occur frequently around the world, in alleyways as well as in public places. Because small guns such as pistols are often used in these crimes, detecting a person armed with a pistol is essential to preventing them. Conventional approaches to armed person detection treat the armed person as a single object in the input image, and their accuracy is very low because the pistol is much smaller than the person holding it. To solve this problem, we propose a novel algorithm called APDA (Armed Person Detection Algorithm). In a post-processing step, APDA detects the armed person using the positions of both wrists and of the pistol, obtained from a CNN-based human body feature detection model and a pistol detection model, respectively. We show that APDA provides 46.3% better recall and 14.04% better precision than SSD-MobileNet.
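The post-processing idea in the abstract, combining wrist positions from a pose model with pistol positions from a detector, can be illustrated with a minimal sketch. The abstract does not give the actual matching rule, so everything below is an assumption made for illustration: the names `Box`, `Person`, and `find_armed_persons` are hypothetical, and the rule used here (assign each pistol to the nearest wrist within a distance proportional to the pistol box diagonal) is a stand-in, not the authors' method.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Optional, Tuple

Point = Tuple[float, float]


@dataclass
class Box:
    """Axis-aligned pistol bounding box in pixel coordinates (hypothetical type)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def center(self) -> Point:
        return ((self.x_min + self.x_max) / 2, (self.y_min + self.y_max) / 2)

    def diagonal(self) -> float:
        return hypot(self.x_max - self.x_min, self.y_max - self.y_min)


@dataclass
class Person:
    """Wrist keypoints for one detected person; None means the wrist was not found."""
    person_id: int
    left_wrist: Optional[Point]
    right_wrist: Optional[Point]


def find_armed_persons(persons: List[Person], pistols: List[Box],
                       scale: float = 1.0) -> List[int]:
    """Return sorted ids of persons with a wrist close to some pistol box.

    Assumed rule (not from the paper): a pistol is assigned to the person
    whose wrist is nearest to the box center, provided that distance is at
    most `scale` times the box diagonal.
    """
    armed = set()
    for pistol in pistols:
        cx, cy = pistol.center()
        best_id, best_dist = None, scale * pistol.diagonal()
        for person in persons:
            for wrist in (person.left_wrist, person.right_wrist):
                if wrist is None:
                    continue
                dist = hypot(wrist[0] - cx, wrist[1] - cy)
                if dist <= best_dist:
                    best_id, best_dist = person.person_id, dist
        if best_id is not None:
            armed.add(best_id)
    return sorted(armed)


# Example with made-up coordinates: person 0 holds the pistol, person 1 does not.
people = [
    Person(0, left_wrist=(100.0, 200.0), right_wrist=(150.0, 210.0)),
    Person(1, left_wrist=(400.0, 200.0), right_wrist=None),
]
guns = [Box(140.0, 200.0, 170.0, 220.0)]
print(find_armed_persons(people, guns))
```

Scaling the distance threshold by the pistol box size is one way to keep such a rule roughly independent of how far the person is from the camera; a fixed pixel threshold would be a simpler alternative.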



Keywords: Object-related human detection, Pose estimation, Object detection, CNN, Deep learning



