|Table of Contents|

Vehicle and Pedestrian Detection Model Based on Lightweight SSD(PDF)

《南京师大学报(自然科学版)》[ISSN:1001-4616/CN:32-1239/N]

Issue:
2019年01期
Page:
73-
Research Field:
·人工智能算法与应用专栏·
Publishing date:

Info

Title:
Vehicle and Pedestrian Detection Model Based on Lightweight SSD
Author(s):
Zheng Dong1Li Xiangqun1Xu Xinzheng12
(1.School of Computer Science and Technology,China University of Mining and Technology,Xuzhou 221116,China)(2.Key Laboratory of Data Science and Intelligence Application,Fujian Province University,Zhangzhou 363000,China)
Keywords:
object detectionconvolutional neural networklightweight neural networkSSDmobileNetv2
PACS:
TP193
DOI:
10.3969/j.issn.1001-4616.2019.01.012
Abstract:
In recent years,the object detection algorithm based on deep learning has developed rapidly. However,it can’t be widely used in embedded platforms because the network is too large. This paper optimized the model size of SSD(Single Shot Multi-box Detector)network,introduced the lightweight convolutional neural network—MobileNetv2,analyzed the inverted residual and linear bottleneck structure in MobileNetv2,and compared SSD and its lightweight version—SSDLite. We proposed a lightweight vehicle and pedestrian detection model which named LVP-DN(Lightweight Vehicle and Pedestrian Detection Network). First,the MobilNetv2 was used to instead of VGG as the basic network to perform feature extraction. Then,the SSDLite was used to replace the original structure,in order to reduce the model size and speed up the detection process. It is improved that the accuracy of network detection for pedestrians by optimizing the ratio of the default box. We compared the impact of three factors on network performance on the KITTI and PASCAL VOC datasets. The factors are the input image size,different basic network and whether used the pre-training models. The experimental results show that compared with other popular object detection models,the vehicle and pedestrian detection models proposed in this paper have achieved good results in the evaluation standards such as accuracy,speed,and model size.

References:

[1] SZEGEDY C,TOSHEV A,ERHAN D. Deep neural networks for object detection[C]//International Conference on Neural Information Processing Systems. USA:MIT Press,2013,26:2553-2561.
[2]SERMANET P,EIGEN D,ZHANG X,et al. OverFeat:integrated recognition,localization and detection using convolutional networks[J]. Eprint Arxiv,2013:1312.6229.
[3]REN S,HE K,GIRSHICK R,et al. Faster R-CNN:towards real-time object detection with region proposal networks[C]//International Conference on Neural Information Processing Systems. Canada:MIT Press,2015:91-99.
[4]DAI J,LI Y,HE K,et al. R-FCN:Object detection via region-based fully convolutional networks[J]. Eprint Arxiv,2016:1605.06409.
[5]HE K,GKIOXARI G,DOLLáR P,et al. Mask R-CNN[C]//IEEE International Conference on Computer Vision. Italy:IEEE,2017:2980-2988.
[6]LIU W,ANGUELOV D,ERHAN D,et al. SSD:single shot multibox detector[C]//Computer Vision-ECCV 2016. Amsterdam,the Netherlands:Springer International Publishing,2016:21-37.
[7]REDMON J,DIVVALA S,GIRSHICK R,et al. You only look once:unified,real-time object detection[C]//Computer Vision and Pattern Recognition. USA:IEEE,2016:779-788.
[8]GIRSHICK R,DONAHUE J,DARRELL T,et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition. USA:IEEE Computer Society,2014:580-587.
[9]SANDLER M,HOWARD A,ZHU M,et al. MobileNetV2:inverted residuals and linear bottlenecks[J]. Eprint Arxiv,2018.
[10]GIRSHICK R. Fast R-CNN[C]//IEEE International Conference on Computer Vision. USA:IEEE Computer Society,2015:1440-1448.
[11]REDMON J,FARHADI A. YOLO9000:better,faster,stronger[C]//IEEE Conference on Computer Vision and Pattern Recognition. Italy:IEEE Computer Society,2017:6517-6525.
[12]REDMON J,FARHADI A. YOLOv3:an incremental improvement[J]. Eprint Arxiv,2018:104.02767.
[13]HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition. USA:IEEE Computer Society,2016:770-778.
[14]LIN T Y,DOLLAR P,GIRSHICK R,et al. Feature pyramid networks for object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition. Italy:IEEE Computer Society,2017:936-944.
[15]WONG A,SHAFIEE M J,LI F,et al. Tiny SSD:a tiny single-shot detection deep convolutional neural network for real-time embedded object detection[C]//Conference on Computer and Robot Vision. Foronto:IEEE,2018(15):95-101.
[16]IANDOLA F N,HAN S,MOSKEWICZ M W,et al. SqueezeNet:AlexNet-level accuracy with 50×fewer parameters and<0.5 MB model size[J]. Eprint Arxiv,2016:1602.07360.
[17]EVERINGHAM M. The PASCAL visual object classes challenge[J]. Lecture notes in computer science,2005,111(1):98-136.
[18]CHOLLET F. Xception:deep learning with depthwise separable convolutions[C]//IEEE Conference on Computer Vision and Pattern Recognition. Italy:IEEE Computer Society,2017:1800-1807.
[19]HOWARD A G,ZHU M,CHEN B,et al. MobileNets:efficient convolutional neural networks for mobile vision applications[J]. Eprint Arxiv,2017.
[20]GEIGER A,LENZ P,URTASUN R. Are we ready for autonomous driving?The KITTI vision benchmark suite[C]//IEEE Conference on Computer Vision and Pattern Recognition. USA:IEEE Computer Society,2012:3354-3361.
[21]GEIGER A,LENZ P,STILLER C,et al. Vision meets robotics:the KITTI dataset[J]. International journal of robotics research,2013,32(11):1231-1237.
[22]LECUN Y,BOTTOU L,BENGIO Y,et al. Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE,1998,86(11):2278-2324.
[23]SIMONYAN K,ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[C]//International Conference on Leoorning Representations. USA:IEEE,2014.
[24]GIRSHICK R. Fast R-CNN[C]//IEEE International Conference on Computer Vision. USA:IEEE Computer Society,2015:1440-1448.
[25]KIM H,LEE Y,YIM B,et al. On-road object detection using deep neural network[C]//IEEE International Conference on Consumer Electronics-Asia. Korea:IEEE,2016:1-4.
[26]HUANG J,GUADARRAMA S,MURPHY K,et al. Speed/accuracy trade-offs for modern convolutional object detectors[C]//IEEE International Conference on Computer Vision. USA:IEEE Computer Society,2016:3296-3297.

Memo

Memo:
-
Last Update: 2019-03-30