参考文献/References:
[1] FU X,ZHAO Y,WEI Y,et al. Rich features embedding for cross-modal retri:a simple baseline[J]. IEEE transactions on multimedia,2020,22(9):2354-2365.
[2]CEYHUN C,HASAN S B. Content based image retri with sparse representations and local feature descriptors:a comparative study[J]. Pattern recognition,2017,68:1-13.
[3]LOWE D G. Distinctive image features from scale-invariant keypoints[J]. International journal of computer vision,2004,60(2):91-110.
[4]OLIVA A,TORRALBA A. Modeling the shape of the scene:a holistic representation of the spatial envelope[J]. International journal of computer vision,2001,42(3):145-175.
[5]DALAL N,TRIGGS B. Histograms of oriented gradients for human detection[C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition,San Diego,CA,USA:IEEE Computer Society,2005:886-893.
[6]KRIZHEVSKY A,SUTSKEVER I,HINTON G. ImageNet classification with deep convolutional neural networks[C]//The 26th Annual Conference on Neural Information Processing Systems 2012,Lake Tahoe,Nevada,USA,ACM,2012:1106-1114.
[7]SZEGEDY C,WEI L,JIA Y,et al. Going deeper with convolutions[C]//2015 IEEE Conference on Computer Vision and Pattern Recognition,Boston,MA,USA:IEEE Computer Society,2015:1-9.
[8]SIMONYAN K,ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[C]//The 3rd International Conference on Learning Representations,San Diego,CA,USA:IEEE,2015:45-52.
[9]HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition,Las Vegas,NV,USA:IEEE Computer Society,2016:770-778.
[10]HYEONWOO N,ANDRE A,JACK S,et al. Large-scale image retri with attentive deep local features[C]//2017 IEEE International Conference on Computer Vision,Venice,Italy:IEEE,2017:3476-3485.
[11]LIN T Y,DOLLAR P,GIRSHICK R,et al. Feature pyramid networks for object detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition,Honolulu,HI,USA:IEEE,2017:936-944.
[12]LIU Z,LUO P,QIU S,et al. Deep Fashion:powering robust clothes recognition and retri with rich annotations[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition,Las Vegas,NV,USA:IEEE,2016:1096-1104.
[13]KONG T,YAO A,CHEN Y,et al. HyperNet:towards accurate region proposal generation and joint object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition,Las Vegas,NV,USA:IEEE,2016:845-853.