Multi-Proxy Constraint Loss for Vehicle Re-Identification
Abstract
:1. Introduction
- (1)
- We propose a novel sampling strategy considering different viewpoints, effectively selecting the samples captured by different cameras. This sampling strategy contributes to sample the images corresponding to different proxies in a mini-batch. Moreover, it helps to mine hard positive and negative sample pairs.
- (2)
- A multi-proxy constraint loss function is implemented to learn the multiple intra-class proxies and constrain the distance to hard positive proxy less than to hard negative proxy. The feature embedding space supervised by this loss function is more compact, resulting in a larger inter-class distance.
- (3)
- Our proposed approach can be seamlessly plugged into existing methods to improve performance with less effort. We conduct extensive experiments on two large-scale vehicle Re-ID datasets, achieving promising results.
2. Related Works
3. The Proposed Method
3.1. Sampling Strategy Considering Viewpoints
3.2. Multi-Proxy Constraint Loss
3.3. Network Architecture
4. Experiments
4.1. Implementation Details
4.2. Datasets and Evaluation Metrics
4.3. Comparisons to the State-of-the-Art
4.3.1. Performance Comparisons on VeRi-776 Dataset
4.3.2. Performance Comparisons on VehicleID Dataset
4.4. Ablation Analysis
4.4.1. The Validation of Multi-Proxy Constraint Loss
4.4.2. The Influence of the Number of Proxies
4.4.3. The Influence of Sampling Strategy
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Jain, V.; Sasindran, Z.; Rajagopal, A.; Biswas, S.; Bharadwaj, H.S.; Ramakrishnan, K.R. Deep automatic license plate recognition system. In Proceedings of the Tenth Indian Conference on Computer Vision, Graphics and Image Processing, Guwahati, Assam, India, 18–22 December 2016; pp. 1–8. [Google Scholar]
- Spanhel, J.; Sochor, J.; Juranek, R.; Herout, A.; Marsik, L.; Zemcik, P. Holistic recognition of low quality license plates by CNN using track annotated data. In Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy, 29 August–1 September 2017. [Google Scholar]
- Liu, X.; Zhang, S.; Huang, Q.; Gao, W. RAM: A Region-Aware Deep Model for Vehicle Re-Identification. In Proceedings of the 2018 IEEE International Conference on Multimedia and Expo (ICME), San Diego, CA, USA, 23–27 July 2018. [Google Scholar]
- He, B.; Li, J.; Zhao, Y.; Tian, Y. Part-regularized Near-duplicate Vehicle Re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 16–20 June 2019; Volume 1, pp. 3997–4005. [Google Scholar]
- Lou, Y.; Bai, Y.; Liu, J.; Wang, S. VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 16–20 June 2019; pp. 3235–3243. [Google Scholar]
- Wang, Z.; Tang, L.; Liu, X.; Yao, Z.; Yi, S.; Shao, J.; Yan, J.; Wang, S.; Li, H.; Wang, X. Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 379–387. [Google Scholar]
- Zhou, Y.; Liu, L.; Shao, L. Vehicle Re-Identification by Deep Hidden Multi-View Inference. IEEE Trans. Image Process. 2018, 27, 3275–3287. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Y.; Shao, L. Viewpoint-aware Attentive Multi-view Inference for Vehicle Re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 6489–6498. [Google Scholar]
- Tang, Z.; Naphade, M.; Birchfield, S.; Tremblay, J.; Hodge, W.; Kumar, R.; Wang, S.; Yang, X. PAMTRI: Pose-aware multi-task learning for vehicle re-identification using highly randomized synthetic data. In Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea, 27 October–3 November 2019; pp. 211–220. [Google Scholar]
- Khorramshahi, P.; Kumar, A.; Peri, N.; Rambhatla, S.S.; Chen, J.; Chellappa, R. A Dual-Path Model With Adaptive Attention For Vehicle Re-Identification. In Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea, 27 October–3 November 2019. [Google Scholar]
- Chu, R.; Sun, Y.; Li, Y.; Liu, Z.; Zhang, C.; Wei, Y. Vehicle Re-identification with Viewpoint-aware Metric Learning. In Proceedings of the IEEE International Conference on Computer Vision, Seoul, Korea, 27 October–3 November 2019. [Google Scholar]
- Bai, Y.; Lou, Y.; Gao, F.; Wang, S.; Wu, Y.; Duan, L. Group Sensitive Triplet Embedding for Vehicle Re-identification. IEEE Trans. Multimed. 2018, 20, 2385–2399. [Google Scholar] [CrossRef]
- Liu, X.; Liu, W.; Mei, T.; Ma, H. PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance. IEEE Trans. Multimed. 2018, 20, 645–658. [Google Scholar] [CrossRef]
- Liu, H.; Tian, Y.; Wang, Y.; Pang, L.; Huang, T. Deep Relative Distance Learning: Tell the Difference between Similar Vehicles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 2167–2175. [Google Scholar]
- Cui, C.; Sang, N.; Gao, C.; Zou, L. Vehicle re-identification by fusing multiple deep neural networks. In Proceedings of the 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA), Montreal, QC, Canada, 28 November–1 December 2017. [Google Scholar]
- Wei, X.-S.; Zhang, C.-L.; Liu, L.; Shen, C.; Wu, J. Coarse-to-fine: A RNN-based hierarchical attention model for vehicle re-identification. arXiv 2018, arXiv:1812.04239. [Google Scholar]
- Liu, W.; Wen, Y.; Yu, Z.; Yang, M. Large-Margin Softmax Loss for Convolutional Neural Networks. In Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 19–24 June 2016. [Google Scholar]
- Ranjan, R.; Castillo, C.D.; Chellappa, R. L2-constrained Softmax Loss for Discriminative Face Verification. arXiv 2017, arXiv:1703.09507. [Google Scholar]
- Wang, F.; Xiang, X.; Cheng, J.; Yuille, A.L. NormFace: L2 Hypersphere Embedding for Face Verification. In Proceedings of the 25th ACM International Conference on Multimedia, Mountain View, CA, USA, 23–27 October 2017. [Google Scholar]
- Liu, W.; Wen, Y.; Yu, Z.; Li, M.; Raj, B.; Song, L. SphereFace: Deep hypersphere embedding for face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 6738–6746. [Google Scholar]
- Deng, J.; Guo, J.; Xue, N.; Zafeiriou, S. ArcFace: Additive Angular Margin Loss for Deep Face Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018. [Google Scholar]
- Wang, H.; Wang, Y.; Zhou, Z.; Ji, X.; Gong, D.; Zhou, J.; Li, Z.; Liu, W. CosFace: Large Margin Cosine Loss for Deep Face Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 5265–5274. [Google Scholar]
- Wang, F.; Cheng, J.; Liu, W.; Liu, H. Additive Margin Softmax for Face Verification. IEEE Signal Process. Lett. 2018, 25, 926–930. [Google Scholar] [CrossRef] [Green Version]
- Hermans, A.; Beyer, L.; Leibe, B. In Defense of the Triplet Loss for Person Re-Identification. arXiv 2017, arXiv:1703.07737. [Google Scholar]
- Ye, M.; Liang, C.; Yu, Y.; Wang, Z.; Leng, Q.; Xiao, C.; Chen, J.; Hu, R. Person Reidentification via Ranking Aggregation of Similarity Pulling and Dissimilarity Pushing. IEEE Trans. Multimed. 2016, 18, 2553–2566. [Google Scholar] [CrossRef]
- Cheng, D.; Gong, Y.; Zhou, S.; Wang, J.; Zheng, N. Person re-identification by multi-channel parts-based CNN with improved triplet loss function. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 1335–1344. [Google Scholar]
- Fan, X.; Jiang, W.; Luo, H.; Fei, M. SphereReID: Deep hypersphere manifold embedding for person re-identification. J. Vis. Commun. Image Represent. 2019, 60, 51–58. [Google Scholar] [CrossRef] [Green Version]
- Chen, X.; Sui, H.; Fang, J.; Feng, W.; Zhou, M. Vehicle Re-Identification Using Distance-Based Global and Partial Multi-Regional Feature Learning. IEEE Trans. Intell. Transp. Syst. 2020, 1–11. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
- Liu, X.; Liu, W.; Ma, H.; Fu, H. Large-scale vehicle re-identification in urban surveillance videos. In Proceedings of the IEEE International Conference on Multimedia and Expo, Seattle, WA, USA, 11–15 July 2016; pp. 1–6. [Google Scholar]
- Peng, J.; Wang, H.; Zhao, T.; Fu, X. Learning multi-region features for vehicle re-identification with context-based ranking method. Neurocomputing 2019, 359, 427–437. [Google Scholar] [CrossRef]
- Kumar, R.; Weill, E.; Aghdasi, F.; Sriram, P. Vehicle Re-Identification: An Efficient Baseline Using Triplet Embedding. In Proceedings of the 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 14–19 July 2019. [Google Scholar]
- Zhong, X.; Feng, M.; Huang, W.; Wang, Z.; Satoh, S. Poses Guide Spatiotemporal Model for Vehicle Re-identification. In MultiMedia Modeling; Springer International Publishing: Cham, Switzerland, 2019; pp. 426–439. [Google Scholar]
- Shen, Y.; Xiao, T.; Li, H.; Yi, S.; Wang, X. Learning Deep Neural Networks for Vehicle Re-ID with Visual-Spatio-Temporal Path Proposals. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 1918–1927. [Google Scholar]
- Wu, C.; Liu, C.; Chiang, C.; Tu, W.; Chien, S. Vehicle Re-Identification with the Space-Time Prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018. [Google Scholar]
- Guo, H.; Zhao, C.; Liu, Z.; Wang, J.; Lu, H. Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification. In Thirty-Second {AAAI} Conference on Artificial Intelligence; McIlraith, S.A., Weinberger, K.Q., Eds.; {AAAI} Press: Palo Alto, CA, USA, 2018; pp. 6853–6860. [Google Scholar]
Model | Annotations | Rank-1 | Rank-5 | mAP |
---|---|---|---|---|
PROVID [13] 1 | ID + Attribute + Plate | 61.44 | 78.78 | 27.77 |
RHH-HA [16] | ID + Attribute | 74.79 | 87.31 | 56.80 |
Path-LSTM [34] 1 | ID | 83.49 | 90.04 | 58.27 |
RAM [3] | ID + Attribute | 88.60 | 94.00 | 61.50 |
BS [32] | ID | 90.23 | 96.42 | 67.55 |
FDA-Net [5] | ID + Attribute | 84.27 | 92.43 | 55.49 |
MRM [31] | ID | 91.77 | 95.82 | 68.55 |
AFL [35] | ID + Attribute | 86.29 | 94.39 | 57.43 |
PGST [33] 1 | ID + Pose + CameraID | 72.42 | 91.25 | 54.15 |
PGST + visual-SNN [33] 1 | ID + Pose + CameraID | 89.36 | 94.40 | 69.74 |
OIFE [6] | ID + Keypoints | 89.43 | - | 48.00 |
OIFE + ST [6] 1 | ID + Keypoints | 92.35 | - | 51.42 |
VAMI [8] | ID + Attribute | 77.03 | 90.82 | 50.13 |
VAMI + ST [8] 1 | ID + Attribute | 85.92 | 91.84 | 61.32 |
PAMTRI [9] | ID, Key points, Attribute | 92.86 | 96.97 | 71.88 |
AAVER (ResNet-50) [10] | ID, Key points | 88.97 | 94.70 | 58.52 |
MPCL (Ours) | ID + CameraID | 96.31 | 98.33 | 78.65 |
Model | Annotations | Small | Medium | Large | |||
---|---|---|---|---|---|---|---|
Rank1 | Rank5 | Rank1 | Rank5 | Rank1 | Rank5 | ||
CCL [14] | ID + Attribute | 49.00 | 73.50 | 42.80 | 66.80 | 38.20 | 61.60 |
C2F [36] | ID + Attribute | 61.10 | 81.70 | 56.20 | 76.20 | 51.40 | 72.20 |
RAM [3] | ID + Attribute | 75.20 | 91.50 | 72.30 | 87.00 | 67.70 | 84.50 |
GSTE [15] | ID | 75.90 | 84.20 | 74.80 | 83.60 | 74.00 | 82.70 |
FDA-Net [5] | ID + Attribute | 64.03 | 82.8 | 57.82 | 78.34 | 49.43 | 70.48 |
OIFE [6] | ID + Keypoints | - | - | - | - | 67.00 | 82.90 |
VAMI [8] | ID + Attribute | 63.12 | 83.25 | 52.87 | 75.12 | 47.34 | 70.29 |
MPCL (Ours) | ID | 81.75 | 92.63 | 78.79 | 90.71 | 75.91 | 88.91 |
Dataset | Method | Rank-1 | Rank-5 | Rank-10 | mAP | |
---|---|---|---|---|---|---|
VeRi-776 | SoftMax— | 94.46 | 97.68 | 98.45 | 76.58 | |
Distance-based classification | 95.83 | 98.03 | 98.57 | 77.31 | ||
Multi-proxy constraint | 96.31 | 98.33 | 98.75 | 78.65 | ||
VehicleID | Small | SoftMax— | 79.63 | 90.69 | 94.25 | 68.01 |
Distance-based classification | 80.88 | 91.13 | 93.69 | 71.30 | ||
Multi-proxy constraint | 81.75 | 92.63 | 95.75 | 72.31 | ||
Medium | SoftMax— | 74.96 | 88.50 | 92.75 | 62.28 | |
Distance-based classification | 78.50 | 89.67 | 93.75 | 67.02 | ||
Multi-proxy constraint | 78.80 | 90.71 | 94.21 | 67.98 | ||
Large | SoftMax— | 71.63 | 86.47 | 90.53 | 57.80 | |
Distance-based classification | 75.00 | 88.47 | 92.06 | 62.45 | ||
Multi-proxy constraint | 75.91 | 88.91 | 93.10 | 63.70 |
Method | Rank-1 | Rank-5 | mAP |
---|---|---|---|
Sampling considering viewpoints | 96.31 | 98.33 | 78.65 |
Random sampling | 95.41 | 97.56 | 77.72 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, X.; Sui, H.; Fang, J.; Zhou, M.; Wu, C. Multi-Proxy Constraint Loss for Vehicle Re-Identification. Sensors 2020, 20, 5142. https://doi.org/10.3390/s20185142
Chen X, Sui H, Fang J, Zhou M, Wu C. Multi-Proxy Constraint Loss for Vehicle Re-Identification. Sensors. 2020; 20(18):5142. https://doi.org/10.3390/s20185142
Chicago/Turabian StyleChen, Xu, Haigang Sui, Jian Fang, Mingting Zhou, and Chen Wu. 2020. "Multi-Proxy Constraint Loss for Vehicle Re-Identification" Sensors 20, no. 18: 5142. https://doi.org/10.3390/s20185142
APA StyleChen, X., Sui, H., Fang, J., Zhou, M., & Wu, C. (2020). Multi-Proxy Constraint Loss for Vehicle Re-Identification. Sensors, 20(18), 5142. https://doi.org/10.3390/s20185142