Abstract
One-shot person Re-ID faces two kinds of uncertainty when constructing the prediction model from X to Y. The first is model uncertainty, which captures the noise in the parameters of DNNs caused by a lack of training data. The second is data uncertainty, which can be divided into two subtypes: image noise, where severe occlusion and complex backgrounds introduce information irrelevant to identity; and label noise, where mislabeling affects visual appearance learning. We find that state-of-the-art one-shot person Re-ID methods address the first issue, model uncertainty, via a dynamic sampling strategy, while the second issue, data uncertainty, remains unaddressed. In this paper, to address both issues simultaneously, we propose SPUE-Net, a novel network for one-shot person Re-ID. By introducing a self-paced sampling strategy, our method iteratively estimates the pseudolabels of unlabeled samples to gradually expand the labeled set and remove model uncertainty without extra supervision. We divide the pseudolabeled samples into two subsets so that the training samples are used more reasonably and effectively. In addition, we apply a co-operative learning method that combines local uncertainty estimation with determinacy estimation to mine hidden-space features more effectively and to improve the precision of the selected pseudolabeled samples, which reduces data uncertainty. Extensive comparative experiments on video-based and image-based datasets show that SPUE-Net has significant advantages over state-of-the-art methods.
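The self-paced expansion of the labeled set described in the abstract can be illustrated with a minimal sketch. This is not the SPUE-Net implementation: it assumes that pseudolabels come from the nearest labeled sample in feature space, that confidence is the negative of that distance, and that the two subsets are simply "selected" and "remaining". The function names (`nearest_labeled`, `select_pseudolabels`, `self_paced_schedule`) and the `step_fraction` parameter are hypothetical.

```python
import math


def nearest_labeled(feature, labeled):
    """Return (distance, label) of the labeled sample closest to `feature`."""
    return min(
        ((math.dist(feature, f), y) for f, y in labeled),
        key=lambda pair: pair[0],
    )


def select_pseudolabels(labeled, unlabeled, num_select):
    """Pseudolabel every unlabeled feature and keep the most confident ones.

    Assumption: a smaller distance to the nearest labeled sample means
    a more reliable pseudolabel.
    """
    scored = []
    for idx, feature in enumerate(unlabeled):
        distance, label = nearest_labeled(feature, labeled)
        scored.append((distance, idx, label))
    scored.sort(key=lambda item: (item[0], item[1]))
    # Subset 1: confident samples, returned with their pseudolabels.
    selected = [(idx, label) for _, idx, label in scored[:num_select]]
    # Subset 2: the remaining samples, returned as indices only.
    remaining = [idx for _, idx, _ in scored[num_select:]]
    return selected, remaining


def self_paced_schedule(num_unlabeled, step_fraction):
    """Yield growing selection sizes until all unlabeled samples are covered."""
    step = max(1, int(num_unlabeled * step_fraction))
    size = step
    while size < num_unlabeled:
        yield size
        size += step
    yield num_unlabeled
```

In a full training loop, the feature extractor would be retrained on the labeled and selected samples after each step of the schedule, and the pseudolabels would then be re-estimated with the updated features.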
Acknowledgements
This work was supported in part by the National Key Research and Development Program of China under Grant 2020YFC0832502.
About this article
Cite this article
Zhang, Y., Ma, B., Liu, L. et al. Self-paced uncertainty estimation for one-shot person re-identification. Appl Intell 53, 15080–15094 (2023). https://doi.org/10.1007/s10489-022-04245-1