Self-paced uncertainty estimation for one-shot person re-identification | Applied Intelligence Skip to main content
Log in

Self-paced uncertainty estimation for one-shot person re-identification

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

The one-shot person Re-ID scenario faces two kinds of uncertainties when constructing the prediction model from X to Y. The first is model uncertainty, which captures the noise of the parameters in DNNs due to a lack of training data. The second is data uncertainty, which can be divided into two subtypes: image noise, where severe occlusion and the complex background contain irrelevant information about the identity; and label noise, where mislabeling affects visual appearance learning. We find that the state-of-the-art one-shot person Re-ID addresses the first issue of model uncertainty via a dynamic sampling strategy, while the second issue of data uncertainty remains. In this paper, to simultaneously address both issues, we propose a novel SPUE-Net for one-shot person Re-ID. By introducing a self-paced sampling strategy, our method can estimate the pseudolabels of unlabeled samples iteratively to expand the labeled samples gradually and remove model uncertainty without extra supervision. We divide the pseudolabel samples into two subsets to make the use of training samples more reasonable and effective. In addition, we apply a co-operative learning method of local uncertainty estimation combined with determinacy estimation to achieve better-hidden space feature mining and to improve the precision of selected pseudolabeled samples, which reduces data uncertainty. Extensive comparative evaluation experiments on video-based and image-based datasets show that SPUE-Net has significant advantages over state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Xu J, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2119–2128

  2. Wu Y, Lin Y, Dong X, Yan Y, Ouyang W, Yang Y (2018) Exploit the unknown gradually: one-shot video-based person re-identification by stepwise learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5177–5186

  3. Pang Z, Guo J, Sun W, Xiao Y, Yu M (2021) Cross-domain person re-identification by hybrid supervised and unsupervised learning

  4. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  5. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124

  6. Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, Yang Y (2019) Improving person re-identification by attribute and identity learning. Pattern Recognit

  7. Zheng L, Bie Z, Sun Y, Wang J, Su C, Wang S, Tian Q (2016) Mars: a video benchmark for large-scale person re-identification. In: European conference on computer vision. Springer, pp 868–884

  8. Liu X, Liu W, Mei T, Ma H (2017) Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20(3):645–658

    Article  Google Scholar 

  9. Zheng Z, Yang X, Yu Z, Zheng L, Yang Y, Kautz J (2019) Joint discriminative and generative learning for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2138–2147

  10. Wu A, Zheng W-S, Guo X, Lai J-H (2019) Distilled person re-identification: towards a more scalable system. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1187–1196

  11. Wang C, Zhang Q, Huang C, Liu W, Wang X (2018) Mancs: a multi-task attentional network with curriculum sampling for person re-identification. In: Proceedings of the european conference on computer vision (ECCV), pp 365–381

  12. Li J, Ma AJ, Yuen PC (2018) Semi-supervised region metric learning for person re-identification. Int J Comput Vis 126(8):855–874

    Article  Google Scholar 

  13. Figueira D, Bazzani L, Minh HQ, Cristani M, Bernardino A, Murino V (2013) Semi-supervised multi-feature learning for person re-identification, pp 111–116

  14. Kendall A, Gal Y (2017) What uncertainties do we need in bayesian deep learning for computer vision?. In: Advances in neural information processing systems, pp 5574–5584

  15. Gal Y (2016) Uncertainty in deep learning. University of Cambridge, vol 1(3)

  16. Bender G, Kindermans P-J, Zoph B, Vasudevan V, Le Q (2018) Understanding and simplifying one-shot architecture search. In: International conference on machine learning, pp 549–558

  17. Yang X, Wang M, Hong R, Tian Q, Rui Y (2017) Enhancing person re-identification in a self-trained subspace. ACM Trans Multimed Comput Commun Appl (TOMM) 13(3):27

    Google Scholar 

  18. Bak S, Carr P (2017) One-shot metric learning for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2990–2999

  19. Wu Y, Lin Y, Dong X, Yan Y, Bian W, Yang Y (2019) Progressive learning for person re-identification with one example. IEEE Trans Image Process 28(6):2872–2881

    Article  MathSciNet  MATH  Google Scholar 

  20. Zhang Y, Ma B, Feng Y, Li M (2021) Pmt-net: progressive multi-task network for one-shot person re-identification. Inf Sci 568:133–146

    Article  Google Scholar 

  21. Hui L, Xiao J, Sun M, Lim EG, Zhao Y (2021) Progressive sample mining and representation learning for one-shot person re-identification. Pattern Recognit 110:107614

    Article  Google Scholar 

  22. Shao J, Ma X (2022) Hierarchical pseudo-label learning for one-shot person re-identification. Appl Intell 52(8):9225–9238

    Article  Google Scholar 

  23. Bengio Y, Louradour J, Collobert R, Weston J (2009) Curriculum learning, pp 41–48

  24. Lee YJ, Grauman K (2011) Learning the easy things first: self-paced visual category discovery, pp 1721–1728

  25. Zhang D, Meng D, Han J (2017) Co-saliency detection via a self-paced multiple-instance learning framework. IEEE Trans Pattern Anal Mach Intell 39(5):865–878

    Article  Google Scholar 

  26. Jiang L, Meng D, Yu S, Lan Z, Shan S, Hauptmann AG (2014) Self-paced learning with diversity, pp 2078–2086

  27. Shi Y, Jain AK (2019) Probabilistic face embeddings

  28. Chang J, Lan Z, Cheng C, Wei Y (2020) Data uncertainty learning in face recognition. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 5710–5719

  29. Yu T, Li D, Yang Y, Hospedales TM, Xiang T (2019) Robust person re-identification by modelling feature uncertainty. In: Proceedings of the IEEE international conference on computer vision, pp 552–561

  30. Kendall A, Gal Y, Cipolla R (2018) Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7482–7491

  31. Depeweg S, Hernandez-Lobato J-M, Doshi-Velez F, Udluft S (2018) Decomposition of uncertainty in bayesian deep learning for efficient and risk-sensitive learning. In: International conference on machine learning. PMLR, pp 1184–1193

  32. He Y, Zhu C, Wang J, Savvides M, Zhang X (2019) Bounding box regression with uncertainty for accurate object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2888–2897

  33. Choi J, Chun D, Kim H, Lee H-J (2019) Gaussian yolov3: an accurate and fast object detector using localization uncertainty for autonomous driving. In: Proceedings of the IEEE international conference on computer vision, pp 502–511

  34. Zheng Z, Yang Y (2020) Rectifying pseudo label learning via uncertainty estimation for domain adaptive semantic segmentation. arXiv:2003.03773

  35. Zafar U, Ghafoor M, Zia T, Ahmed G, Latif A, Malik KR, Sharif AM (2019) Face recognition with bayesian convolutional networks for robust surveillance systems. EURASIP J Image Video Process 2019(1):10

    Article  Google Scholar 

  36. Shi Y, Jain AK (2019) Probabilistic face embeddings. In: Proceedings of the IEEE international conference on computer vision, pp 6902–6911

  37. Marathe A, Walambe R, Kotecha K, Jain DK (2022) In rain or shine: understanding and overcoming dataset bias for improving robustness against weather corruptions for autonomous vehicles

  38. Kalayeh MM, Basaran E, Gökmen M, Kamasak ME, Shah M (2018) Human semantic parsing for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1062–1071

  39. Ristani E, Tomasi C (2018) Features for multi-target multi-camera tracking and re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6036–6046

  40. Zhang Z, Lan C, Zeng W, Chen Z (2019) Densely semantically aligned person re-identification, pp 667–676

  41. Zheng Z, Zheng L, Yang Y (2018) Pedestrian alignment network for large-scale person re-identification. IEEE Trans Circuits Syst Video Technol

  42. Wu L, Wang Y, Gao J, Li X (2018) Where-and-when to look: Deep siamese attention networks for video-based person re-identification. IEEE Trans Multimed 21(6):1412– 1424

    Article  Google Scholar 

  43. Holkar A, Walambe R, Kotecha K (2022) Few-shot learning for face recognition in the presence of image discrepancies for limited multi-class datasets. Image Vis Comput 120:104420

    Article  Google Scholar 

  44. Jy A, Jx A, Zma B, Jg A (2022) Mpc 2 2 l: multiview predictive coding with contrastive learning for person re-identification. Pattern Recognit

  45. Ye M, Ma AJ, Zheng L, Li J, Yuen PC (2017) Dynamic label graph matching for unsupervised video re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5142–5150

  46. Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1318–1327

  47. Kumar MP, Packer B, Koller D (2010) Self-paced learning for latent variable models, pp 1189–1197

  48. Settles B (2009) Active learning literature survey. Technical report, university of wisconsin-madison department of computer sciences

  49. Johnson RT, Johnson DW (2008) Active learning: cooperation in the classroom. Annual Report Educ Psychol Japan 47:29–30

    Article  Google Scholar 

  50. Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the eleventh annual conference on computational learning theory, pp 92–100

  51. Han T, Xie W, Zisserman A (2020) Self-supervised co-training for video representation learning. Adv Neural Inf Process Syst, vol 33

  52. Kingma DP, Welling M (2013) Auto-encoding variational bayes. arXiv:1312.6114

  53. Alemi AA, Fischer I, Dillon JV (2018) Uncertainty in the variational information bottleneck. arXiv:1807.00906

  54. Zhang J, Zhao C, Ni B, Xu M, Yang X (2019) Variational few-shot learning. In: Proceedings of the IEEE/CVF international conference on computer vision (ICCV)

  55. Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision. Springer, pp 17–35

  56. Liu Z, Wang D, Lu H (2017) Stepwise metric promotion for unsupervised video person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 2429–2438

  57. Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167

Download references

Acknowledgements

This work was supported in part by the National Key Research and Development Program of China under Grant 2020YFC0832502.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bo Ma.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Ma, B., Liu, L. et al. Self-paced uncertainty estimation for one-shot person re-identification. Appl Intell 53, 15080–15094 (2023). https://doi.org/10.1007/s10489-022-04245-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-04245-1

Keywords

Navigation