Towards More Powerful Multi-column Convolutional Network for Crowd Counting | SpringerLink
Skip to main content

Towards More Powerful Multi-column Convolutional Network for Crowd Counting

  • Conference paper
  • First Online:
Image and Graphics (ICIG 2021)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12888))

Included in the following conference series:

  • 2075 Accesses

Abstract

Scale variation has always been one of the most challenging problems for crowd counting. By using multi-column convolutions with different receptive fields to deal with different scales in the scene, the multi-column convolutional networks have achieved good performance. However, there is still great potential waiting to be explored for multi-column convolutional networks. To this end, we propose to design a multi-column neural network that can more effectively adapt to scene scale variations automatically, by applying Neural Architecture Search technology. First, we combine Progressive Neural Architecture Search scheme with crowd counting to construct our Progressive Multi-column Architecture Serach (PMAS) framework. Furthermore, to reduce the bias caused by the weight-share scheme, which is widely adopted in efficient Neural Architecture Search, we propose a novel pre-architecture-based weight-share scheme. Experiments on several challenging datasets demonstrate the effectiveness of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 14871
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 18589
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zhang, Y., Zhou, D., Chen, S., Gao, S., Ma, Y.: Single-image crowd counting via multi-column convolutional neural network. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 589–597. IEEE (2016)

    Google Scholar 

  2. Cheng, Z.-Q., Li, J.-X., Dai, Q., Wu, X., Hauptmann, A.: Learning spatial awareness to improve crowd counting. In: 2019 IEEE/CVF International Conference on Computer Vision, pp. 6152–6161. IEEE (2019)

    Google Scholar 

  3. Ranjan, V., Le, H.M., Hoai, M.: Iterative crowd counting. In: Proceedings of the European Conference on Computer Vision, pp. 270–285. IEEE (2018)

    Google Scholar 

  4. Cheng, Z.-Q., Li, J.-X., Dai, Q., Wu, X., He, J.-Y., Hauptmann, A.G.: Improving the learning of multi-column convolutional neural network for crowd counting. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 1897–1906. ACM (2019)

    Google Scholar 

  5. Li, Y., Zhang, X., Chen, D.: CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1091–1100. IEEE (2018)

    Google Scholar 

  6. Wang, Z., Xiao, Z., Xie, K., Qiu, Q., Zhen, X., Cao, X.: In defense of single-column networks for crowd counting. In BMVC, p. 78 (2018)

    Google Scholar 

  7. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)

    Google Scholar 

  8. Liu, C., et al.: Progressive neural architecture search. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11205, pp. 19–35. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01246-5_2

    Chapter  Google Scholar 

  9. Perez-Rua, J.-M., Baccouche, M., Pateux, S.: Efficient progressive neural architecture search. In: BMVC, p. 150 (2018)

    Google Scholar 

  10. Pham, H., Guan, M.Y., Zoph, B., Le, Q.V., Dean, J.: Efficient neural architecture search via parameters sharing. In: International Conference on Machine Learning, pp. 4092–4101 (2018)

    Google Scholar 

  11. Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, vol. 2, pp. 1398–1402 (2003)

    Google Scholar 

  12. Liu, L., Qiu, Z., Li, G., Liu, S., Ouyang, W., Lin, L.: Crowd counting with deep structured scale integration network. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 1774–1783 (2019)

    Google Scholar 

  13. Real, E., et al.: Large-scale evolution of image classifiers. In: ICML 2017 Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 2902–2911 (2017)

    Google Scholar 

  14. Yu, K., Sciuto, C., Jaggi, M., Musat, C.,Salzmann, M.: Evaluating the search phase of neural architecture search. In: Eighth International Conference on Learning Representations (2020)

    Google Scholar 

  15. Bender, G., et al.: Can weight sharing outperform random architecture search? An investigation with Tu-NAS. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14323–14332. IEEE (2020)

    Google Scholar 

  16. Li, G., Qian, G., Delgadillo, I.C., Muller, M., Thabet, A., Ghanem, B.: SGAS: sequential greedy architecture search. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1620–1630. IEEE (2020)

    Google Scholar 

  17. Idrees, H., Saleemi, I., Seibert, C., Shah, M.: Multi-source multi-scale counting in extremely dense crowd images. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2547–2554. IEEE (2013)

    Google Scholar 

  18. Zoph, B., Vasudevan, V., Shlens, J., Le, Q. V.: Learning transferable architectures for scalable image recognition. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8697–8710. IEEE (2018)

    Google Scholar 

  19. Zoph, B., Le, Q.V.: Neural architecture search with reinforcement learning. In: ICLR (2016)

    Google Scholar 

  20. Ghiasi, G., Lin, T.-Y., Le, Q.V.: NAS-FPN: learning scalable feature pyramid architecture for object detection. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7036–7045. IEEE (2019)

    Google Scholar 

  21. Brostow, G.J., Cipolla, R.: Unsupervised Bayesian detection of independent motion in crowds. In: 2006 IEEE Computer Society Conference on Computer Vision and Pat-tern Recognition, vol. 1, pp. 594–601. IEEE (2016)

    Google Scholar 

  22. Lin, S.-F., Chen, J.-Y., Chao, H.-X.: Estimation of number of people in crowded scenes using perspective transformation. Syst. Man Cybern. 31(6), 645–654 (2001)

    Google Scholar 

  23. Chan, A.B., Vasconcelos, N.: Counting people with low-level features and Bayesian regression. IEEE Trans. Image Process. 21(4), 2160–2177 (2012)

    Article  MathSciNet  Google Scholar 

  24. Chen, K., Gong, S., Xiang, T., Loy, C.C.: Cumulative attribute space for age and crowd density estimation. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2467–2474. IEEE (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weihai Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, J., Chu, Q., Li, W., Liu, B., Zhang, W., Yu, N. (2021). Towards More Powerful Multi-column Convolutional Network for Crowd Counting. In: Peng, Y., Hu, SM., Gabbouj, M., Zhou, K., Elad, M., Xu, K. (eds) Image and Graphics. ICIG 2021. Lecture Notes in Computer Science(), vol 12888. Springer, Cham. https://doi.org/10.1007/978-3-030-87355-4_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87355-4_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87354-7

  • Online ISBN: 978-3-030-87355-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics