Abstract
Scale variation has always been one of the most challenging problems for crowd counting. By using multi-column convolutions with different receptive fields to deal with different scales in the scene, the multi-column convolutional networks have achieved good performance. However, there is still great potential waiting to be explored for multi-column convolutional networks. To this end, we propose to design a multi-column neural network that can more effectively adapt to scene scale variations automatically, by applying Neural Architecture Search technology. First, we combine Progressive Neural Architecture Search scheme with crowd counting to construct our Progressive Multi-column Architecture Serach (PMAS) framework. Furthermore, to reduce the bias caused by the weight-share scheme, which is widely adopted in efficient Neural Architecture Search, we propose a novel pre-architecture-based weight-share scheme. Experiments on several challenging datasets demonstrate the effectiveness of our method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhang, Y., Zhou, D., Chen, S., Gao, S., Ma, Y.: Single-image crowd counting via multi-column convolutional neural network. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 589–597. IEEE (2016)
Cheng, Z.-Q., Li, J.-X., Dai, Q., Wu, X., Hauptmann, A.: Learning spatial awareness to improve crowd counting. In: 2019 IEEE/CVF International Conference on Computer Vision, pp. 6152–6161. IEEE (2019)
Ranjan, V., Le, H.M., Hoai, M.: Iterative crowd counting. In: Proceedings of the European Conference on Computer Vision, pp. 270–285. IEEE (2018)
Cheng, Z.-Q., Li, J.-X., Dai, Q., Wu, X., He, J.-Y., Hauptmann, A.G.: Improving the learning of multi-column convolutional neural network for crowd counting. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 1897–1906. ACM (2019)
Li, Y., Zhang, X., Chen, D.: CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1091–1100. IEEE (2018)
Wang, Z., Xiao, Z., Xie, K., Qiu, Q., Zhen, X., Cao, X.: In defense of single-column networks for crowd counting. In BMVC, p. 78 (2018)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
Liu, C., et al.: Progressive neural architecture search. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11205, pp. 19–35. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01246-5_2
Perez-Rua, J.-M., Baccouche, M., Pateux, S.: Efficient progressive neural architecture search. In: BMVC, p. 150 (2018)
Pham, H., Guan, M.Y., Zoph, B., Le, Q.V., Dean, J.: Efficient neural architecture search via parameters sharing. In: International Conference on Machine Learning, pp. 4092–4101 (2018)
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, vol. 2, pp. 1398–1402 (2003)
Liu, L., Qiu, Z., Li, G., Liu, S., Ouyang, W., Lin, L.: Crowd counting with deep structured scale integration network. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 1774–1783 (2019)
Real, E., et al.: Large-scale evolution of image classifiers. In: ICML 2017 Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 2902–2911 (2017)
Yu, K., Sciuto, C., Jaggi, M., Musat, C.,Salzmann, M.: Evaluating the search phase of neural architecture search. In: Eighth International Conference on Learning Representations (2020)
Bender, G., et al.: Can weight sharing outperform random architecture search? An investigation with Tu-NAS. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14323–14332. IEEE (2020)
Li, G., Qian, G., Delgadillo, I.C., Muller, M., Thabet, A., Ghanem, B.: SGAS: sequential greedy architecture search. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1620–1630. IEEE (2020)
Idrees, H., Saleemi, I., Seibert, C., Shah, M.: Multi-source multi-scale counting in extremely dense crowd images. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2547–2554. IEEE (2013)
Zoph, B., Vasudevan, V., Shlens, J., Le, Q. V.: Learning transferable architectures for scalable image recognition. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8697–8710. IEEE (2018)
Zoph, B., Le, Q.V.: Neural architecture search with reinforcement learning. In: ICLR (2016)
Ghiasi, G., Lin, T.-Y., Le, Q.V.: NAS-FPN: learning scalable feature pyramid architecture for object detection. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7036–7045. IEEE (2019)
Brostow, G.J., Cipolla, R.: Unsupervised Bayesian detection of independent motion in crowds. In: 2006 IEEE Computer Society Conference on Computer Vision and Pat-tern Recognition, vol. 1, pp. 594–601. IEEE (2016)
Lin, S.-F., Chen, J.-Y., Chao, H.-X.: Estimation of number of people in crowded scenes using perspective transformation. Syst. Man Cybern. 31(6), 645–654 (2001)
Chan, A.B., Vasconcelos, N.: Counting people with low-level features and Bayesian regression. IEEE Trans. Image Process. 21(4), 2160–2177 (2012)
Chen, K., Gong, S., Xiang, T., Loy, C.C.: Cumulative attribute space for age and crowd density estimation. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2467–2474. IEEE (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, J., Chu, Q., Li, W., Liu, B., Zhang, W., Yu, N. (2021). Towards More Powerful Multi-column Convolutional Network for Crowd Counting. In: Peng, Y., Hu, SM., Gabbouj, M., Zhou, K., Elad, M., Xu, K. (eds) Image and Graphics. ICIG 2021. Lecture Notes in Computer Science(), vol 12888. Springer, Cham. https://doi.org/10.1007/978-3-030-87355-4_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-87355-4_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87354-7
Online ISBN: 978-3-030-87355-4
eBook Packages: Computer ScienceComputer Science (R0)