One-Shot Learning of Sketch Categories with Co-regularized Sparse Coding

Qi, Yonggang; Zheng, Wei-Shi; Xiang, Tao; Song, Yi-Zhe; Zhang, Honggang; Guo, Jun

doi:10.1007/978-3-319-14364-4_8

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8888)

Included in the following conference series:

International Symposium on Visual Computing

Abstract

Categorizing free-hand human sketches has profound implications in applications such as human-computer interaction and image retrieval. The task is non-trivial due to the iconic nature of sketches, signified by large variances in both appearance and structure when compared with photographs. Prior works often utilize off-the-shelf low-level features and assume the availability of a large training set, rendering them sensitive to abstraction and less scalable to new categories. To overcome these limitations, we propose a transfer learning framework which enables one-shot learning of sketch categories. The framework is based on a novel co-regularized sparse coding model which exploits common/shareable parts among human sketches of seen categories and transfers them to unseen categories. We contribute a new dataset consisting of 7,760 human-segmented sketches from 97 object categories. Extensive experiments reveal that the proposed method can classify unseen sketch categories given just one training sample with 33.04% accuracy, offering a two-fold improvement over baselines.
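
The abstract does not spell out the co-regularized model, so the following is only a minimal sketch of the general idea it describes: encode sketches over a dictionary of shared parts, then label a query from a single exemplar per unseen category. It is not the authors' method. It assumes a dictionary `D` has already been learned from seen categories, substitutes plain L1 sparse coding (solved with ISTA) for the co-regularized objective, and uses cosine similarity between codes as a stand-in classifier. The names `sparse_code` and `one_shot_classify` are hypothetical.

```python
import numpy as np


def soft_threshold(v, t):
    """Element-wise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def sparse_code(x, D, lam=0.1, n_iter=200):
    """Approximately solve min_a 0.5 * ||x - D a||^2 + lam * ||a||_1 with ISTA.

    D is an (n_features, n_atoms) dictionary, assumed here to have been
    learned beforehand from sketches of seen categories.
    """
    step_bound = np.linalg.norm(D, 2) ** 2  # Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)
        a = soft_threshold(a - grad / step_bound, lam / step_bound)
    return a


def one_shot_classify(query, exemplars, D, lam=0.1):
    """Label a query using one exemplar feature vector per unseen category.

    exemplars maps a category label to a single feature vector. The query
    receives the label whose exemplar code is most similar (cosine) to the
    query code. This nearest-exemplar rule is an illustrative assumption.
    """
    q = sparse_code(query, D, lam)
    best_label, best_sim = None, -np.inf
    for label, x in exemplars.items():
        c = sparse_code(x, D, lam)
        denom = np.linalg.norm(q) * np.linalg.norm(c)
        sim = float(q @ c) / denom if denom > 0 else 0.0
        if sim > best_sim:
            best_label, best_sim = label, sim
    return best_label
```

Because unseen categories are represented over atoms learned elsewhere, no per-category training beyond the single exemplar is needed in this sketch; that is the property the transfer framework in the abstract is built around, though the paper's actual model adds co-regularization that is not reproduced here.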

Author information

Authors and Affiliations

School of Information and Communication Engineering, BUPT, Beijing, China
Yonggang Qi, Honggang Zhang & Jun Guo
School of Information Science and Technology, Sun Yat-sen University, China
Wei-Shi Zheng
School of EECS, Queen Mary, University of London, London, E1 4NS, UK
Tao Xiang & Yi-Zhe Song

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Nevada at Reno, USA
George Bebis
NASA Ames Research Center, Moffett Field, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, Reno, NV, USA
Darko Koracin
The University of Texas at Dallas, 75080, Richardson, TX, USA
Ryan McMahan
NextGen Interactions, 27604, Raleigh, NC, USA
Jason Jerald
Indiana University, 46202, Indianapolis, IN, USA
Hui Zhang
Microsoft Research, 1 Microsoft Way, 98052, Redmond, WA, USA
Steven M. Drucker
University of Delaware, 19716-2712, Newark, DE, USA
Chandra Kambhamettu
Intel Corp., 95054, Santa Clara, CA, USA
Maha El Choubassi
Computer Graphics and Interactive Media Lab, Department of Computer Science, University of Houston, 77004, Houston, TX, USA
Zhigang Deng
NVIDIA, 34788, Leesburg, FL, USA
Mark Carlson

About this paper

Cite this paper

Qi, Y., Zheng, WS., Xiang, T., Song, YZ., Zhang, H., Guo, J. (2014). One-Shot Learning of Sketch Categories with Co-regularized Sparse Coding. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2014. Lecture Notes in Computer Science, vol 8888. Springer, Cham. https://doi.org/10.1007/978-3-319-14364-4_8

DOI: https://doi.org/10.1007/978-3-319-14364-4_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14363-7
Online ISBN: 978-3-319-14364-4
eBook Packages: Computer Science, Computer Science (R0)
