Abstract
Different types of visual object categories can be found in real-world applications. Some categories are very heterogeneous in terms of local features (broad categories), while others are consistently characterized by a few highly distinctive local features (narrow categories). The work described in this paper was motivated by the need to develop representations and categorization mechanisms that can be applied to domains involving both types of categories. A second concern is that these representations and mechanisms should have the potential to scale up to large numbers of categories. The approach is based on combining global shape descriptors with local features. A new shape representation is proposed. Two additional representations are used, one also capturing the object’s shape and another based on sets of highly distinctive local features. A basic classifier following the nearest-neighbor rule was implemented for each representation. A meta-level classifier, based on a voting strategy, was also implemented. The relevance of each representation and classifier to both broad and narrow categories is evaluated on two datasets with a combined total of 114 categories.
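The architecture outlined in the abstract (one nearest-neighbor classifier per representation, combined by a voting meta-classifier) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the representation names (`shape_a`, `shape_b`, `local`), the use of fixed-length vectors with Euclidean distance for all three representations (the paper's local-feature representation is a set of features, which would need a different matching measure), the plurality-vote rule, and the tie-breaking policy.

```python
import math
from collections import Counter


def nearest_neighbor(query, examples):
    """Return the label of the stored example closest to `query`.

    `examples` is a list of (vector, label) pairs. Euclidean distance is an
    assumption for this sketch, not the paper's similarity measure.
    """
    best_label, best_dist = None, math.inf
    for vector, label in examples:
        dist = math.dist(query, vector)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def vote(predictions):
    """Plurality vote; ties go to the label that appears first in the list."""
    counts = Counter(predictions)
    top = max(counts.values())
    for label in predictions:
        if counts[label] == top:
            return label


def classify(query_by_repr, memory_by_repr):
    """Run one 1-NN classifier per representation, then combine by voting."""
    predictions = [
        nearest_neighbor(query_by_repr[name], memory_by_repr[name])
        for name in memory_by_repr
    ]
    return vote(predictions), predictions


# Hypothetical toy data: three representations of two stored objects.
memory = {
    "shape_a": [((0.0, 0.0), "cup"), ((1.0, 1.0), "ball")],
    "shape_b": [((0.0,), "cup"), ((1.0,), "ball")],
    "local": [((0.0, 0.0, 0.0), "cup"), ((1.0, 1.0, 1.0), "ball")],
}
query = {"shape_a": (0.1, 0.2), "shape_b": (0.9,), "local": (0.2, 0.1, 0.0)}

label, per_classifier = classify(query, memory)
print(per_classifier, "->", label)
```

In this toy run the second shape classifier disagrees with the other two, and the vote resolves the disagreement. This is the general motivation for a meta-level classifier when no single representation is reliable for both broad and narrow categories.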
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pereira, R., Seabra Lopes, L. (2009). Learning Visual Object Categories with Global Descriptors and Local Features. In: Lopes, L.S., Lau, N., Mariano, P., Rocha, L.M. (eds) Progress in Artificial Intelligence. EPIA 2009. Lecture Notes in Computer Science, vol 5816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04686-5_19
DOI: https://doi.org/10.1007/978-3-642-04686-5_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04685-8
Online ISBN: 978-3-642-04686-5
eBook Packages: Computer Science (R0)