Abstract
Traditional database systems assume that precise query concepts can be specified by users (for example, by using query languages). For many search tasks, however, a query concept is hard to articulate, and articulation can be subjective. Most users would find it hard to describe an image or a music query in low-level perceptual features. We believe that one desirable paradigm for search engines is to mine (i.e., to learn) users’ query concepts through active learning. In this paper, we formulate the query-concept learning problem as finding a binary classifier that separates relevant objects from those that are irrelevant to the query concept. We propose two hybrid algorithms, pipeline learning, and co-training, that are built on top of two active learning algorithms. Our empirical study shows that even when the feature dimension is very high and target concepts are very specific, the hybrid algorithms can grasp a complex query concept in a small number of user iterations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Beitao Li, Edward Chang, and Chung-Sheng Li. Learning image query concepts via intelligent sampling. Proceedings of IEEE Multimedia and Expo, August 2001.
Simon Tong and Edward Chang. Support vector machine active learning for image retrieval. Proceedings of ACM International Conference on Multimedia, pages 107–118, October 2001.
Edward Chang and Beitao Li. Mega-the maximizing expected generalization algorithm for learning complex query concepts (extended version). Technical Report http://www-db.stanford.edu/~echang/mega-tois_OnlinePDF.pdf, November 2000.
Michael Kearns, Ming Li, and Leslie Valiant. Learning boolean formulae. Journal of ACM, 41(6):1298–1328, 1994.
A. Blum and T. Mitchell. Combining Labeled and Unlabeled Data wih Co-Training. Proceedings of the Workshop on Computational Learning Theory, 1998.
Edward Chang, Kwang-Ting Cheng, and Lisa Chang. PBIR-perception-based image retrieval. ACM Sigmod (Demo), May 2001.
James Wang, Jia Li, and Gio Wiederhold. Simplicity: Semantics-sensitive integrated matching for picture libraries. ACM Multimedia Conference, 2000.
Beitao Li, Wei-Cheng Lai, Edward Chang, and Kwang-Ting Cheng. Minig image features for efficient query processing. Proceedings of IEEE Data Mining, November 2001.
Y. Ishikawa, R. Subramanya, and C. Faloutsos. Mindreader: Querying databases through multiple examples. VLDB, 1998.
Leejay Wu, Christos Faloutsos, Katia Sycara, and Terry R. Payne. Falcon: Feedback adaptive loop for content-based retrieval. The 26th VLDB Conference, September 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lai, WC., Chang, E., Cheng, KT.T. (2002). Hybrid Learning Schemes for Multimedia Information Retrieval. In: Chen, YC., Chang, LW., Hsu, CT. (eds) Advances in Multimedia Information Processing — PCM 2002. PCM 2002. Lecture Notes in Computer Science, vol 2532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36228-2_69
Download citation
DOI: https://doi.org/10.1007/3-540-36228-2_69
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00262-8
Online ISBN: 978-3-540-36228-9
eBook Packages: Springer Book Archive