Abstract
In this paper, we address the problem of image content characterization in the compressed domain for the facilitation of similarity matching in content-based image retrieval. Specifically, given the disparity of the content characterization power of compressed domain approaches and those based on pixel-domain features, with the latter being usually considered as the more superior one, our objective is to transform the selected set of compressed domain feature histograms in such a way that the retrieval result based on these features is compatible with their spatial domain counterparts. Since there are a large number of possible transformations, we adopt a genetic algorithm approach to search for the optimal one, where each of the binary strings in the population represents a candidate transformation. The fitness of each transformation is defined as a function of the discrepancies between the spatial-domain and compressed-domain retrieval results. In this way, the GA mechanism ensures that transformations which best approximate the performance of spatial domain retrieval will survive into the next generation and are allowed through the operations of crossover and mutation to generate variations of themselves to further improve their performances.
Similar content being viewed by others
References
N. Ahmed, T. Natarajan, and K.R. Rao“Discrete cosine transform,” IEEE Trans. Comput., Vol. 23, pp. 90–93, 1974.
P. Aigrain, H. Zhang, and D. Petkovic“Content-based representation and retrieval of visual media: A state of the art review,” Multimedia Tools and Applications, Vol. 3, No. 3, pp. 179–202, 1996.
T. Bäck, Evolutionary Algorithms in Theory and Practice. Oxford Univ. Press: New York, 1996.
T. Bäck, U. Hammel, and H.-P. Schwefel“Evolutionary computation: Comments on the history and current state,” IEEE Trans. Evolutionary Comp., Vol. 1, No. 1, pp. 3–17, 1997.
S.-K. Chang, Q.Y. Shi, and C.W. Yan“Iconic indexing by 2-D strings,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 9, No. 3, pp. 413–428, 1987.
S.-F. Chang, J.R. Smith, M. Beigi, and A. Benitez“Visual information retrieval from large distributed online repositories,” Comm. ACM, Vol. 40, No. 12, pp. 63–71, 1997.
J.M. Corridoni, A. del Bimbo, and P. Pala“Image retrieval by color semantics,” Multimedia Systems, Vol. 7, No. 3, pp. 175–183, 1999.
I.J. Cox, M.L. Miller, T.P. Minka, T.V. Papathomas, and P.N. Yianilos“The Bayesian image retrieval system, picHunter: Theory, implementation and psychophysical experiments,” IEEE Trans. Image Proc., Vol. 9, No. 1, pp. 20–37, 2000.
J.P. Eakins, J.M. Boardman, and M.E. Graham“Similarity retrieval of trademark images,” IEEE Multimedia, Vol. 5, No. 2, pp. 53–63, 1998.
D.B. Fogel, Evolutionary Computation: Toward a New Philosophy of Machine Intelligence. IEEE Press: Piscataway, NJ, 1995.
D.E. Goldberg, Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley: Reading, MA, 1989.
W.I. Grosky“Multimedia information systems,” IEEE Multimedia, Vol. 1, No. 1, pp. 12–24, 1994.
A. Gupta and R. Jain“Visual information retrieval,” Comm. ACM, Vol. 40, No. 5, pp. 71–79, 1997.
J. Hafner, H.S. Sawhney, W. Equitz, M. Flickner, and W. Niblack“Efficient color histogram indexing for quadratic form distance functions,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 17, No. 7, pp. 729–736, 1995.
J.H. Holland, Adaptation in Natural and Artificial Systems. Univ. of Michigan Press: Ann Arbor, MI, 1975.
N.R. Howe and D.P. Huttenlocher“Integrating color, texture and geometry for image retrieval,” in Proc. CVPR, 2000, pp. 239–247.
C.C. Hsu, W.W. Chu, and R.K. Taira“A knowledge-based approach for retrieving images by content,” IEEE Trans. on Knowledge and Data Engineering, Vol. 8, No. 4, pp. 522–532, 1996.
J. Huang, S.R. Kumar, M. Mitra, W.-J. Zhu, and R. Zabih“Spatial color indexing and applications,” Int. J. Computer Vision, Vol. 35, No. 3, pp. 245–268, 1999.
B. Huet and E.R. Hancock“Line pattern retrieval using relational histograms,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 21, No. 12, pp. 1363–1371, 1999.
A.K. Jain and A. Vailaya“Image retrieval using color and shape,” Pattern Recognition, Vol. 29, No. 8, pp. 1233–1244, 1996.
A.K. Jain and A. Vailaya“Shape-based retrieval: A case study with trademark image databases,” Pattern Recognition, Vol. 31, No. 9, pp. 1369–1390, 1998.
J.A. Lay and L. Guan“Image retrieval based on energy histograms of the low frequency DCT coefficients,” in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc., 1999, pp. 3009–3012.
M.K. Mandal, F. Idris, and S. Panchanathan“A critical evaluation of image and video indexing techniques in the compressed domain,” Image and Vision Computing, Vol. 17, No. 7, pp. 513–529, 1999.
B.S. Manjunath and W.Y. Ma“Texture features for browsing and retrieval of image data,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 837–842, 1996.
R. Mehrotra and J.E. Gary“Similar-shape retrieval in shape data management,” IEEE Computer, Vol. 28, No. 9, pp. 57–62, 1995.
M. Mitchell, An Introduction to Genetic Algorithms. MIT Press: Cambridge MA, 1996.
P. Pala and S. Santini“Image retrieval by shape and texture,” Pattern Recognition, Vol. 32, No. 3, pp. 517–527, 1999.
W.B. Pennebaker and J.L. Mitchell, JPEG Still Image Compression Standard, Van Nostrand Reinhold: New York, NY, 1993.
A. Pentland, R.W. Picard, and S. Sclaroff“Photobook: Content-based manipulation of image databases,” Int. J. Computer Vision, Vol. 18, No. 3, pp. 233–254, 1996.
G.R. Roussas, A Course in Mathematical Statistics. Academic Press: San Diego, Calif., 1997.
Y. Rui, T.S. Huang, and S.-F. Chang“Image retrieval: Current techniques, promising directions and open issues,” J. Visual Comm. and Image Representation, Vol. 10, No. 1, pp. 39–62, 1999.
M. Schneier and M. Abdel-Mottaleb“Exploiting the JPEG compression scheme for image retrieval,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 849–853, 1996.
M.J. Swain and B.H. Ballard“Color indexing,” Int. J. Computer Vision, Vol. 7, No. 1, pp. 11–32, 1991.
A. Vailaya, A.K. Jain, and H.J. Zhang“On image classification: City images vs landscapes,” Pattern Recognition, Vol. 31, No. 12, pp. 1921–1935, 1998.
G.K. Wallace“The JPEG still picture compression standard,” Communications of the ACM, Vol. 34, pp. 30–44, 1991.
J. Z. Wang, G. Wiederhold, O. Firschein, and S.X. Wei“Content-based image indexing and searching using Daubechies’ wavelets,” Int. J. Digital Libraries, Vol. 1, pp. 311–328, 1997.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by a grant from City University of Hong Kong (Project No. 7100220).
Hau San Wong is currently an assistant professor in the Department of Computer Science, City University of Hong Kong. He received the BSc and MPhil degrees in Electronic Engineering from the Chinese University of Hong Kong, and the PhD degree in Electrical and Information Engineering from the University of Sydney. He has also held research positions in the University of Sydney and Hong Kong Baptist University. His research interests include multimedia signal processing, neural networks and evolutionary computation. He is the co-author of the book Adaptive Image Processing: A Computational Intelligence Perspective, which is a joint publication of CRC Press and SPIE Press, and was an organizing committee member of the 2000 IEEE Pacific-Rim Conference on Multimedia and 2000 IEEE Workshop on Neural Networks for Signal Processing, which were both held in Sydney, Australia. He has also co-organized a number of conference special sessions, including the special session on “Image Content Extraction and Description for Multimedia” in 2000 IEEE International Conference on Image Processing, Vancouver, Canada, and“Machine Learning Techniques for Visual Information Retrieval” in 2003 International Conference on Visual Information Retrieval, Miami, Florida.
Horace H. S. Ip received his B.Sc. (First Class Honours) degree in Applied Physics and Ph.D. degree in Image Processing from University College London, United Kingdom, in 1980 and 1983 respectively. Presently, he is the Chair Professor of the Computer Science Department and the founding director of the AIMtech Centre (Centre for Innovative Applications of Internet and Multimedia Technologies) at City University of Hong Kong. His research interests include image processing and analysis, pattern recognition, hypermedia systems in education and computer graphics.
Prof. Ip the Chairman of the IEEE (Hong Kong Section) Computer Chapter, and the Founding President of the Hong Kong Society for Multimedia and Image Computing. He has published over 160 papers in international journals and conference proceedings. Prof. Ip is a member of the IEEE, a Fellow of the Hong Kong Institution of Engineers (HKIE), Fellow of the Institution of Engineers (IEE), UK and Fellow of the International Association for Pattern Recognition (IAPR).
Lawrence Iu was awarded the Hong Kong Ten Most Outstanding Students Award in 2000. He has studied in Cornell University, USA and attained the Dean’s Honor List for outstanding scholastic performance. He was a research assistant in the City University of Hong Kong in 2002, and is currently pursuing the Bachelor of Medicine& Bachelor of Surgery degree in the University of Hong Kong.
Kent K. T. Cheung received BSc. (first class honours) and PhD. degrees in the Department of Computer Science, City University of Hong Kong in 1996 and 2002 respectively. He worked as a research staff and a part-time lecturer in the same department until 2004. Within the period of his years in City University of Hong Kong, he involved in a wide range of projects, such as content-based retrieval of color logos, intelligent retrieval of histological images, 3D head model classification and retrieval and an object oriented framework for image representation and retrieval. In 2004, he joined the Department of Computing at The Hong Kong Polytechnic University as a Visiting Assistant Professor. His research interests include content-based retrieval of images and 3D models, image and 3D model classification and evolutionary optimization.
Ling Guan received his Bachelor Degree in Electronic Engineering from Tianjin University, China in 1982, Master’s degree in Systems Design Engineering at University of Waterloo, Canada in 1985, and Ph.D. Degree in Electrical Engineering from University of British Columbia, Canada in 1989. From 1993 to 2000, he was on the Faculty of Engineering at the University of Sydney, Australia. Since May 2001, he has been a professor and director of Ryerson Multimedia Research Laboratory at Ryerson University, Toronto, Canada. In November 2001, he was appointed to the position of Canada Research Chair in Multimedia. Dr. Guan held visiting positions at British Telecom (1994), Tokyo Institute of Technology (1999), Princeton University (2000), Microsoft Research Asia (2002). Dr. Guan’s research interests include human-centered computing, multimedia indexing and retrieval, human-computer interface, transmission of multimedia data over P2P networks, machine learning, and adaptive image and signal processing. He has authored/co-authored more than 200 technical publications, including 50 refereed journal papers, two books and two patents. Dr. Guan is an associate editor/guest editor of numerous international journals, including Proceedings of the IEEE, and two IEEE Transactions. He also serves on the editorial board of CRC Press’ Book Series on Image Processing. He has involved in organizing many international conferences. He was the Founding General Chair of IEEE Pacific-Rim Conference on Multimedia, and currently serves as the General Chair of 2006 IEEE International Conference on Multimedia and Expo to be held in Toronto, Canada. Dr. Guan is a Senior Member of IEEE, and a Member of IAPR. Currently he is serving on IAPR Technical Committee on Structural and Syntactic Pattern Recognition and is on the Advisory Board of International Computational Intelligence Society. He was a member of IEEE SP Society Technical Committee on Multimedia Signal Processing (2000–2003) and a member of IEEE SP Society Technical Committee on Neural Networks in Signal Processing (1997–2000).
Rights and permissions
About this article
Cite this article
Wong, HS., Ip, H.H.S., Iu, L.P.L. et al. Transformation of Compressed Domain Features for Content-Based Image Indexing and Retrieval. Multimed Tools Appl 26, 5–26 (2005). https://doi.org/10.1007/s11042-005-6847-6
Issue Date:
DOI: https://doi.org/10.1007/s11042-005-6847-6