Abstract
In multimedia information processing, while the previous focus was on image/video retrieval, content-based categorization and retrieval of 3D computer graphics model is becoming increasingly important. This is due to the increased adoption of 3D graphics representations in multimedia applications and the resulting need for rapid virtual scene assembly from a repository of 3D models. Motivated by these requirements, the main focus of this paper is on the content-based classification and retrieval of 3D computer graphics models based on a histogram feature representation, and the search for an adaptive transformation of this representation such that the resulting classification and retrieval accuracies are optimized. Observing that a histogram is basically an approximation of the probability density function of an underlying random variable, and that a suitable transformation, when applied to the random variable, will allow the classifier to attain better accuracy based on this new representation, we propose an evolutionary optimization approach to search for this set of optimal transformations due to the large size of the search space. In particular, we consider the special class of transformations that take the form of a piecewise continuous mapping. In this case, the transformed variable is a mixed random variable, with both discrete and continuous components, which provides added flexibility for modeling a number of more diverse random variable types. With a suitably defined fitness function for evolutionary strategies (ES) that measures the capability of the transformed histogram representation to induce the correct class structure, our proposed approach is capable of improving the head model classification performance, which in turn allows, in the case of content-based retrieval, the correct preassignment of a query object to its correct class for more efficient search, even in those cases where the query is ambiguous and difficult to characterize.
Similar content being viewed by others
References
Ankerst, M., Kastenmuller, G., Kriegel, H.P., Seidl, T.: Nearest neighbor classification in 3D protein databases. In: Proceedings of the 7th International Conference on Intelligent Systems for Molecular Biology, pp. 34–43. Heidelberg, Germany (1999)
Ankerst, M., Kastenmuller, G., Kriegel, H.P., Seidl, T.: 3D shape histograms for similarity search and classification in spatial databases. In: Proceedings of the 6th International Symposium on Large Spatial Databases (SSD'99) pp. 207–226. Hong Kong, China (1999)
Antoine, J.P., Demanet, L., Jacques, L., Vandergheynst, P.: Wavelets on the sphere: implementation and approximation. Appl. Comput. Harmon. Anal. 13, 177–200 (2002)
Back, T.: Evolutionary Algorithms in Theory and Practice. Oxford: Oxford University Press (1996)
Basri, R., Weinshall, D.: Distance metric between 3D models and 2D images for recognition and classification. IEEE Trans. Pattern Anal. Mach. Intell. 18(4), 465–470 (1996)
Beyer, H.G.: The Theory of Evolution Strategies. Berlin Heidelberg New York: Springer (2001)
Duin, R.P.W.: The combining classifier: to train or not to train? In: Proceedings of the 16th International Conference on Pattern Recognition, vol. 2, pp. 765–770. Quebec City, Canada (2002)
Fogel, D.B.: Evolutionary Computation: Toward a New Philosophy of Machine Intelligence, 2nd edn. Piscataway, NJ: IEEE Press (1998)
Funkhouser, T., Min, P., Kazhdan, M., Chen, J., Halderman, A., Dobkin, D.: A search engine for 3D models. ACM Trans. Graph. 21(3), 83–105 (2003)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Reading, MA: Addison-Wesley (1989)
Groemer, H.: Geometric Applications of Fourier Series and Spherical Harmonics. New York: Cambridge University Press (1996)
Hilaga, M., Shinagawa, Y., Kohmura, T., Kunii, T.: Topology matching for fully automatic similarity estimation of 3D shapes. In: Proceedings of ACM SIGGRAPH '01, pp. 203–212. Los Angeles (2001)
Ho, T.K.: Decision combination in multiple classifier systems. IEEE Trans. Pattern Anal. Mach. Intell. 16(1), 66–75 (1994)
Horn, B.K.P.: Extended gaussian images. Proc. IEEE 72(12), 1671–1686 (1984)
Ip, H.H.S., Wong, W.Y.F.: 3D head models retrieval based on hierarchical facial region similarity. In: 15th Vision Interface pp. 314–319. Calgary, Canada (2002)
Keim, D.: Efficient geometry-based similarity search for 3D spatial databases. In: Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data and Symposium on Principles of Database Systems, pp. 419–430. Philadelphia (1999)
Kittler, J., Alkoot, F.M.: Sum versus vote fusion in multiple classifier systems. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 110–115 (2003)
Kitter, J., Hatef, M., Duin, R.P.W., Matax, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)
Kuncheva, L.I.: A theoretical study on six classifier fusion strategies. IEEE Trans. Pattern Anal. Mach. Intell. 24(2), 281–286 (2002)
Kuncheva, L.I., Jain, L.C.: Designing classifier fusion systems by genetic algorithms. IEEE Trans. Evol. Comput. 4(4), 327–336 (2000)
Lau, R.W.H., Wong, B.: Web-based 3D geometry model retrieval. World Wide Web: Internet and Web Information Systems, vol. 5, pp. 193–206 (2002)
Newall, M., Meny, D., Hoon, S.: 3D performance matching for terminator 3. In: Sketches and Applications, SIGGRAPH 2003, pp. 27–31. San Diego (2003)
Novotni, M., Klein, R.: A geometric approach to 3D object comparison. In: Proceedings of the International Conference on Shape Modelling and Applications, pp. 167–175. Genova, Italy (2001)
Osada, R., Funkhouser, T., Chazelle, B., Dobkin, D.: Matching 3D models with shape distributions. In: Proceedings of the International Conference on Shape Modelling and Applications, pp. 154–166. Genova, Italy (2001)
Osada, R., Funkhouser, T., Chazelle, B., Dobkin, D.: Shape distributions. ACM Trans. Graph. 21(4), 807–832 (2002)
Paquet, E., Rioux, M.: Content-based access of VRML libraries. In: Proceedings of the IAPR International Workshop on Multimedia Information Analysis and Retrieval, pp. 20–32. Hong Kong, China (1998)
Paquet, E., Rioux, M., Murching, A., Naveen, T., Tabatabai, A.: Description of shape information for 2-D and 3-D objects. Signal Process. Image Commun. 16, 103–122 (2000)
Saupe, D., Vranic, D.V.: 3D model retrieval with spherical harmonics and moments. In: Proceedings of the DAGM 2001, pp. 392–397. Munich (2001)
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(11), 1349–1380 (2000)
Swain, M.J., Ballard, D.H.: Color indexing. Int. J. Comput. Vis. 7(1), 11–32 (1991)
Villalobos, L., Merat, F.L.: 3D modeling and indexing for CAD-based object recognition. In: IEEE International Conference on Robotics and Automation, vol. 3, pp. 1965–1972 (1994)
Vranic, D.V., Saupe, D., Richter, J.: Tools for 3D-object retrieval: Karhunen–Loeve transform and spherical harmonics. In: Proceedings of the IEEE 2001 Workshop Multimedia Signal Processing pp. 293–298. Cannes, France (2001)
Author information
Authors and Affiliations
Corresponding author
Additional information
Hau San Wong is currently an assistant professor in the Department of Computer Science, City University of Hong Kong. He received the B.Sc. and M.Phil. degrees in electronic engineering from the Chinese University of Hong Kong and the Ph.D. degree in electrical and information engineering from the University of Sydney. He has also held research positions at the University of Sydney and Hong Kong Baptist University. His research interests include multimedia signal processing, neural networks, and evolutionary computation. He is the coauthor of the book Adaptive Image Processing: A Computational Intelligence Perspective, which is a joint publication of CRC Press and SPIE Press, and was an organizing committee member of the 2000 IEEE Pacific Rim Conference on Multimedia and 2000 IEEE Workshop on Neural Networks for Signal Processing, both of which were held in Sydney, Australia. He has also co-organized a number of conference special sessions, including the special session on “Image Content Extraction and Description for Multimedia” at the 2000 IEEE International Conference on Image Processing, Vancouver, Canada, and “Machine Learning Techniques for Visual Information Retrieval” at the 2003 International Conference on Visual Information Retrieval, Miami, FL.
K.T. Cheung received his B.Sc. (first class honours) and Ph.D. in the Department of Computer Science, City University of Hong Kong in 1996 and 2002, respectively. He worked as a research staff and a part-time lecturer in the same department until 2004. During his years at City University of Hong Kong, he was involved in a wide range of projects, such as content-based retrieval of color logos, intelligent retrieval of histological images, 3D head model classification and retrieval, and an object-oriented framework for image representation and retrieval. In 2004 he joined the Department of Computing at the Hong Kong Polytechnic University as a visiting assistant professor. His research interests include content-based retrieval of images and 3D models, image and 3D model classification, and evolutionary optimization.
Chun Ip Chiu received his B.S. with first class honors in 2002 and his M.Phil. in computer science in 2004, both from the City University of Hong Kong. His research interests include image processing, evolutionary computation, and content-based image retrieval and classification.
Horace H. S. Ip received his B.Sc. (first class honors) in applied physics and Ph.D. in image processing from University College London, UK in 1980 and 1983, respectively. Presently, he is the chair professor of the computer science department and the founding director of the AIMtech Centre (Centre for Innovative Applications of Internet and Multimedia Technologies) at the City University of Hong Kong. His research interests include image processing and analysis, pattern recognition, hypermedia systems in education, and computer graphics. Prof. Ip is the chairman of the IEEE (Hong Kong section) Computer Chapter and the founding president of the Hong Kong Society for Multimedia and Image Computing. He has published over 160 papers in international journals and conference proceedings. Prof. Ip is a member of the IEEE, a fellow of the Hong Kong Institution of Engineers (HKIE), fellow of the Institution of Engineers (IEE), UK, and fellow of the International Association for Pattern Recognition (IAPR).
Rights and permissions
About this article
Cite this article
Wong, HS., Cheung, K.K.T., Chiu, CI. et al. Application of evolutionary strategies for 3D graphical model categorization and retrieval. Multimedia Systems 10, 422–431 (2005). https://doi.org/10.1007/s00530-005-0171-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-005-0171-x