Abstract
Many robotics and mechatronics systems rely on a fast analysis of visual landmarks. Recently, binary feature representations of the popular SIFT and SURF landmarks have been proposed that offer large speed improvements and low memory consumption at high accuracy. In this paper, we compare a binarisation based on median-centred hyperplanes to the dominating approach of random hyperplanes. We describe the algorithms in a joint taxonomy and show that the kernel for median-centred hyperplanes satiesfies Mercer’s condition. Speed and accuracy are benchmarked in a registration and classification task. Both methods achieve the same dramatic speedup in kernel evaluation. But we show that median-centred hyperplanes are faster in binarisation, find better matches and generalise better over pose and individual variation in the classification.
Similar content being viewed by others
References
André B, Vercauteren T, Buchner AM, Wallace MB, Ayache N (2011) Retrieval evaluation and distance learning from perceived similarity between endomicroscopy videos. In: Medical Image Computing and Computer Assisted Intervention (MICCAI), LNCS. Springer, Heidelberg
Anvaripour M, Ebrahimnezhad H (2013) Accurate object detection using local shape descriptors. Pattern Anal Appl
Bay H, Ess A, Tuytelaars T, van Gool L (2006) SURF: speeded up robust features. Computer Vision Image Underst (CVIU) 110(3):346–359
Chandrasekhar V, Takacs G, Chen DM, Tsai SS, Singh JP, Girod B (2009) Transform coding of image feature descriptors. In: visual Communication and Image Processing (VCIP)
Chen H, Sun D, Yang J (2009) Global localization of multirobot formations using ceiling vision SLAM strategy. Mechatronics 19(5):617–628
Choras M, Kozik R (2013) Contactless palmprint and knuckle biometrics for mobile devices. Pattern Anal Appl 15:73–85
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Dani A, Fischer N, Kan Z, Dixon W (2012) Globally exponentially stable observer for vision-based range estimation. Mechatronics 22(4):381–389
Diephuis M, Voloshynovskiy S, Koval O, Beekhof F (2011) Statistical Analysis of Binarized SIFT Descriptors. In: International Symposium on Image and Signal Processing and Analysis (ISPA)
Dong Y, Gao S, Tao K, Liu J, Wang H (2013) Performance evaluation of early and late fusion methods for generic semantics indexing. Pattern Anal Appl
Donoho DL, Tanner J (2010) Counting the faces of randomly-projected hypercubes and orthants, with applications. Discrete Comput Geom 43:522–541
Edelkamp S, Stommel M (2012) The bitvector machine: a fast and robust machine learning algorithm for non-linear problems. In: Flach PA, Bie TD, Cristianini N (eds) European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), pp 175–190. Springer .
Gong Y, Lazebnik S (2011) Iterative quantization: a procrustean approach to learning binary codes. In: Computer Vision and Pattern Recognition (CVPR)
Hsu CW, Lin CJ (2002) A comparison of methods for multi-class support vector machines. IEEE Trans Neural Networks 13:415–425
Jain V, Learned-Miller E (2010) Fddb: A benchmark for face detection in unconstrained settings. Tech Rep UM-CS-2010-009, University of Massachusetts, Amherst
Jarrett K, Kavukcuoglu K, Ranzato M, LeCun Y (2009) What is the best multi-stage architecture for object recognition? In: International Conference on Computer Vision (ICCV)
Ke Y, Sukthankar R (2004) PCA-SIFT: A More Distinctive Representation for Local Image Descriptors. Computer Vision and Pattern Recognition (CVPR) 2:506–513
Kuo YH, Lin HT, Cheng WH, Yang YH, Hsu WH (2011) Unsupervised auxiliary visual words discovery for large-scale image object retrieval. In: IEEE Comp Vision Pattern Recognit (CVPR)
Lee IH, Chai TS (2013) Accurate registration using adaptive block processing for multi-spectral images. Circuits Syst Video Technol, IEEE Trans on PP(99), 1–1
Lowe DG (1999) Object recognition from local scale-invariant features. In: International Converence on Computer Vision (ICCV), pp 1150–1157
Lyu S (2005) Mercer kernels for object recognition with local features. Comp Vision Pattern Recognit (CVPR) 2:223–229
Makar M, Chang CL, Chen D, Tsai SS, Girod B (2009) Compression of image patches for local feature extraction. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) pp. 821–824. DOI http://doi.ieeecomputersociety.org/10.1109/ICASSP.2009.4959710
Mikolajczyk K, Leibe B, Schiele B (2005) Local features for object class recognition. In: International Conference on Computer Vision (ICCV’05)
Mikolajczyk K, Leibe B, Schiele B (2006) Multiple object class detection with a generative model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’06)
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans. Pattern Anal Mach Intell 27(10):1615–1630
Minh HQ, Niyogi P, Yao Y (2006) Mercer’s theorem, feature maps, and smoothing. In: The 19th Annual Conference on Learning Theory (COLT), pp 154–168
Mühling M, Ewerth R, Freisleben B (2011) On the spatial extents of SIFT descriptors for visual concept detection. In: International Conference on Computer Vision Systems (ICVS), pp 71–80. Springer, Berlin, Heidelberg
Opelt A, Fussenegger M, Pinz A, Auer P (2004) Weak hypotheses and boosting for generic object detection and recognition. In: European Conference on Computer Vision (ECCV), pp 71–84
Pavani SK, Delgado-Gomez D, Frangi AF (2012) Gaussian weak classifiers based on co-occurring haar-like features for face detection. Pattern Analy Appl
Phillips PJ, Rauss PJ, Der SZ (1996) FERET (Face Recognition Technology) recognition algorithm development and test results. Tech Rep 995, Army Research Lab
Rifkin R, Klautau A (2004) In defence of one-vs-all classification. J Mach Learn Res 5:101–141
Roy S, Sun Q (2007) Robust hash for detecting and localizing image tampering. In: Proc. IEEE Int. Conf. on Image Processing, pp 117–120. San Antonio, TX
Savicky P, Robnik-Sikonja M (2008) Learning random numbers: a MATLAB anomaly. Appl Artif Intell 22(3):254–265
Sleumer NH (2000) Hyperplane arrangements. construction, visualization and application. Ph.D. thesis, Technische Wissenschaften ETH Zürich, Nr. 13502
Stanley RP (2007) An introduction to hyperplane arrangements, IAS/Park City Math. Ser., vol. 13, pp 389–496. Amer Math Soc
Stommel M, Herzog O (2009) Binarising SIFT-descriptors to reduce the curse of dimensionality in histogram-based object recognition. In: Slezak D, Pal SK, Kang BH, Gu J, Kurada H, Kim TH (eds) Signal Processing, Image Processing and Pattern Recognition, pp 320–327. Springer
Stommel M, Langer M, Herzog O, Kuhnert KD (2011) A fast, robust and low bit-rate representation for SIFT and SURF features. In: Proc. IEEE International Symposium on Safety, Security, and Rescue Robotics, pp 278n++-283
Strecha C, Bronstein AM, Bronstein MM, Fua P (2010) LDAHash: Improved matching with smaller descriptors. In: EPFL-REPORT-152487
Su Y, Jurie F (2011) Visual word disambiguation by semantic contexts. In: International Conference on Computer Vision (ICCV)
Vapnik VN, Chervonenkis AY (1974) Theory of pattern recognition [in Russian]. Nauka, USSR
Viola P, Jones MJ (2004) Robust real-time face detection. Intern J Comp Vision 57(2):137–154
Wiedemeyer T, Stommel M, Herzog O (2011) Wide range face pose estimation by modelling the 3D arrangement of robustly detectable sub-parts. In: Intl. Conf. on Computer Analysis of Images and Patterns (CAIP), pp 237–244. Springer
Winder SAJ, Brown M (2007) Learning Local Image Descriptors. In: IEEE Computer Vision and Pattern Recognition (CVPR)
Yang L, Jin R, Sukthankar R, Jurie F (2008) Unifying discriminative visual codebook generation with classifier training for object category recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Yeo C, Ahammad P, Ramchandran K (2008) Rate-efficient visual correspondences using random projections. In: IEEE International Conference on Image Processing, pp 217–220. San Diego, CA
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Stommel, M., Herzog, O. & Xu, W. Hyperplane arrangements for the fast matching and classification of visual landmarks. Pattern Anal Applic 19, 621–629 (2016). https://doi.org/10.1007/s10044-014-0417-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-014-0417-3