Learning to Efficiently Detect Repeatable Interest Points in Depth Data

Holzer, Stefan; Shotton, Jamie; Kohli, Pushmeet

doi:10.1007/978-3-642-33718-5_15

Stefan Holzer^21,22,
Jamie Shotton²² &
Pushmeet Kohli²²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7572))

Included in the following conference series:

European Conference on Computer Vision

Abstract

Interest point (IP) detection is an important component of many computer vision methods. While there are a number of methods for detecting IPs in RGB images, modalities such as depth images and range scans have seen relatively little work. In this paper, we approach the IP detection problem from a machine learning viewpoint and formulate it as a regression problem. We learn a regression forest (RF) model that, given an image patch, tells us if there is an IP in the center of the patch. Our RF based method for IP detection allows an easy trade-off between speed and repeatability by adapting the depth and number of trees used for approximating the interest point response maps. The data used for training the RF model is obtained by running state-of-the-art IP detection methods on the depth images. We show further how the IP response map used for training the RF can be specifically designed to increase repeatability by employing 3D models of scenes generated by reconstruction systems such as KinectFusion [1]. Our experiments demonstrate that the use of such data leads to considerably improved IP detection.

Download to read the full chapter text

Chapter PDF

CURFIL: A GPU Library for Image Labeling with Random Forests

6-DOF Model Based Tracking via Object Coordinate Regression

Metric Regression Forests for Correspondence Estimation

Article 11 April 2015

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Newcombe, R.A., Izadi, S., Hilliges, O., Molyneaux, D., Kim, D., Davison, A.J., Kohli, P., Shotton, J., Hodges, S., Fitzgibbon, A.: Kinectfusion: Real-time dense surface mapping and tracking. In: ISMAR (2011)
Google Scholar
Steder, B., Grisetti, G., Burgard, W.: Robust place recognition for 3D range data based on point features. In: ICRA (2010)
Google Scholar
Steder, B., Rusu, R.B., Konolige, K., Burgard, W.: Point feature extraction on 3D range scans taking into account object boundaries. In: ICRA (2011)
Google Scholar
Unnikrishnan, R.: Statistical approaches to multi-scale point cloud processing (2008)
Google Scholar
Rosten, E., Drummond, T.: Fusing points and lines for high performance tracking. In: ICCV (2005)
Google Scholar
Rosten, E., Drummond, T.W.: Machine Learning for High-Speed Corner Detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
Chapter Google Scholar
Rosten, E., Porter, R., Drummond, T.: Faster and better: A machine learning approach to corner detection. PAMI (2010)
Google Scholar
Šochman, J., Matas, J.: Learning a Fast Emulator of a Binary Decision Process. In: Yagi, Y., Kang, S.B., Kweon, I.S., Zha, H. (eds.) ACCV 2007, Part II. LNCS, vol. 4844, pp. 236–245. Springer, Heidelberg (2007)
Chapter Google Scholar
Sochman, J., Matas, J.: Waldboost - learning for time constrained sequential detection. In: CVPR (2005)
Google Scholar
Schapire, R.E., Singer, Y.: Improved boosting algorithms using confidence-rated predictions. In: Machine Learning, pp. 80–91 (1999)
Google Scholar
Viola, P., Jones, M.: Fast and robust classification using asymmetric adaboost and a detector cascade. In: Advances in Neural Information Processing System 14, pp. 1311–1318. MIT Press (2001)
Google Scholar
Foresti, G.: Invariant feature extraction and neural trees for range surface classification. IEEE Transactions on Systems, Man, and Cybernetics (2002)
Google Scholar
Lepetit, V., Fua, P.: Keypoint recognition using randomized trees. PAMI (2006)
Google Scholar
Özuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. PAMI (2010)
Google Scholar
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from a single depth image. In: CVPR (2011)
Google Scholar
Stückler, J., Behnke, S.: Interest point detection in depth images through scale-space surface analysis. In: ICRA (2011)
Google Scholar
Gelfand, N., Mitra, N.J., Guibas, L.J., Pottmann, H.: Robust global registration. In: Eurographics Symposium on Geometry Processing (2005)
Google Scholar
Hinterstoisser, S., Holzer, S., Cagniart, C., Ilic, S., Konolige, K., Navab, N., Lepetit, V.: Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: ICCV (2011)
Google Scholar
Criminisi, A., Shotton, J., Konukoglu, E.: Decision forests: A unified framework for classification, regression, density estimation, manifold learning and semi-supervised learning. In: Foundations and Trends in Computer Graphics and Vision (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, CAMP, Technische Universität München (TUM), Germany
Stefan Holzer
Microsoft Research Cambridge, UK
Stefan Holzer, Jamie Shotton & Pushmeet Kohli

Authors

Stefan Holzer
View author publications
You can also search for this author in PubMed Google Scholar
Jamie Shotton
View author publications
You can also search for this author in PubMed Google Scholar
Pushmeet Kohli
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Holzer, S., Shotton, J., Kohli, P. (2012). Learning to Efficiently Detect Repeatable Interest Points in Depth Data. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33718-5_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-33718-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33717-8
Online ISBN: 978-3-642-33718-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Learning to Efficiently Detect Repeatable Interest Points in Depth Data

Abstract

Chapter PDF

Similar content being viewed by others

CURFIL: A GPU Library for Image Labeling with Random Forests

6-DOF Model Based Tracking via Object Coordinate Regression

Metric Regression Forests for Correspondence Estimation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Learning to Efficiently Detect Repeatable Interest Points in Depth Data

Abstract

Chapter PDF

Similar content being viewed by others

CURFIL: A GPU Library for Image Labeling with Random Forests

6-DOF Model Based Tracking via Object Coordinate Regression

Metric Regression Forests for Correspondence Estimation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation