Abstract
The problem of automatically tuning multiple parameters for pattern recognition Support Vector Machines (SVMs) is considered. This is done by minimizing estimates of the generalization error of SVMs using a gradient descent algorithm over the set of parameters. Usual methods for choosing parameters, based on exhaustive search, become intractable as soon as the number of parameters exceeds two. Experimental results assess the feasibility of our approach for a large number of parameters (more than 100) and demonstrate an improvement of generalization performance.
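As a rough, hypothetical illustration of the idea (a minimal sketch, not the method derived in the paper, which uses analytic gradients of leave-one-out-style error estimates), the Python snippet below performs plain gradient descent over many per-feature kernel scaling parameters, using a finite-difference gradient of a simple validation hinge-loss surrogate. All data, settings, and function names are illustrative assumptions.

```python
# Hypothetical sketch, not the authors' algorithm: the paper minimizes
# differentiable estimates of the generalization error with analytic
# gradients; here a finite-difference gradient of a validation hinge-loss
# surrogate stands in, just to show gradient descent over many
# per-feature kernel scaling parameters at once.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=20, random_state=0)
X_tr, X_va, y_tr, y_va = train_test_split(X, y, random_state=0)

def error_estimate(log_scales):
    """Validation hinge loss of an SVM trained with per-feature input scales."""
    s = np.exp(log_scales)                     # one scale per input feature
    clf = SVC(kernel="rbf", gamma=1.0, C=10.0).fit(X_tr * s, y_tr)
    margins = clf.decision_function(X_va * s) * (2 * y_va - 1)  # {0,1} -> {-1,+1}
    return np.mean(np.maximum(0.0, 1.0 - margins))

def finite_diff_grad(f, theta, eps=1e-2):
    """Central finite-difference approximation of the gradient of f at theta."""
    grad = np.zeros_like(theta)
    for i in range(theta.size):
        e = np.zeros_like(theta)
        e[i] = eps
        grad[i] = (f(theta + e) - f(theta - e)) / (2 * eps)
    return grad

theta = np.zeros(X.shape[1])                   # log-scales, one per feature
for step in range(20):                         # plain gradient descent
    theta -= 0.5 * finite_diff_grad(error_estimate, theta)
    print(f"step {step:2d}  error estimate {error_estimate(theta):.3f}")
```

The point of the sketch is only that a gradient-based search scales to dozens of kernel parameters, whereas a grid search over the same parameters would be infeasible; the quality of the result depends entirely on how well the chosen error estimate tracks the true generalization error.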
Cite this article
Chapelle, O., Vapnik, V., Bousquet, O. et al. Choosing Multiple Parameters for Support Vector Machines. Machine Learning 46, 131–159 (2002). https://doi.org/10.1023/A:1012450327387