Abstract
In this paper we show some preliminary results of our research in the fieldwork of classification of imbalanced datasets with SVM and stratified sampling. Our main goal is to deal with the clinical problem of automatic intestinal contractions detection in endoscopic video images. The prevalence of contractions is very low, and this yields to highly skewed training sets. Stratified sampling together with SVM have been reported in the literature to behave well in this kind of problems. We applied both the SMOTE algorithm developed by Chawla et al. and under-sampling, in a cascade system implementation to deal with the skewed training sets in the final SVM classifier. We show comparative results for both sampling techniques using precision-recall curves, which appear to be useful tools for performance testing.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hansen, M.B.: Small Intestinal Manometry. Physiological Research 51, 541–556 (2002)
Schulmann, S.K., Hollerbach, M.D., et al.: Feasibility and diagnostic of video capsule endoscopy for small bowel polyps. American Journal of Gastroenterollogy (2005)
Brodsky, L.M.: Wireless capsule endoscopy. Issues in Emerging Health Technologies. In: CCOHTA, vol. 53 (2003)
Eliakim, R.: Wireless capsule video endoscopy: Three years of experience. World journal of Gastroenterology 10, 1238–1239 (2004)
Vapnik, V.: The nature of Statistical Learning Theory. Springer, USA (1995)
Akbani, R., Kwek, S., Japkowicz, N.: Applying Support Vector Machines to Imbal-anced Datasets. In: European Conference on Machine Learning, pp. 39–50 (2004)
Brank, J., Grobelnik, M., et al.: Training text classifiers with SVM on very few positive examples, Technical Report MSR-TR-2003-34 (2003)
Chawla, N., Hall, L., Kegelmeyer, W.: SMOTE: Synthetic Minority Over-sampling Technique. Journal of Artificial Intelligence Research 16, 341–378 (2002)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 2nd edn. Prentice Hall, Englewood Cliffs (2002)
Russ, J.C.: The Image Processing Handbook, 2nd edn. IEEE Press, Los Alamitos (1994)
Crone, S.F., Lessmann, S., Stahlbock, R.: Empirical Comparison and Evaluation of Classifier Performance for Data Mining in Customer Relationship Management. In: 3rd International Conference on KDDM (1997)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Willey InterScience, Willey & Sons, Inc. (2001)
Van Rijsbergen, C.: Information Retrieval. Dept. of Computer Science. University of Glasgow (1979)
Metz, C.: Basic Principles of ROC Analysis. Seminars on Nuclear Medicine 8, 283–298 (1978)
Swets, J., Pickett, R.: Evaluation of Diagnostic Systems: Methods for Signal Detection Theory. Academic Press, New York (1982)
Bradley, A.: The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms. Pattern Recognition 30, 1145–1159 (1997)
Duin, R.P.W., Juszczak, P., Paclik, P., Pekalska, E., Ridder, D.M.J.: Tax, PRTools4, A Matlab toolbox for pattern recognition. Delft University of Technology (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vilariño, F., Spyridonos, P., Vitrià, J., Radeva, P. (2005). Experiments with SVM and Stratified Sampling with an Imbalanced Problem: Detection of Intestinal Contractions. In: Singh, S., Singh, M., Apte, C., Perner, P. (eds) Pattern Recognition and Image Analysis. ICAPR 2005. Lecture Notes in Computer Science, vol 3687. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11552499_86
Download citation
DOI: https://doi.org/10.1007/11552499_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28833-6
Online ISBN: 978-3-540-31999-3
eBook Packages: Computer ScienceComputer Science (R0)