Abstract
Video captions can be used to index large video archives in digital libraries. This paper proposes an algorithm that detects captions in video frames using a support vector machine (SVM). First, the input video frame is divided into square sub-blocks, and a trained SVM classifies each sub-block as caption or non-caption. Second, horizontal and vertical projections are performed to locate the candidate caption regions. Finally, false alarms are reduced by a caption region verification step. Experimental results show that the algorithm has a low miss rate and a low false alarm rate.
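The three stages can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not give the block size, the SVM features, or the verification rule. The sketch therefore assumes that the projections are taken over the per-block labels, substitutes a simple contrast threshold (`contrast_stub`) for the trained SVM, and uses a hypothetical minimum-width rule (`verify`) for the verification step. All function names and parameters are illustrative.

```python
from typing import Callable, List, Tuple

Frame = List[List[int]]          # grayscale frame as rows of pixel intensities
Box = Tuple[int, int, int, int]  # (top, left, bottom, right) in block units, end-exclusive


def block_mask(frame: Frame, size: int, is_caption: Callable[[Frame], bool]) -> List[List[int]]:
    """Split the frame into size x size sub-blocks and label each one 1 (caption) or 0."""
    rows, cols = len(frame) // size, len(frame[0]) // size
    mask = []
    for r in range(rows):
        labels = []
        for c in range(cols):
            block = [line[c * size:(c + 1) * size] for line in frame[r * size:(r + 1) * size]]
            labels.append(1 if is_caption(block) else 0)
        mask.append(labels)
    return mask


def runs(profile: List[int], min_count: int = 1) -> List[Tuple[int, int]]:
    """Return end-exclusive (start, stop) runs where the profile is at least min_count."""
    out, start = [], None
    for i, value in enumerate(profile + [0]):  # trailing 0 closes a run that reaches the end
        if value >= min_count and start is None:
            start = i
        elif value < min_count and start is not None:
            out.append((start, i))
            start = None
    return out


def candidate_regions(mask: List[List[int]], min_count: int = 1) -> List[Box]:
    """Horizontal projection finds row bands; vertical projection inside each band finds column spans."""
    boxes = []
    row_profile = [sum(row) for row in mask]
    for top, bottom in runs(row_profile, min_count):
        band = mask[top:bottom]
        col_profile = [sum(col) for col in zip(*band)]
        for left, right in runs(col_profile, min_count):
            boxes.append((top, left, bottom, right))
    return boxes


def verify(boxes: List[Box], min_width: int = 2) -> List[Box]:
    """Hypothetical verification rule: discard regions narrower than min_width blocks."""
    return [b for b in boxes if b[3] - b[1] >= min_width]


def contrast_stub(block: Frame) -> bool:
    """Stand-in for the trained SVM (assumption): flag blocks with high intensity contrast."""
    pixels = [p for line in block for p in line]
    return max(pixels) - min(pixels) > 100


# Synthetic 8 x 16 frame with a checkerboard "caption" in the lower middle.
frame = [[0] * 16 for _ in range(8)]
for y in range(4, 8):
    for x in range(4, 12):
        frame[y][x] = 255 if (x + y) % 2 else 0

mask = block_mask(frame, 4, contrast_stub)
print(verify(candidate_regions(mask)))  # [(1, 1, 2, 3)]
```

In a real system, `contrast_stub` would be replaced by the decision function of an SVM trained on labeled caption and non-caption blocks, and the verification rule would be chosen from the properties of the captions in the target archive.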
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, J., Li, S. (2005). Automatic Caption Detection in Video Frames Based on Support Vector Machine. In: Wang, J., Liao, XF., Yi, Z. (eds) Advances in Neural Networks – ISNN 2005. ISNN 2005. Lecture Notes in Computer Science, vol 3497. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11427445_41
DOI: https://doi.org/10.1007/11427445_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25913-8
Online ISBN: 978-3-540-32067-8
eBook Packages: Computer Science (R0)