Abstract
This paper presents a movie segmentation system based on the automatic detection of dialogue scenes.
The proposed system processes the video stream directly in the MPEG domain. It first segments the video footage into shots. Each shot is then classified as dialogue or non-dialogue by a Multi-Expert System (MES). Finally, the identified sequences of shots are aggregated into dialogue scenes by a suitable algorithm. The MES integrates three experts, which classify a given shot on the basis of complementary descriptions: an audio classifier, a face detector and a camera motion estimator.
The system has been tested on a large MPEG movie database of more than 15000 shots and 200 scenes, with encouraging results.
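The pipeline described above (per-shot classification by three experts, followed by aggregation of shots into scenes) can be illustrated with a minimal sketch. The abstract does not state the combination rule or the aggregation algorithm, so the majority vote, the `min_shots` threshold, the shot fields (`speech`, `faces`, `camera_static`) and all function names below are assumptions made for illustration only, not the authors' method.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical type: an expert maps a shot descriptor to True (dialogue) or False.
Expert = Callable[[dict], bool]


def classify_shot(shot: dict, experts: Sequence[Expert]) -> bool:
    """Label a shot as dialogue by simple majority vote (an assumed rule)."""
    votes = sum(1 for expert in experts if expert(shot))
    return votes * 2 > len(experts)


def aggregate_scenes(labels: Sequence[bool], min_shots: int = 2) -> List[Tuple[int, int]]:
    """Group runs of consecutive dialogue shots into (first, last) index pairs.

    Runs shorter than min_shots are discarded; this threshold is an assumption.
    """
    scenes: List[Tuple[int, int]] = []
    start = None  # index of the first shot of the current dialogue run
    for i, is_dialogue in enumerate(labels):
        if is_dialogue and start is None:
            start = i
        elif not is_dialogue and start is not None:
            if i - start >= min_shots:
                scenes.append((start, i - 1))
            start = None
    # Close a run that extends to the last shot.
    if start is not None and len(labels) - start >= min_shots:
        scenes.append((start, len(labels) - 1))
    return scenes


# Toy stand-ins for the three experts; each reads a hypothetical precomputed field.
def audio_expert(shot: dict) -> bool:
    return shot["speech"]


def face_expert(shot: dict) -> bool:
    return shot["faces"] > 0


def motion_expert(shot: dict) -> bool:
    return shot["camera_static"]


experts = [audio_expert, face_expert, motion_expert]

# Invented example data, not taken from the paper's database.
shots = [
    {"speech": True, "faces": 2, "camera_static": True},
    {"speech": True, "faces": 1, "camera_static": False},
    {"speech": False, "faces": 0, "camera_static": False},
    {"speech": True, "faces": 0, "camera_static": False},
    {"speech": False, "faces": 1, "camera_static": True},
    {"speech": True, "faces": 2, "camera_static": True},
    {"speech": True, "faces": 1, "camera_static": True},
]

labels = [classify_shot(s, experts) for s in shots]
scenes = aggregate_scenes(labels)
print(scenes)
```

With three experts a majority vote never ties, which keeps the sketch simple; a reliability-weighted combination would be a natural alternative, but nothing in the abstract says which rule the system uses.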
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cordella, L.P., De Santo, M., Percannella, G., Sansone, C., Vento, M. (2002). A Multi-expert System for Movie Segmentation. In: Roli, F., Kittler, J. (eds) Multiple Classifier Systems. MCS 2002. Lecture Notes in Computer Science, vol 2364. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45428-4_30
DOI: https://doi.org/10.1007/3-540-45428-4_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43818-2
Online ISBN: 978-3-540-45428-1
eBook Packages: Springer Book Archive