Abstract
MPEG-4 object oriented video codec implementations are rapidly emerging as a solution to compress audio-video information in an efficient way, suitable for narrowband applications.
A different view is proposed in this paper: several images in a video sequence result very close to each other. Each image of the sequence can be seen as a vector in a hyperspace and the whole video can be considered as a curve described by the image-vector at a given time instant.
The curve can be sampled to represent the whole video, and its evolution along the video space can be reconstructed from its video-samples. Any image in the hyperspace can be obtained by means of a reconstruction algorithm, in analogy with the reconstruction of an analog signal from its samples; anyway, here the multi-dimensional nature of the problem asks for the knowledge of the position in the space and a suitable interpolating kernel function.
The definition of an appropriate Video Key-frames Codebook is introduced to simplify video reproduction; a good quality of the predicted image of the sequence might be obtained with a few information parameters. Once created and stored the VKC, the generic image in the video sequence can be referred to the selected key-frames in the codebook and reconstructed in the hyperspace from its samples.
Focus of this paper is on the analysis phase of a give video sequence. Preliminary results seem promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
P Salembier, F. Marques, Region based representation of image and video: Segmentation tool for multimedia services, IEEE Trans. Circuits and systems for Video Technology, invited paper, vol. 9 no. 8, dec. 1999
ISO/IEC DIS 13818-2, Information Technology-Generic Coding of Moving Pictures and Associated Audio Information-Part 2: Video, ISO, 1994.
MPEG Video Group, MPEG-4 Video Verification Model Version 4.0, ISO/IEC JTC1/SC29/WG11/M1380, Proceedings of Chicago meeting, October 1996
CCITT SG XV, Recommendation H.261-Video Codec for Audiovisual Services at px64 kbit/s, COM XV-R37-E, Int. Telecommunication Union, August 1990.
ITU-T Draft, Recommendation H.263-Video Coding for low bit rate communication, Int. Telecommunication Union, November 1995.
G. Acciani, E. Chiarantoni, and M. Minenna “A new non Competitive Unsupervised Neural Network for Clustering” Proc. of Intern. Symp. On Circuits and Systems, Vol. 6, pp. 273–276, London May 1994.
C. Guaragnella, E. Di Sciascio, Object Oriented Motion Estimation by Sliced-Block Matching Algorithm, Proc. IEEE 15th Intl. Conf. On Pattern Recognition, Vol. 3, Image, speech and signal processing, pp. 865–869, Barcelona, Sept. 3-7, 2000.
C. Cafforio, E. Di Sciascio, C. Guaragnella, Motion estimation and Modeling for Video Sequences, Proc. of EUSIPCO 98, 8–11, Rhodes, GR.
A. Guerriero and V. Di Lecce, An Evaluation of the Effectiveness of image Features for Image Retrieval, J. Visual Communication and Image Representation 10, 351–362 (1999).
A. Del Bimbo and P. Pala, Visual image retrieval by elastic matching of user sketches, IEEE Trans. Pattern Anal. Mach. Intell. 19(2), 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Acciani, G., Chiarantoni, E., Girimonte, D., Guaragnella, C. (2002). Unsupervised - Neural Network Approach for Efficient Video Description. In: Dorronsoro, J.R. (eds) Artificial Neural Networks — ICANN 2002. ICANN 2002. Lecture Notes in Computer Science, vol 2415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46084-5_211
Download citation
DOI: https://doi.org/10.1007/3-540-46084-5_211
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44074-1
Online ISBN: 978-3-540-46084-8
eBook Packages: Springer Book Archive