Abstract
Developing a question answering (Q/A) system involves in integrating abundant linguistic resources such as syntactic parsers, named entity recognizers which are not only impose time cost but also unavailable in other languages. Ranking-based approaches take the advantage of both efficiency and multilingual portability but most of them bias to high frequent words. In this paper, we propose a new passage ranking algorithm for extending textQ/A toward videoQ/A based on searching lexical information in videos. This method takes both N-gram match and word density into account and finds the optimal match sequence using dynamic programming techniques. Besides, it is very efficient to handle real time tasks for online video question answering. We evaluated our method with 150 actual user’s questions on the 45GB video collections. Nevertheless, four well-known but multilingual portable ranking approaches were adopted to compare. Experimental results show that our method outperforms the second best approach with relatively 25.64% MRR score.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cai, M., Song, J., Lyu, M.R.: A new approach for video text detection. In: Proceedings of International Conference on Image Processing, pp. 117–120 (2002)
Cao, J., Nunamaker, J.F.: Question answering on lecture videos: a multifaceted approach. In: International Conference on Digital Libraries, pp. 214–215 (2004)
Chang, F., Chen, G.C., Lin, C.C., Lin, W.H.: Caption analysis and recognition for building video indexing systems. ACM Multimedia systems 10(4), 344–355 (2005)
Cui, H., Sun, R., Li, K., Kan, M., Chua, T.: Question answering passage retrieval using dependency relations. In: Proceedings of the 28th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 400–407 (2005)
Fan, J., Yau, D.K.Y., Elmagarmid, A.K., Aref, W.G.: Automatic image segmentation by integrating color-edge extraction and seeded region growing. IEEE Trans. On Image Processing 10(10), 1454–1464 (2001)
Hong, T., Lam, S.W., Hull, J.J., Srihari, S.N.: The design of a nearest-neighbor classifier and its use for japanese character recognition. In: Proceedings of Third International Conference on Document Analysis and Recognition, pp. 270–291 (1995)
Lee, G.G., Seo, J.Y., Lee, S.W., Jung, H.M., Cho, B.H., Lee, C.K., Kwak, B.K., Cha, J.W., Kim, D.S., An, J.H., Kim, H.S.: SiteQ: Engineering high performance QA system using lexico-semantic pattern matching and shallow NLP. In: Proceedings of the 10th Text Retrieval Conference, pp. 437–446 (2001)
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. Circuits and Systems for Video Technology 12(4), 243–255 (2002)
Lin, C.J., Liu, C.C., Chen, H.H.: A simple method for Chinese video OCR and its application to question answering. Computational linguistics and Chinese language processing 6(2), 11–30 (2001)
Lin, J., Quan, D., Sinha, V., Bakshi, K., Huynh, D., Katz, B., Karger, D.R.: What makes a good answer? the role of context in question answering. In: Proceedings of the 9th international conference on human-computer interaction (INTERACT), pp. 25–32 (2003)
Lyu, M.R., Song, J., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Trans. Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Pasca, M., Harabagiu, S.: High-performance question answering. In: Proceedings of the 24th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 366–374 (2001)
Robertson, E., Walker, S., Beaulieu, M.: Okapi at TREC-7: automatic ad hoc, filter-ing, VLC and interactive track. In: Proceedings of the 7th Text Retrieval Conference (1998)
Rus, V., Moldovan, D.: High precision logic form transformation. International Journal on Artificial Intelligence Tools 11(3), 437–454 (2002)
Savoy, J.: Comparative study on monolingual and multilingual search models for use with Asian languages. ACM transactions on Asian language information processing (TALIP) 4(2), 163–189 (2005)
Tellex, S., Katz, B., Lin, J.J., Fernandes, A., Marton, G.: Quantitative evaluation of passage retrieval algorithms for question answering. In: Proceedings of the 26th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 41–47 (2003)
Voorhees, E.M.: Overview of the TREC 2001 question answering track. In: Proceedings of the 10th Text Retrieval Conference, pp. 42–52 (2001)
Wu, Y.C., Lee, Y.S., Chang, C.H.: CLVQ: Cross-language video question/answering system. In: Proceedings of 6th IEEE International Symposium on Multimedia Software Engineering, pp. 294–301 (2004)
Yang, H., Chaison, L., Zhao, Y., Neo, S.Y., Chua, T.S.: VideoQA: Question answering on news video. In: Proceedings of the 11th ACM International Conference on Multimedia, pp. 632–641 (2003a)
Yang, H., Chua, T.S., Wang, S.G., Koh, C.K.: Structural use of external knowledge for event-based open domain question answering. In: Proceedings of the 26th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 33–40 (2003b)
Zhang, D., Nunamaker, J.: A natural language approach to content-based video indexing and retrieval for interactive E-learning. IEEE Transactions on Multimedia 6(3), 450–458 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, YC., Lee, YS., Yang, JC., Yen, SJ. (2006). A New Passage Ranking Algorithm for Video Question Answering. In: Chang, LW., Lie, WN. (eds) Advances in Image and Video Technology. PSIVT 2006. Lecture Notes in Computer Science, vol 4319. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949534_56
Download citation
DOI: https://doi.org/10.1007/11949534_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68297-4
Online ISBN: 978-3-540-68298-1
eBook Packages: Computer ScienceComputer Science (R0)