Abstract
This paper proposes a new approach to automate and manage the testing process for developing document analysis and understanding algorithms. A distributed test environment is proposed to assure visibility, repeatability, scalability and consistency during and between testing sessions. A variety of views are used to deal with multi-level operations in algorithm development. Tests are realized with dedicated test scenarios and events at different stages of the development cycle. Our main objective is to provide collaborating researchers with a flexible means of generating consistent ways to validate algorithm behaviour in a target environment. This is accomplished with a simulated environment and underlying resources for each test scenarios. A set of techniques to design test events for test scenarios (e.g. module, integration, regression) is proposed aimed at promoting a black-board style research approach. To demonstrate the functionality of this approach, we have implemented a prototype and trace an example algorithm through the development process.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Srihari S., Shin Y., Ramanaprasad V. and Lee D. (1995) Name and address block reader system for tax form processing. In: Proc. of the 3rd International Conference on Document Analysis and Recognition, Montreal, Canada, 1:5–10.
Dengel A., Bleisinger R., Fein F., Hoch R., Hones F. and Malburg M. (1995) OfficeMAID-A system for office mail analysis, interpretation and delivery. In: Spitz L & Dengel A (ed) Document Analysis Systems, 1:52–75. World Scientific Press, Co.
Sharpe M., Sutcliffe G. and Ahemd N. (1994) Implementation of an intelligent document understanding and reproduction system. In: Proc. of the IAPR Workshop on Machine Vision Applications, Kawasaki, Japan.
Casey R. and Ferguson D. (1990) Intelligent forms processing. IBM Systems Journal, 29(3):435–450.
Taylor S., Fritzon R. and Pastor J. (1992) Extraction of data from preprinted forms. Machine Vision and Applications, 5:211–222.
von Mayrhauser A. (1990) Software Engineering: Methods and Management. Academic Press Inc., pp. 435–497.
Sauvola J., Doermann D., Haapakoski S., Kauniskangas H., Seppäinen T. and Pietikäinen M. (1997) A Distributed Management System for Testing Document Image Analysis Algorithms. To appear in ICDAR'97, Ulm, Germany.
Nieminen S., Sauvola J. and Pietikäinen M. (1997) Benchmarking System for Document Image Binarization. Submitted.
Kanungo T., Haralick R.M. and Phillips I. (1993) Global and local document degradation models, 1993 IEEE, pp. 730–734.
Sauvola J., Seppdnen T., Haapakoski S. and Pietikäinen M (1997) Adaptive Document Binarization. To appear in ICDAR'97, Ulm, Germany.
Sauvola J. and Pietikäinen M. (1995) Skew Detection Using Texture Direction Analysis. In: The Proc. of the 9th SCIA, Uppsala, Sweden, 1:1099–1106.
Sauvola J., Doermann D. and Pietikäinen M. (1997) Local Document Skew Detection. In: The proceedings of West-Coast SPIE, Document Recognition Systems IV, California, USA, 3027:13 p.
Sauvola J. and Pietikäinen M. (1995) Page Segmentation and Classification Using Fast Feature Extraction and Connectivity Analysis. In: The Proc. of the 3rd ICDAR, Montreal, Canada, 2:1127–1131.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sauvola, J., Doermann, D., Kauniskangas, H., Pietikäinen, M. (1997). Techniques for the automated testing of document analysis algorithms. In: Murshed, N.A., Bortolozzi, F. (eds) Advances in Document Image Analysis. BSDIA 1997. Lecture Notes in Computer Science, vol 1339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63791-5_15
Download citation
DOI: https://doi.org/10.1007/3-540-63791-5_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63791-2
Online ISBN: 978-3-540-69646-9
eBook Packages: Springer Book Archive