Abstract
The paper explores the issues of universal computational formalisms and reusable developer environment as applied to rule-based NLP. It suggests a portable grammar framework and modular NLP architecture that by combining certain modules can be reused for different unilingual and multilingual applications through a universal developer environment. The developer environment includes a lexicon shell with flexible settings to define, among others, tag descriptions, entry structures, depth of knowledge, and a number of compilers with universal rule-writing formalisms. The formalisms and compilers described have been successfully used (in different combinations and with different depth of analysis) in a number of unilingual and multilingual applications that involved English, Danish, French and Russian.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
It might not always be the case as there are applications that skip phrase chunking and work directly on lexicon tags.
References
Camilleri, J.J.: An IDE for the grammatical framework. In: Proceedings of a Workshop on Free/Open-Source Rule-Based Machine Translation, Gothenburg, Sweden, pp. 1–12 (2012)
Feldman, A., Jirka, H., Brew, Ch.: A cross-language approach to rapid creation of new morpho-syntactically annotated resources. In: Proceedings of LREC 2006 (2006)
Joshi, A.K.: Unification and some new grammatical formalisms. In: Proceedings of the Workshop on Theoretical Issues in Natural Language Processing TINLAP 1987, pp. 45–50. Las Cruces, New Mexico. USA (1987)
Joshi, A.K, Srinivas, B.: Disambiguation of super parts of speech (or Supertags): almost parsing (1994). http://acl.ldc.upenn.edu/C/C94/C94-1024.pdf
Neumann, C.: Generating patent claims in english from a Japanese-only interface. In: Proceedings of the Workshop on Patent Translation in conjunction with the MT Summit X. Phluket, Thailand, September 2005
Paul, M.: Translation knowledge recycling for related languages. In: Proceedings of MT Summit VIII. Santiago de Compostela, Galicia, Spain, 18–22 September 2001
Petrov, S., Das, D., McDonald, R.: A universal part-of-speech tagset. In: Proceedings of the Conference on Language Resources and Evaluation (LREC 2012). Istanbul, Turkey (2012)
Pereira, F.C.N., Shieber, S.M.: The semantics of grammar formalisms seen as computer languages. In: Proceedings of the Tenth International Conference on Computational Linguistics. Stanford, CA, USA (1984)
Pinkham, J., Corston-Oliver, M., Smets, M., Pettenaro, M.: Rapid assembly of a large-scale French-English MT system. In: Proceedings of MT Summit VIII. Santiago de Compostela, Galicia. Spain, 18–22 September 2001
Przepiórkowski, A.: A preliminary formalism for simultaneous rule-based tagging and partial parsing data structures for linguistic resources and applications. In: Rehm, G., Witt, A., Lemnitzer, L. (eds.) Tübingen:Gunter Narr Verlag, pp. 81–90 (2007)
Sharoff, S., Nivre, J.: The proper place of men and machines in language technology processing Russian without any linguistic knowledge. In: Proceedings of the International Conference on Computational Linguistics “Dialogue”- 2011. Russian State Humanitarian University, Moscow (2011)
Sharoff, S., Kopotev, M., Erjavec, T., Feldman, A., Divjak, D.: Designing and evaluating a Russian tagset. In: Proceedings of the Sixth Language Resources and Evaluation Conference, LREC 2008, Marrakech (2008)
Sheremetyeva, S.: Application adaptive electronic dictionary with intelligent interface. In: Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries in Conjunction with the 20th International Conference on Computational Linguistics. COLING 2004, Geneva, Switzerland, 23–28 August 2004
Sheremetyeva, S.: On portability of resources for quick ramp-up of multilingual MT for patent claims. In: Proceedings of the workshop on Patent Translation in conjunction with MT Summit XI, Copenhagen, Denmark, 10–14 September 2007
Sheremetyeva, S.: On teaching technical writing with an authoring tool. In: Proceedings of the 10th International Technology, Education and Development Conference INTED 2016, Valencia, Spain, 7–9 March 2016
Takeda, K.: Portable knowledge sources for machine translation. In: Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, Japan, 5–9 August 1994
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Sheremetyeva, S. (2018). Universal Computational Formalisms and Developer Environment for Rule-Based NLP. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2017. Lecture Notes in Computer Science(), vol 10761. Springer, Cham. https://doi.org/10.1007/978-3-319-77113-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-77113-7_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77112-0
Online ISBN: 978-3-319-77113-7
eBook Packages: Computer ScienceComputer Science (R0)