Abstract
Despite being one of the most widely-spoken languages in the world, Portuguese remains a relatively resource-poor language, for which only in recently years NLP tools such as parsers, taggers and (fairly) large corpora have become available. In this work we describe the task of pronominal co-reference annotation and resolution in Portuguese texts, in which we take advantage of information provided by a tagged corpus and a simple annotation tool that has been developed for this purpose. Besides developing some of these basic resources from scratch, our ultimate goal is to investigate the multilingual resolution of Portuguese personal pronouns to improve the accuracy of their translations to both Spanish and English in an underlying MT project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Mitkov, R.: Anaphora Resolution. Longman, New york (2002)
Bick, E.: The parsing system PALAVRAS: automatic grammatical analysis of Portuguese in a constraint grammar framework. PhD Thesis, Arhus University (2000)
Müller, C., Strube, M.: MMAX: A tool for the annotation of multi-modal corpora. In: IJCAI 2001, Seattle, pp. 45–50 (2001)
Soon, W.M., et al.: A Machine Learning Approach to Correference Resolution of Noun Phrases. Computational Linguistics 27(4) (2001)
McCarthy, J.F., Lehnert, W.G.: Using Decision Trees for Coreference Resolution. In: 14th International Conference on Artificial Intelligence IJCAI (1995)
Ng, V., Cardie, C.: Improving Machine Learning Approaches to Coreference Resolution. In: 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, pp. 104–111 (2002)
Mitkov, R.: Multilingual Anaphora Resolution. Machine Translation 14(3-4), 281–299 (1999)
Oliveira, P.R., Romero, R.A.F.: Enhanced ICA Mixture Model for Unsupervised Classification. In: Lemaître, C., Reyes, C.A., González, J.A. (eds.) IBERAMIA 2004. LNCS (LNAI), vol. 3315, pp. 205–214. Springer, Heidelberg (2004)
Lee, T., Lewicki, M.S., Sejnowski, T.J.: ICA mixture models for unsupervised classification of non-Gaussian classes and automatic context switching in blind signal separation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(10), 1078–1089 (2000)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cuevas, R.R.M., Honda, W.Y., de Lucena, D.J., Paraboni, I., Oliveira, P.R. (2008). Portuguese Pronoun Resolution: Resources and Evaluation. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2008. Lecture Notes in Computer Science, vol 4919. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78135-6_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-78135-6_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78134-9
Online ISBN: 978-3-540-78135-6
eBook Packages: Computer ScienceComputer Science (R0)