Abstract
The paper describes the annotation principles developed for tagging speech acts in the “One Day of Speech” (ORD) corpus of Russian everyday speech, with special attention to the categories and subcategories of speech acts distinguished in the ORD. Speech act annotation is part of the pragmatic annotation of the corpus, which also includes the tagging of macro- and microepisodes of verbal communication. Speech acts are annotated on four levels: (1) the orthographic transcript with information on syntagmatic and phrasal boundaries, (2) the speaker’s code, (3) the main category of the speech act, and (4) its subcategory. The proposed annotation scheme was tested on 6 macroepisodes of everyday communication, in which 2250 speech acts were identified. Pragmatic annotation of the ORD corpus makes it possible to study everyday discourse in terms of speech acts and to examine the linguistic properties and patterns of speech acts of different types.
Acknowledgements
The annotation principles for tagging macroepisodes were developed with the support of the Russian Foundation for Humanities (project # 12-04-12017, “Information System of Communication Scenarios of Russian Spontaneous Speech”). The presented statistics were obtained within the framework of the project “Everyday Russian Language in Different Social Groups”, supported by the Russian Science Foundation (project # 14-18-02070).
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Sherstinova, T. (2016). Speech Acts Annotation of Everyday Conversations in the ORD Corpus of Spoken Russian. In: Ronzhin, A., Potapova, R., Németh, G. (eds) Speech and Computer. SPECOM 2016. Lecture Notes in Computer Science, vol 9811. Springer, Cham. https://doi.org/10.1007/978-3-319-43958-7_76
DOI: https://doi.org/10.1007/978-3-319-43958-7_76
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43957-0
Online ISBN: 978-3-319-43958-7
eBook Packages: Computer Science (R0)