Abstract
This chapter introduces a simulator for incremental human-machine dialogue, designed to generate artificial dialogue datasets that can be used to train and test data-driven methods. We review the simulator components in detail, including an unstable speech recognizer, and explain how they differ from their non-incremental counterparts. Then, to illustrate its capabilities, an incremental strategy based on hand-crafted rules is implemented and compared to several non-incremental baselines. Their performance in terms of dialogue efficiency is reported under different noise conditions and shows that the simulator can handle several configurations that are representative of real usage.
Notes
- 1. This rule is kept as an indication only: it reflects the French way of telling time and does not apply to English.
- 2. Here, the value of the priority variable indicates the importance of the task: the higher the priority, the more important the task.
- 3. We do not claim that the Intent Manager algorithm solves the task optimally, and some pathological examples are not handled at all. Nevertheless, it meets our objective of having a simple algorithm able to run realistic dialogues in order to study turn-taking mechanisms.
- 4. This N-Best corresponds to the last input word only. It is important to distinguish it from the N-Best corresponding to the last partial utterance as a whole. In Fig. 3, the block New word N-Best is a word N-Best, whereas the other three blocks are partial-utterance N-Bests.
- 5. Here, we use only the best hypothesis of the N-Best. However, the others are used indirectly through the boost mechanism.
Acknowledgements
This work is part of the FUI project VoiceHome.
Copyright information
© 2017 Springer Science+Business Media Singapore
About this chapter
Cite this chapter
Khouzaimi, H., Laroche, R., Lefèvre, F. (2017). Incremental Human-Machine Dialogue Simulation. In: Jokinen, K., Wilcock, G. (eds) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol 427. Springer, Singapore. https://doi.org/10.1007/978-981-10-2585-3_4
DOI: https://doi.org/10.1007/978-981-10-2585-3_4
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2584-6
Online ISBN: 978-981-10-2585-3
eBook Packages: Engineering, Engineering (R0)