Abstract
Speech recognition has proved to be a natural interaction modality and an effective technology for medical reporting, particularly in radiology. The high volume of text to be produced and the complex structure of these texts make voice technologies useful. By using speech, professionals in the field can generate reports at a speed that approaches that of traditional dictation methods.
However, integrating speech recognition into a user interface creates new problems: speech recognizers may introduce errors, and they should be adaptable to variations in spoken language.
This paper describes a radiological reporting system and the motivations for using the speech modality. A preliminary evaluation of the system has shown that, although text recall functions and keyword shortcuts are available, on average more than two thirds of a radiological report is generated by dictation.
Copyright information
© 1993 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Antoniol, G., Fiutem, R., Flor, R., Lazzari, G. (1993). Radiological reporting based on voice recognition. In: Bass, L.J., Gornostaev, J., Unger, C. (eds) Human-Computer Interaction. EWHCI 1993. Lecture Notes in Computer Science, vol 753. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-57433-6_53
DOI: https://doi.org/10.1007/3-540-57433-6_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-57433-0
Online ISBN: 978-3-540-48152-2
eBook Packages: Springer Book Archive