Abstract
Human voice accents have been shown to affect people’s perceptions of the speaker, but little research has examined how synthesized voice accents affect perceptions of robots. This research investigated people’s perceptions of three synthesized male robot voices with British (UK), American (US), and New Zealand (NZ) accents. In study one, twenty adults listened through headphones to a recorded script repeated in each of the three accents, rated the nationality, roboticness, and overall impression of each voice, and chose their preferred accent. Study two used these voices on a healthcare robot to investigate the influence of accent on user perceptions of the robot. Ninety-one individuals were randomized to one of three conditions. In every condition, participants interacted with a healthcare robot that assisted with blood pressure measurement; the conditions differed in the accent the robot spoke with. In study one, each accent was correctly identified. Impression ratings did not differ between the voices, but the US accent was rated as more robotic than the NZ accent, and the UK accent was preferred to the US accent. In study two, people randomized to the NZ accent had more positive feelings towards the robot and rated its overall performance higher than those randomized to the US accent. These results suggest that a less robotic voice with a local accent may positively affect user perceptions of robots.
Cite this article
Tamagawa, R., Watson, C.I., Kuo, I.H. et al. The Effects of Synthesized Voice Accents on User Perceptions of Robots. Int J of Soc Robotics 3, 253–262 (2011). https://doi.org/10.1007/s12369-011-0100-4
DOI: https://doi.org/10.1007/s12369-011-0100-4