The Effects of Synthesized Voice Accents on User Perceptions of Robots | International Journal of Social Robotics Skip to main content
Log in

The Effects of Synthesized Voice Accents on User Perceptions of Robots

  • Original Paper
  • Published:
International Journal of Social Robotics Aims and scope Submit manuscript

Abstract

Human voice accents have been shown to affect people’s perceptions of the speaker, but little research has looked at how synthesized voice accents affect perceptions of robots. This research investigated people’s perceptions of three synthesized voice accents. Three male robot voices were generated: British (UK), American (US), and New Zealand (NZ). In study one, twenty adults listened through headphones to a recorded script repeated in the three different accents, rated the nationality, roboticness, and overall impression of each voice, and chose their preferred accent. Study two used these voices on a healthcare robot to investigate the influence of accent on user perceptions of the robot. Ninety-one individuals were randomized to one of three conditions. In each condition they interacted with a healthcare robot that assisted with blood pressure measurement but the conditions differed in the accent the robot spoke with. In study one, each accent was correctly identified. There was no difference in impression ratings of each voice, but the US accent was rated as more robotic than the NZ accent, and the UK accent was preferred to the US accent. Study two showed that people randomized to the NZ accent had more positive feelings towards the robot and rated the robot’s overall performance as higher compared to the robot with the US voice. These results suggest that the employment of a less robotic voice with a local accent may positively affect user perceptions of robots.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Alamsaputra DM, Kohnert KJ, Munson B, Reichle J (2006) Synthesized speech intelligibility among native speakers and non-native speakers of English. Augment Altern Commun 22:258–268

    Article  Google Scholar 

  2. Aronovitch CD (1976) The voice of personality: Stereotyped judgment and their relation to voice quality and sex of speaker. J Soc Psychol 99:207–220

    Article  Google Scholar 

  3. Arras KO, Cerqui D (2005) Do we want to share our lives and bodies with robots? A 2000 people survey. Technical report, Autonomous Systems Lab, Swiss Federal Institute of Technology Lausanne

  4. Atrash A, Kaplow R, Villemure J, West R, Yamani H, Pineau J (2009) Development and validation of a robust speech interface for improved human-robot interaction. Int J Soc Robot 1:345–356

    Article  Google Scholar 

  5. Ball P (1983) Stereotypes of anglo-saxon and non anglo-saxon accents: some exploratory Australian studies with the matched guise technique. Lang Sci 5:163–183

    Article  Google Scholar 

  6. Bayard D (1995) Kiwitalk: sociolinguistics and New Zealand society. Dunmore Press, Palmerston North

    Google Scholar 

  7. Bayard D (1999) The cultural cringe revisited: changes through time in KIWI Attitudes towards accents. In: Bell A, Kuiper K (eds) New Zealand English. Benjamins, Amsterdam, pp 297–324

    Google Scholar 

  8. Bayard D, Weatherall A, Gallois C, Pittam J (2001) Pax Americana? Accent attitudinal evaluations in New Zealand, Australia, and America. J Socioling 5:22–49

    Article  Google Scholar 

  9. Bennewitz M, Faber F, Joho D, Behnke S (2007) Fritz—a humanoid communication robot. In: Proc IEEE international workshop of robot and human interactive communication (ROMAN), Jeju Island, Korea, pp 1072–1077

    Google Scholar 

  10. Berry DS, Hansen JS (1996) Positive affect, negative affect, and social interaction. J Pers Soc Psychol 71:796–809

    Article  Google Scholar 

  11. Black AW, Lenzo KA (2007) Building synthetic voices. http://festvox.org/festvox/festvox_toc.html

  12. Black AW, Taylor P, Caley R (1999) The festival speech synthesis system. http://www.cstr.ed.ac.uk/projects/festival/

  13. Breazeal C (2001) Emotive qualities in robot speech. In: Proc the 2001 IEEE/RSJ international conference on intelligent robots and systems, Maui, pp 1388–1394

    Google Scholar 

  14. Broadbent E, MacDonald BA, Jago L, Juergens M, Mazharullah O (2007) Human reactions to good and bad robots. In: Proc IEEE/RSJ international conference on intelligent robots and systems IROS, pp 3703–3708

    Google Scholar 

  15. Cargile A, Giles H (1997) Understanding language attitudes: exploring listener affect and identity. Lang Commun 17:195–217

    Article  Google Scholar 

  16. Cesta A, Cortellessa G, Giuliani MV, Pecora F, Scopelliti M, Tiberio L (2007) Psychological implications of domestic assistive technology for the elderly. Psychol J 5:229–252

    Google Scholar 

  17. Fitt S (2000) Documentation and user guide to Unisyn Lexicon and Post-Lexical rules. Technical report, Centre for Speech Technology Research, University of Edinburgh

  18. Giles H (1970) Evaluative reactions to accents. Educ Rev 22:211–227

    Article  Google Scholar 

  19. Giles H, Williams A, Mackie DM, Rosselli F (1995) Reactions to Anglo- and Hispanic-American-accented speakers: Affect, identity, persuasion, and the English-only controversy. Lang Commun 15:107–120

    Article  Google Scholar 

  20. Goetz J, Kiesler S, Powers A (2003) Matching robot appearance and behavior to tasks to improve human-robot cooperation. In: Proc the 12th IEEE international symposium on robot and human interactive communication, Millbrae, California, USA, pp 55–60

    Google Scholar 

  21. Hall JA, Roter DL, Rand CS (1981) Communication of affect between patient and physician. J Health Soc Behav 22:18–30

    Article  Google Scholar 

  22. Huygens I, Vaughan GM (1983) Language attitudes, ethnicity and social class in New Zealand. J Multiling Multicult Dev 4:207–223

    Article  Google Scholar 

  23. Igic A, Watson CI, Teutenberg J, Tamagawa R, Macdonald BA, Broadbent E (2009) Towards a flexible platform for voice accent and expression selection on a Healthcare Robot. In: Proc the 2009 Australasian language technology workshop, Sydney, pp 109–113

    Google Scholar 

  24. Kuo IH, Rabindran J, Broadbent E, Lee YI, Kerse N, Stafford R, MacDonald BA (2009) Age and gender factors in user acceptance of healthcare robots. In: Proc the 18th IEEE international symposium on robot and human interactive communication, Toyama, Japan, pp 214–219

    Chapter  Google Scholar 

  25. LeBaron S, Reyher J, Stack JM (1985) Paternalistic vs egalitarian physician styles: The treatment of patients in crisis. J Fam Pract 21:56–62

    Google Scholar 

  26. Li X, Watson CI, Igic A, Macdonald BA (2009) Expressive speech for a virtual talking head. In: Australasian conference on robotics and automation, Sydney

    Google Scholar 

  27. Luhman R (1990) Appalachian English stereotypes: Language attitudes in Kentucky. Lang Soc 19:331–348

    Article  Google Scholar 

  28. Mayer RE, Sobko K, Mautone PD (2003) Social cues in multimedia learning: roles of speaker’s voice. J Educ Psychol 95:419–425

    Article  Google Scholar 

  29. Mullennix JW, Johnson K, Topcu-Durgun M, Farnsworth LW (1995) The perceptual representation of voice gender. J Acoust Soc Am 98:3080–3095

    Article  Google Scholar 

  30. Mullennix JW, Stern SE, Wilson SJ, Dyson C (2003) Social perception of male and female computer synthesized speech. Comput Hum Behav 19:407–424

    Article  Google Scholar 

  31. Nass C, Brave S (2005) Wired for speech: how voice activates and advances the human-computer relationship. MIT Press, Cambridge

    Google Scholar 

  32. Nass C, Gong L (2000) Social aspects of speech interfaces from an evolutionary perspective: experimental research and design implications. Commun ACM 43(9):36–43

    Article  Google Scholar 

  33. Niculescu AI, White GM, Lan SS, Waloejo RU, Kawaguchi Y (2008) Impact of English regional accents on user acceptance of voice user interfaces. In: Proc NordiCHI 2008, vol 358. ACM, New York, pp 523–526

    Chapter  Google Scholar 

  34. Oestreicher L (2007) Cognitive, social, sociable or just socially acceptable robots. In: Proc the 16th IEEE international symposium on robot and human interactive communication (ROMAN), Jeju Island, Korea, pp 558–563

    Chapter  Google Scholar 

  35. Pucher M, Schuchmann G, Fröhlich P (2008) Regionalized text-to-speech systems: Persona design and application scenarios. In: COST action 2102 school, Vietri sul Mare, Italy. Lecture notes in artificial intelligence (LNAI), vol 5398, pp 216–222

    Google Scholar 

  36. Robins B, Dautenhahn K, te Boekhorst R, Billard A (2004) Robots as assistive technology—does appearance matter. In: Proc the 13th IEEE international workshop on robot and human interactive communication (ROMAN), Okayama, Japan, pp 277–282

    Google Scholar 

  37. Roehling S, MacDonald BA, Watson C (2006) Towards expressive speech synthesis in English on a robot platform. In: Proc the 11th Australian international conference on speech science and technology, Auckland, New Zealand, pp 130–135

    Google Scholar 

  38. Stern SE (2008) Computer-synthesized speech and perceptions of the social influence of disabled users. J Lang Soc Psychol 27:254–265

    Article  Google Scholar 

  39. Tusing KJ, Dillard JP (2000) The sound of dominance: vocal precursors of perceived dominance during interpersonal influence. Hum Commun Res 26:148–171

    Google Scholar 

  40. Walters ML, Syrdal DS, Koay KL, Dautenhahn K, te Boekhorst R (2008) Human approach distance to a mechanical-looking robot with different robot voice styles. In: Proc the 17th IEEE international symposium on robot and human interactive communication, Munich, Germany, pp 707–712

    Chapter  Google Scholar 

  41. Watson D, Clark LA, Tellegen A (1988) Development and validation of a brief measure of positive and negative affect: the PANAS scales. J Pers Soc Psychol 54:1063–1070

    Article  Google Scholar 

  42. Watson CI, Teutenberg J, Thompson L, Roehling S, Igic A (2009) How to build a New Zealand voice. In: NZ linguistic society conference, Palmerston North

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elizabeth Broadbent.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tamagawa, R., Watson, C.I., Kuo, I.H. et al. The Effects of Synthesized Voice Accents on User Perceptions of Robots. Int J of Soc Robotics 3, 253–262 (2011). https://doi.org/10.1007/s12369-011-0100-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12369-011-0100-4

Keywords

Navigation