Abstract
Nonverbal behavior plays an important role in any human–human interaction. Teaching—an inherently social activity—is not an exception. So far, the effect of nonverbal behavioral cues accompanying lecture delivery was investigated in the case of traditional ex-cathedra lectures, where students and teachers are co-located. However, it is becoming increasingly more frequent to watch lectures online and, in this new type of setting, it is still unclear what the effect of nonverbal communication is. This article tries to address the problem and proposes experiments performed over the lectures of a popular web repository (“Videolectures”). The results show that automatically extracted nonverbal behavioral cues (prosody, voice quality and gesturing activity) predict the ratings that “Videolectures” users assign to the presentations.
Similar content being viewed by others
Notes
As of September 2011.
The list of videos used in the experiment is available at https://pavisdata.iit.it/data/salvagnini/RatingPrediction_VL/RP_VL90v_INFOCOM2013.pdf.
References
Ambady N, Rosenthal R (1992) Thin slices of expressive behavior as predictors of interpersonal consequences: a meta-analysis. Psychol Bull 111(2):256–274
Ambady N, Rosenthal R (1993) Half a minute: predicting teacher evaluations from thin slices of nonverbal behavior and physical attractiveness. J Pers Soc Psychol 64(3):431–431
Babad E, Bernieri F, Rosenthal R (1991) Students as judges of teachers’ verbal and nonverbal behavior. Am Educ Res J 28(1):211–234
Bazzani L, Cristani M, Tosato D, Farenzena M, Paggetti G, Menegaz G, Murino V (2013) Social interactions by visual focus of attention in a three-dimensional environment. Expert Syst 30(2):115–127
Cristani M, Murino V, Vinciarelli A (2010) Socially intelligent surveillance and monitoring: analysing social dimensions of physical space. In: IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW 2010), pp 51–58
Cristani M, Paggetti G, Vinciarelli A, Bazzani L, Menegaz G, Murino V (2011a) Towards computational proxemics: inferring social relations from interpersonal distances. In: Proceeedings of IEEE international conference on social computing, pp 290–297
Cristani M, Pesarin A, Drioli C, Tavano A, Perina A, Murino V (2011b) Generative modeling and classification of dialogs by a low-level turn-taking feature. Pattern Recogn 44(8):1785–1800
Cristani M, Pesarin A, Vinciarelli A, Crocco M, Murino V (2011c) Look at who’s talking: voice activity detection by automated gesture analysis. In: Am I workshops, pp 72–80
Curhan J, Pentland A (2007) Thin slices of negotiation: predicting outcomes from conversational dynamics within the first 5 minutes. J Appl Psychol 92(3):802–811
Delaherche E, Chetouani M, Mahdhaoui A, Saint-Georges C, Viaux S, Cohen D (2012) Interpersonal synchrony: a survey of evaluation methods across disciplines. IEEE Trans Affect Comput 3(3):349–365
Fini A (2009) The technological dimension of a massive open online course: the case of the cck08 course tools. Int Rev Res Open Dist Learn 10(5):1–20
Harris M, Rosenthal R (2005) No more teachers’ dirty looks: effects of teacher nonverbal behavior on student outcomes. In: Riggio R, Feldman R (eds) Applications of nonverbal communication. Lawrence Erlbaum, London, pp 157–192
Jayagopi D, Hung H, Yeo C, Gatica-Perez D (2009) Modeling dominance in group conversations from non-verbal activity cues. IEEE Trans Audio Speech Lang Process 17(3):501–513
Johnson WL, Rickel J (1997) Steve: an animated pedagogical agent for procedural training in virtual environments. ACM SIGART Bull 8(1–4):16–21
Knapp M, Hall J (1972) Nonverbal communication in human interaction. Harcourt Brace College Publishers, New York
Kop R (2011) The challenges to connectivist learning on open online networks: learning experiences during a Massive Open Online Course. Int Rev Res Open Dist Learn Special Issue-Connect Des Deliv Soc Netw Learn 12(3):19–38
Kop R, Fournier H, Mak J (2011) A pedagogy of abundance or a pedagogy to support human beings? Participant support on massive open online courses. Int Rev Res Open Dist Learn 12(7):74–93
Krämer NC, Bente G (2010) Personalizing e-learning. the social effects of pedagogical agents. Educ Psychol Rev 22(1):71–87
Kuncheva L (2002) A theoretical study on six classifier fusion strategies. IEEE Trans Pattern Anal Mach Intell 24(2):281–286
Liu C (2009) Beyond pixels: exploring new representations and applications for motion analysis. PhD Thesis, Massachusetts Institute of Technology
Mairesse F, Walker MA, Mehl MR, Moore RK (2007) Using linguistic cues for the automatic recognition of personality in conversation and text. J Artif Intell Res 30:457–500
Martin F (2012) Will Massive Open Online Courses change the way we teach? Commun ACM 55(8):28–30
Mohammadi G, Vinciarelli A (2012) Automatic personality perception: prediction of trait attribution based on prosodic features. IEEE Trans Affect Comput 3(3):273–284
Moran M, Seaman J, Tinti-Kane H (2011) Teaching, learning, and sharing: how today’s higher education faculty use social media. Technical Report, Pearson Education
Morency LP (2010) Modeling human communication dynamics. IEEE Signal Process Mag 27(5):112–116
Nass C, Brave S (2005) Wired for speech: how voice activates and advances the human–computer relationship. MIT Press, Cambridge
Otsuka K, Takemae Y, Yamato J (2005) A probabilistic inference of multiparty-conversation structure based on Markov-switching models of gaze patterns, head directions, and utterances. In: Proceedings of ACM international conference on multimodal interfaces, pp 191–198
Pesarin A, Cristani M, Murino V, Vinciarelli A (2012) Conversation analysis at work: detection of conflict in competitive discussions through semi-automatic turn-organization analysis. Cogn Process 13(2):533–540
Petrillo M, Cutugno F (2003) A syllable segmentation algorithm for English and Italian. In: Proceedings of Eurospeech, pp 2913–2916
Pianesi F, Zancanaro M, Not E, Leonardi C, Falcon V, Lepri B (2008) Multimodal support to group dynamics. Pers Ubiquit Comput 12(3):181–195
Picard R (2000) Affective computing. MIT Press, Cambridge
Reeves B, Nass C (1996) The media equation: how people treat computers, television, and new media like real people and places. Cambridge University Press, Cambridge
Richmond V, McCroskey J (1995) Nonverbal behaviors in interpersonal relations. Allyn and Bacon, Newton
Rienks R, Heylen D (2006) Dominance detection in meetings using easily obtainable features. In: Lecture notes in computer science, Springer, vol 3869, pp 76–86
Rienks R, Zhang D, Gatica-Perez D (2006) Detection and application of influence rankings in small group meetings. In: Proceedings of the international conference on multimodal interfaces, pp 257–264
Rodriguez T, Plax TG, Kearney P (1996) Clarifying the relationship between teacher nonverbal immediacy and student cognitive learning: affective learning as the central causal mediator. Commun Educ 45(4):293–305
Saint-Georges C, Cassel R, Cohen D, Chetouani M, Laznik MC, Maestro S, Muratori F (2010) What studies of family home movies can teach us about autistic infants: a literature review. Res Autism Spect Disord 4(3):355–366
Salamin H, Vinciarelli A (2012) Automatic role recognition in multiparty conversations: an approach based on turn organization, prosody, and conditional random fields. IEEE Trans Multimedia 14(2):338–345
Salvagnini P, Salamin H, Cristani M, Vinciarelli A, Murino V (2012) Learning how to teach from “videolectures”: automatic prediction of lecture ratings based on teacher’s nonverbal behavior. In: Proceedings of IEEE international conference on cognitive infocommunications, pp 415–419
Shaw E, Johnson WL, Ganeshan R (1999) Pedagogical agents on the web. In: Proceedings of the third annual conference on Autonomous Agents, ACM, pp 283–290
Smith H (1979) Nonverbal communication in teaching. Rev Educ Res 49(4):631–672
Tallent-Runnels M, Thomas J, Lan W, Cooper S, Ahern T, Shaw S, Liu X (2006) Teaching courses online: a review of the research. Rev Educ Res 76(1):93–135
Thalmann NM, Thalmann D (2012) Virtual humans: back to the future. In: Proceedings of graphics interface, pp 1–8
Vinciarelli A, Pantic M, Heylen D, Pelachaud C, Poggi I, D’Errico F, Schroeder M (2012) Bridging the gap between social animal and unsocial machine: a survey of social signal processing. IEEE Trans Affect Comput 3(1):69–87
Wharton T (2009) The pragmatics of non-verbal communication. Cambridge University Press, Cambridge
Wiggins J (ed) (1996) The five-factor model of personality. Guilford, New York
Woolfolk AE, Brooks D (1983) Nonverbal communication in teaching. Rev Res Educ 10:103–149
Yang Y, Ramanan D (2011) Articulated pose estimation with flexible mixtures-of-parts. In: IEEE conference on computer vision and pattern recognition, pp 1385–1392
Acknowledgments
The research that has led to this work has been supported in part by the European Community Seventh Framework Programme (FP7/2007–2013), under Grant Agreement No. 231287 (SSPNet). Dr. Cheng was supported by the Hankuk University of Foreign Studies Research Fund of 2013.
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Cheng, D.S., Salamin, H., Salvagnini, P. et al. Predicting online lecture ratings based on gesturing and vocal behavior. J Multimodal User Interfaces 8, 151–160 (2014). https://doi.org/10.1007/s12193-013-0142-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12193-013-0142-z