{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,14]],"date-time":"2024-09-14T07:50:34Z","timestamp":1726300234221},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2012,6,30]],"date-time":"2012-06-30T00:00:00Z","timestamp":1341014400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int J of Soc Robotics"],"published-print":{"date-parts":[[2012,11]]},"DOI":"10.1007\/s12369-012-0161-z","type":"journal-article","created":{"date-parts":[[2012,6,29]],"date-time":"2012-06-29T06:52:42Z","timestamp":1340952762000},"page":"331-342","source":"Crossref","is-referenced-by-count":27,"title":["Robot Learning from Failed Demonstrations"],"prefix":"10.1007","volume":"4","author":[{"given":"Daniel H.","family":"Grollman","sequence":"first","affiliation":[]},{"given":"Aude G.","family":"Billard","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,6,30]]},"reference":[{"key":"161_CR1","volume-title":"Neural inf proc systems","author":"P Abbeel","year":"2006","unstructured":"Abbeel P, Coates A, Quigley M, Ng AY (2006) An application of reinforcement learning to aerobatic helicopter flight. In: Neural inf proc systems"},{"issue":"5","key":"161_CR2","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1016\/j.robot.2008.10.024","volume":"57","author":"BD Argall","year":"2009","unstructured":"Argall BD, Chernova S, Veloso M, Browning B (2009) A survey of robot learning from demonstration. Robot Auton Syst 57(5):469\u2013483","journal-title":"Robot Auton Syst"},{"key":"161_CR3","series-title":"Handbook of robotics","volume-title":"Survey: robot programming by demonstration","author":"A Billard","year":"2008","unstructured":"Billard A, Calinon S, Dillmann R, Schaal S (2008) Survey: robot programming by demonstration. Handbook of robotics. MIT Press, Cambridge"},{"key":"161_CR4","volume-title":"Intl joint conf on autonomous agents and multi-agent systems","author":"S Chernova","year":"2007","unstructured":"Chernova S, Veloso M (2007) Confidence-based policy learning from demonstration using Gaussian mixture models. In: Intl joint conf on autonomous agents and multi-agent systems"},{"issue":"2","key":"161_CR5","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1162\/neco.1997.9.2.271","volume":"9","author":"P Dayan","year":"1997","unstructured":"Dayan P, Hinton G (1997) Using expectation-maximization for reinforcement learning. Neural Comput 9(2):271\u2013278","journal-title":"Neural Comput"},{"key":"161_CR6","volume-title":"Intl conf on machine learning","author":"MP Deisenroth","year":"2011","unstructured":"Deisenroth MP, Rasmussen CE (2011) Pilco: a model-based and data-efficient approach to policy search. In: Intl conf on machine learning"},{"key":"161_CR7","volume-title":"Intl conf on robotics and automation","author":"S Dong","year":"2011","unstructured":"Dong S, Williams B (2011) Motion learning in variable environments using probabilistic flow tubes. In: Intl conf on robotics and automation"},{"key":"161_CR8","volume-title":"Intl conf on humanoid robots","author":"A Gams","year":"2010","unstructured":"Gams A, Do M, Ude A, Asfour T, Dillmann R (2010) On-line periodic movement and force-profile learning for adaptation to new surfaces. In: Intl conf on humanoid robots"},{"key":"161_CR9","volume-title":"Robotics: science and systems","author":"DB Grimes","year":"2006","unstructured":"Grimes DB, Chalodhorn R, Rao RPN (2006) Dynamic imitation in a humanoid robot through nonparametric probabilistic inference. In: Robotics: science and systems"},{"key":"161_CR10","volume-title":"Intl conf on robotics and automation","author":"DH Grollman","year":"2011","unstructured":"Grollman DH, Billard A (2011) Donut as I do: Learning from failed demonstrations. In: Intl conf on robotics and automation"},{"key":"161_CR11","volume-title":"Intl conf on robotics and automation","author":"DH Grollman","year":"2007","unstructured":"Grollman DH, Jenkins OC (2007) Dogged learning for robots. In: Intl conf on robotics and automation"},{"key":"161_CR12","doi-asserted-by":"crossref","unstructured":"Hersch M, Guenter F, Calinon S, Billard A (2008) Dynamical system modulation for robot learning via kineshetic demonstrations. Trans Robot, 1463\u20131467","DOI":"10.1109\/TRO.2008.2006703"},{"issue":"1","key":"161_CR13","first-page":"1","volume":"4","author":"X Hu","year":"2004","unstructured":"Hu X, Xu L (2004) Investigation on several model selection criteria for determining the number of cluster. Neural Inf Process - Lett Rev 4(1):1\u201310","journal-title":"Neural Inf Process - Lett Rev"},{"issue":"1\u20132","key":"161_CR14","first-page":"171","volume":"84","author":"J Kober","year":"2010","unstructured":"Kober J, Peters J (2010) Policy search for motor primitives in robotics. Mach Learn 84(1\u20132):171\u2013203","journal-title":"Mach Learn"},{"key":"161_CR15","volume-title":"Intl conf on intelligent robots and systems","author":"J Kober","year":"2008","unstructured":"Kober J, Mohler B, Peters J (2008) Learning perceptual coupling for motor primitives. In: Intl conf on intelligent robots and systems"},{"key":"161_CR16","volume-title":"Intl conf on intelligent robots and systems","author":"K Kronander","year":"2011","unstructured":"Kronander K, Khansari Zadeh SM, Billard A (2011) Learning to control planar hitting motions in a monigolf-like task. In: Intl conf on intelligent robots and systems"},{"issue":"6","key":"161_CR17","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1109\/70.338535","volume":"10","author":"Y Kuniyoshi","year":"1994","unstructured":"Kuniyoshi Y, Inaba M, Inoue H (1994) Learning by watching: Extracting reusable task knowledge from visual observation of human performance. IEEE Trans Robot Autom 10(6):799\u2013822","journal-title":"IEEE Trans Robot Autom"},{"issue":"5","key":"161_CR18","doi-asserted-by":"crossref","first-page":"838","DOI":"10.1037\/0012-1649.31.5.838","volume":"31","author":"AN Meltzoff","year":"1995","unstructured":"Meltzoff AN (1995) Understanding the intentions of others: re-enactment of intended acts by 18-month-old children. Dev Psychol 31(5):838\u2013850","journal-title":"Dev Psychol"},{"issue":"1","key":"161_CR19","first-page":"49","volume":"3","author":"T Mtsui","year":"2002","unstructured":"Mtsui T, Inuzuka N, Seki H (2002) Adapting to subsequent changes of environment by learning policy preconditions. Int J Comput Inf Sci 3(1):49\u201358","journal-title":"Int J Comput Inf Sci"},{"key":"161_CR20","volume-title":"Learning in graphical models","author":"R Neal","year":"1998","unstructured":"Neal R, Hinton GE (1998) A view of the EM algorithm that justifies incremental, sparse, and other variants. In: Learning in graphical models"},{"key":"161_CR21","volume-title":"Intl conf on robotics and automation","author":"P Pastor","year":"2011","unstructured":"Pastor P, Kalakrishnan M, Chitta S, Theodorou E, Schaal S (2011) Skill learning and task outcome prediction for manipulation. In: Intl conf on robotics and automation"},{"key":"161_CR22","volume-title":"Intl joint conf on artificial intelligence","author":"D Ramachandran","year":"2007","unstructured":"Ramachandran D, Amir E (2007) Bayesian inverse reinforcement learning. In: Intl joint conf on artificial intelligence"},{"key":"161_CR23","unstructured":"Sung HG (2004) Gaussian mixture regression and classification. PhD thesis, Rice"},{"issue":"2\u20133","key":"161_CR24","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1080\/09540090802091917","volume":"20","author":"AL Thomaz","year":"2008","unstructured":"Thomaz AL, Breazeal C (2008) Experiments in socially guided exploration: lessons learned in building robots that learn with and without human teachers. Connect Sci 20(2\u20133):91\u2013110","journal-title":"Connect Sci"},{"issue":"2","key":"161_CR25","first-page":"41","volume":"72","author":"SC Want","year":"2001","unstructured":"Want SC, Harris PL (2001) Learning from other people\u2019s mistakes: causal understanding in learning to use a tool. Child Dev 72(2):41\u2013443","journal-title":"Child Dev"}],"container-title":["International Journal of Social Robotics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s12369-012-0161-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s12369-012-0161-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s12369-012-0161-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,30]],"date-time":"2019-06-30T07:34:34Z","timestamp":1561880074000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s12369-012-0161-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,6,30]]},"references-count":25,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,11]]}},"alternative-id":["161"],"URL":"https:\/\/doi.org\/10.1007\/s12369-012-0161-z","relation":{},"ISSN":["1875-4791","1875-4805"],"issn-type":[{"value":"1875-4791","type":"print"},{"value":"1875-4805","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,6,30]]}}}