Abstract
The cerebellum is involved in avoidance learning tasks, where anticipatory actions are developed to protect against aversive stimuli. In the execution and acquisition of discrete actions we can distinguish errors of omission and commission due to a failure to execute a required defensive Conditioned Response (CR) to avoid an aversive Unconditioned Stimulus (US), and the energy expenditure of triggering an unnecessary CR in the absence of a US respectively. Hence, a motor learning cost function must consider both these components of performance and energy expenditure. Unlike remaining noxious stimuli, unnecessary actions are not directly sensed by the cerebellum. It has been suggested that the Nucleo-Olivary Inhibition (NOI) serves to internally rely information about these needless protective actions. Here we argue that the function of the NOI can be interpreted in broader terms as a signal that is used to learn optimal actions in terms of cost. We work with a computational model of the cerebellum to address: (i) how can the optimum balance between remaining aversive stimuli and preventing effort be found, and (ii) how can the cerebellum use the overall cost information to establish this optimum balance through the adjusting of the gain of the NOI. In this paper we derive the value of the NOI that minimizes the overall cost and propose a learning rule for the cerebellum through which this value is reached. We test this rule in a collision avoidance task performed by a simulated robot.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aström, K.J., Murray, R.M.: Feedback systems: an introduction for scientists and engineers. Princeton University Press (2010)
Dean, P., Porrill, J., Ekerot, C.-F., Jörntell, H.: The cerebellar microcircuit as an adaptive filter: experimental and computational evidence. Nature Reviews Neuroscience 11(1), 30–43 (2009)
Doya, K.: What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Networks 12(7), 961–974 (1999)
Fibla, M.S., Bernardet, U., Verschure, P.F.: Allostatic control for robot behaviour regulation: An extension to path planning. In: 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1935–1942. IEEE (2010)
Fujita, M.: Simulation of adaptive modification of the vestibulo-ocular reflex with an adaptive filter model of the cerebellum. Biological Cybernetics 45(3), 207–214 (1982)
Herreros, I.: Data-Driven Batch Scheduling. PhD thesis, Universitat Pompeu Fabra. Departament de Tecnologies de la Informaci i les Comunicacions (2013)
Herreros, I., Maffei, G., Brandi, S., Sanchez-Fibla, M., Verschure, P.F.: Speed generalization capabilities of a cerebellar model on a rapid navigation task. In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 363–368. IEEE (2013)
Herreros, I., Verschure, P.F.: Nucleo-olivary inhibition balances the interaction between the reactive and adaptive layers in motor control. Neural Networks (2013)
Hesslow, G., Yeo, C.: The functional anatomy of skeletal conditioning. In: A Neuroscientist’s Guide to Classical Conditioning, pp. 86–146. Springer, Heidelberg (2002)
Hollerman, J.R., Schultz, W.: Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience 1(4), 304–309 (1998)
Lepora, N.F., Porrill, J., Yeo, C.H., Dean, P.: Sensory prediction or motor control application of marr–albus type models of cerebellar function to classical conditioning. Frontiers in Computational Neuroscience 4 (2010)
Medina, J., Nores, W., Mauk, M.: Inhibition of climbing fibres is a signal for the extinction of conditioned eyelid responses. Nature 416(6878), 330–333 (2002)
Miall, R.C., Christensen, L.O., Cain, O., Stanley, J.: Disruption of state estimation in the human lateral cerebellum. PLoS biology 5(11), e316 (2007)
Schultz, W.: Getting formal with dopamine and reward. Neuron 36(2), 241–263 (2002)
Spanagel, R., Weiss, F.: The dopamine hypothesis of reward: past and current status. Trends in Neurosciences 22(11), 521–527 (1999)
Thompson, R.F.: The search for the engram. American Psychologist 31(3), 209 (1976)
Winship, I., Pakan, J., Todd, K., Wong-Wylie, D.: A comparison of ventral tegmental neurons projecting to optic flow regions of the inferior olive vs. The Hippocampal Formation 141(1), 463–473 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Brandi, S., Herreros, I., Verschure, P.F.M.J. (2014). Optimization of the Anticipatory Reflexes of a Computational Model of the Cerebellum. In: Duff, A., Lepora, N.F., Mura, A., Prescott, T.J., Verschure, P.F.M.J. (eds) Biomimetic and Biohybrid Systems. Living Machines 2014. Lecture Notes in Computer Science(), vol 8608. Springer, Cham. https://doi.org/10.1007/978-3-319-09435-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-09435-9_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09434-2
Online ISBN: 978-3-319-09435-9
eBook Packages: Computer ScienceComputer Science (R0)