default search action
8th EWRL 2008: Villeneuve d'Ascq, France
- Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko:
Recent Advances in Reinforcement Learning, 8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30 - July 3, 2008, Revised and Selected Papers. Lecture Notes in Computer Science 5323, Springer 2008, ISBN 978-3-540-89721-7 - Boris Defourny, Damien Ernst, Louis Wehenkel:
Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees. 1-14 - Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin:
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning. 15-26 - Christos Dimitrakakis, Michail G. Lagoudakis:
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration. 27-40 - Kirill Dyagilev, Shie Mannor, Nahum Shimkin:
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. 41-54 - Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor:
Regularized Fitted Q-Iteration: Application to Planning. 55-68 - Sarah Filippi, Olivier Cappé, Fabrice Clérot, Eric Moulines:
A Near Optimal Policy for Channel Allocation in Cognitive Radio. 69-81 - Thomas Gabel, Martin A. Riedmiller:
Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets. 82-95 - Matthieu Geist, Olivier Pietquin, Gabriel Fricout:
Bayesian Reward Filtering. 96-109 - Sertan Girgin, Philippe Preux:
Basis Expansion in Natural Actor Critic Methods. 110-123 - Robby Goetschalckx, Scott Sanner, Kurt Driessens:
Reinforcement Learning with the Use of Costly Features. 124-135 - Verena Heidrich-Meisner, Christian Igel:
Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem. 136-150 - Jean-François Hren, Rémi Munos:
Optimistic Planning of Deterministic Systems. 151-164 - Yuxi Li, Dale Schuurmans:
Policy Iteration for Learning an Exercise Policy for American Options. 165-178 - Daniele Loiacono, Pier Luca Lanzi:
Tile Coding Based on Hyperplane Tiles. 179-190 - José David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, José Rafael Magdalena Benedicto, Juan Gómez-Sanchís:
Use of Reinforcement Learning in Two Real Applications. 191-204 - Francis Maes, Ludovic Denoyer, Patrick Gallinari:
Applications of Reinforcement Learning to Structured Prediction. 205-219 - Jan Peters, Jens Kober, Duy Nguyen-Tuong:
Policy Learning - A Unified Perspective with Applications in Robotics. 220-228 - Carl Edward Rasmussen, Marc Peter Deisenroth:
Probabilistic Inference for Fast Learning in Control. 229-242 - Noel Welsh, Jeremy L. Wyatt:
United We Stand: Population Based Methods for Solving Unknown POMDPs. 243-252 - Huizhen Yu, Dimitri P. Bertsekas:
New Error Bounds for Approximations from Projected Linear Equations. 253-267 - Jia Yuan Yu, Shie Mannor, Nahum Shimkin:
Markov Decision Processes with Arbitrary Reward Processes. 268-281
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.