Abstract
Product order decision-making is an important feature of inventory control in supply chains. The beer game represents a typical task in this process. Recent approaches that have applied the agent model to the beer game have shown. Q-learning performing better than genetic algorithm (GA). However, flexibly adapting to dynamic environment is difficult for these approaches because their learning algorithm assume a static environment. As exploitation-oriented reinforcement learning algorithm are robust in dynamic environments, this study, approaches the beer game using profit sharing, a typical exploitation-oriented agent learning algorithm, and verifies its result’s validity by comparing performances.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Stockheim, T., Schwind, M., Koenig, W.: A Reinforcement Learning Approach for Supply Chain Management. In: 1st Europian Workshop on Malti Agent Systems (2003)
Kimbrough, S.O., Wu, D.J., Zhong, F.: Computer play the beer game-can artificial agents manage supply chains? Decision Support Systems 33(3), 323–333 (2002)
Sutton, R., Barto, A.: Reinforcement Learning. MIT Press (1998)
Watkins, C.J.H., Dayan, P.: Technical note: Q-learning. Machine Learning 8, 55–68 (1992)
van Tongeren, T., Kaymak, U., Naso, D., van Asperen, E.: Q-Learning in a Competitive Supply Chain. In: IEEE International Conference on Systems, Man and Cybernetics, ISIC, pp. 1211–1216 (2007)
Kamal Chaharsooghi, S., Heydari, J., Hessameddin Zegordi, S.: A reinforcement learning model for supply chain ordering management-An application to the beer game. Decision Support Systems 45(4), 949–959 (2008)
Grefenstette, J.J.: Credit Assignment in Rule Discovery System Based on Genetic Algorithms. Machine Learning 3, 225–245 (1988)
Arai, S., Miyazaki, K., Kobayashi, S.: Methodology in Multi-Agent Reinforcement Learning: Approaches by Q-Learning and Profit Sharing. Transaction of the Japanese Society for Artificial Intelligence 13(4), 609–618 (1998) (in Japanese)
Iyer, A., Seshadri, S., Vasher, R.: Toyotafs Supply Chain Management: A Strategic Approach to Toyota’s Renowned System. McGraw-Hill Education (2009)
Ichikawa, M., Koyama, Y., Deguchi, H.: Human and Agent Playing the“Beer Game”. Developments in Business Simulation and Experiential Learning 35, 231–237 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saitoh, F., Utani, A. (2013). Coordinated Rule Acquisition of Decision Making on Supply Chain by Exploitation-Oriented Reinforcement Learning. In: Mladenov, V., Koprinkova-Hristova, P., Palm, G., Villa, A.E.P., Appollini, B., Kasabov, N. (eds) Artificial Neural Networks and Machine Learning – ICANN 2013. ICANN 2013. Lecture Notes in Computer Science, vol 8131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40728-4_67
Download citation
DOI: https://doi.org/10.1007/978-3-642-40728-4_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40727-7
Online ISBN: 978-3-642-40728-4
eBook Packages: Computer ScienceComputer Science (R0)