Coordinated Rule Acquisition of Decision Making on Supply Chain by Exploitation-Oriented Reinforcement Learning

Saitoh, Fumiaki; Utani, Akihide

doi:10.1007/978-3-642-40728-4_67

Fumiaki Saitoh²² &
Akihide Utani²³

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8131))

Included in the following conference series:

International Conference on Artificial Neural Networks

6295 Accesses
4 Citations

Abstract

Product order decision-making is an important feature of inventory control in supply chains. The beer game represents a typical task in this process. Recent approaches that have applied the agent model to the beer game have shown. Q-learning performing better than genetic algorithm (GA). However, flexibly adapting to dynamic environment is difficult for these approaches because their learning algorithm assume a static environment. As exploitation-oriented reinforcement learning algorithm are robust in dynamic environments, this study, approaches the beer game using profit sharing, a typical exploitation-oriented agent learning algorithm, and verifies its result’s validity by comparing performances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Multi-agent system approach applied to a manufacturer’s supply chain using global objective function and learning concepts

Article 08 February 2017

The Emergence of Periodic Properties of Ordering Strategies Under Disruption in the Beer Game

Reinforcement Learning-Based Adaptive Operator Selection

References

Stockheim, T., Schwind, M., Koenig, W.: A Reinforcement Learning Approach for Supply Chain Management. In: 1st Europian Workshop on Malti Agent Systems (2003)
Google Scholar
Kimbrough, S.O., Wu, D.J., Zhong, F.: Computer play the beer game-can artificial agents manage supply chains? Decision Support Systems 33(3), 323–333 (2002)
Article Google Scholar
Sutton, R., Barto, A.: Reinforcement Learning. MIT Press (1998)
Google Scholar
Watkins, C.J.H., Dayan, P.: Technical note: Q-learning. Machine Learning 8, 55–68 (1992)
Google Scholar
van Tongeren, T., Kaymak, U., Naso, D., van Asperen, E.: Q-Learning in a Competitive Supply Chain. In: IEEE International Conference on Systems, Man and Cybernetics, ISIC, pp. 1211–1216 (2007)
Google Scholar
Kamal Chaharsooghi, S., Heydari, J., Hessameddin Zegordi, S.: A reinforcement learning model for supply chain ordering management-An application to the beer game. Decision Support Systems 45(4), 949–959 (2008)
Article Google Scholar
Grefenstette, J.J.: Credit Assignment in Rule Discovery System Based on Genetic Algorithms. Machine Learning 3, 225–245 (1988)
Google Scholar
Arai, S., Miyazaki, K., Kobayashi, S.: Methodology in Multi-Agent Reinforcement Learning: Approaches by Q-Learning and Profit Sharing. Transaction of the Japanese Society for Artificial Intelligence 13(4), 609–618 (1998) (in Japanese)
Google Scholar
Iyer, A., Seshadri, S., Vasher, R.: Toyotafs Supply Chain Management: A Strategic Approach to Toyota’s Renowned System. McGraw-Hill Education (2009)
Google Scholar
Ichikawa, M., Koyama, Y., Deguchi, H.: Human and Agent Playing the“Beer Game”. Developments in Business Simulation and Experiential Learning 35, 231–237 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Industrial and Systems Engineering, College of Science and Engineering, Aoyama Gakuin University, 5-10-1 Fuchinobe, Chuo-ku, Sagamihara City, Kanagawa, Japan
Fumiaki Saitoh
Department of Information and Communication Engineering, Faculty of Knowledge Engineering, Tokyo City University, 1-28-1, Tamadutumi, Setagaya-ku, Tokyo, Japan
Akihide Utani

Authors

Fumiaki Saitoh
View author publications
You can also search for this author in PubMed Google Scholar
Akihide Utani
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty Automation,, Technical University of Sofia, 8 St. Kl. Ohridski Blvd., 1000, Sofia, Bulgaria
Valeri Mladenov
Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, Acad. G. Bonchev str. bl.25A, 1113, Sofia, Bulgaria
Petia Koprinkova-Hristova
Institute of Neural Information Processing, University of Ulm, 89075, Ulm, Germany
Günther Palm
Quartier UNIL-Dorigny, Bâtiment Internef, Université de Lausanne, 1015, Lausanne, Switzerland
Alessandro E. P. Villa
Department of Computer Science, University of Milano, Via Comelico, 39, 20135, Milano, Italy
Bruno Appollini
Knowledge Engineering, School of Computing and Mathematical Sciences, Auckland University of Technology, 120 Mayoral Drive, 3rd floor, 1010, Auckland, New Zealand
Nikola Kasabov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saitoh, F., Utani, A. (2013). Coordinated Rule Acquisition of Decision Making on Supply Chain by Exploitation-Oriented Reinforcement Learning. In: Mladenov, V., Koprinkova-Hristova, P., Palm, G., Villa, A.E.P., Appollini, B., Kasabov, N. (eds) Artificial Neural Networks and Machine Learning – ICANN 2013. ICANN 2013. Lecture Notes in Computer Science, vol 8131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40728-4_67

Download citation

DOI: https://doi.org/10.1007/978-3-642-40728-4_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40727-7
Online ISBN: 978-3-642-40728-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Coordinated Rule Acquisition of Decision Making on Supply Chain by Exploitation-Oriented Reinforcement Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Multi-agent system approach applied to a manufacturer’s supply chain using global objective function and learning concepts

The Emergence of Periodic Properties of Ordering Strategies Under Disruption in the Beer Game

Reinforcement Learning-Based Adaptive Operator Selection

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Coordinated Rule Acquisition of Decision Making on Supply Chain by Exploitation-Oriented Reinforcement Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Multi-agent system approach applied to a manufacturer’s supply chain using global objective function and learning concepts

The Emergence of Periodic Properties of Ordering Strategies Under Disruption in the Beer Game

Reinforcement Learning-Based Adaptive Operator Selection

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation