Budgeted Knowledge Transfer for State-Wise Heterogeneous RL Agents

Farshidian, Farbod; Talebpour, Zeinab; Ahmadabadi, Majid Nili

doi:10.1007/978-3-642-34475-6_53

Farbod Farshidian²⁰,
Zeinab Talebpour²⁰ &
Majid Nili Ahmadabadi²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7663))

Included in the following conference series:

International Conference on Neural Information Processing

3261 Accesses

Abstract

In this paper we introduce a budgeted knowledge transfer algorithm for non-homogeneous reinforcement learning agents. Here the source and the target agents are completely identical except in their state representations. The algorithm uses functional space (Q-value space) as the transfer-learning media. In this method, the target agent’s functional points (Q-values) are estimated in an automatically selected lower-dimension subspace in order to accelerate knowledge transfer. The target agent searches that subspace using an exploration policy and selects actions accordingly during the period of its knowledge transfer in order to facilitate gaining an appropriate estimate of its Q-table. We show both analytically and empirically that this method decreases the required learning budget for the target agent.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

S\(^{2}\)ES: a stationary and scalable knowledge transfer approach for multiagent reinforcement learning

Article Open access 13 July 2021

Learning in the Presence of Multiple Agents

Modeling and reinforcement learning in partially observable many-agent systems

Article 26 March 2024

References

Taylor, M.E., Stone, P.: Transfer learning for reinforcement learning domains: A survey. The Journal of Machine Learning Research 10, 1633–1685 (2009)
MathSciNet MATH Google Scholar
Torrey, L., Shavlik, J.: Transfer learning. In: Handbook of Research on Machine Learning Applications, vol. 3, pp. 17–35. IGI Global (2009)
Google Scholar
Lazaric: Knowledge transfer in reinforcement learning. PhD thesis, PhD thesis, Politecnico di Milano (2008)
Google Scholar
Tanaka, F., Yamamura, M.: Multitask reinforcement learning on the distribution ofMDPs. In: Proceedings. 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation, vol. 3, pp. 1108–1113 (2003)
Google Scholar
Taylor, M.E., Stone, P., Liu, Y.: Value functions for RL-based behavior transfer: Acomparative study. In: Proceedings of the National Conference on Artificial Intelligence, vol. 20, p. 880 (2005)
Google Scholar
Wilson, A., Fern, A., Ray, S., Tadepalli, P.: Multi-task reinforcement learning: a hierarchical bayesian approach. In: Proceedings of the 24th International Conference on Machine learning, pp. 1015–1022 (2007)
Google Scholar
Soni, V., Singh, S.: Using homeomorphisms to transfer options across continuous reinforcement learning domains. In: Proceedings of the National Conference on Artificial Inligence, vol. 21, p. 494 (2006)
Google Scholar
Mihalkova, L., Huynh, T., Mooney, R.J.: Mapping and revising Markov logic networksfor transfer learning. In: Proceedings of the National Conference on Artificial Intelligence, vol. 22, p. 608 (2007)
Google Scholar
Taylor, M.E., Whiteson, S., Stone, P.: Transfer via inter-task mappings in policy search reinforcement learning. In: Proceedings of the 6th International Joint Conference on Autonomous Agents and Multi-agent Systems, p. 37 (2007)
Google Scholar
Taylor, M.E., Jong, N.K., Stone, P.: Transferring Instances for Model-Based Reinforcement Learning. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 488–505. Springer, Heidelberg (2008)
Chapter Google Scholar
Driessens, K., Ramon, J., Croonenborghs, T.: Transfer learning for reinforcement learning through goal and policy parameterization. In: Proceedings of the ICML Workshop on Structural Knowledge Transfer for Machine Learning (Online Proceedings), p. 14 (2006)
Google Scholar
Pan, S.J., Kwok, J.T., Yang, Q.: Transfer learning via dimensionality reduction. In: Proceedings of the 23rd National Conference on Artificial Intelligence, vol. 2, pp. 677–682 (2008)
Google Scholar
Moore, A.W., Atkeson, C.G.: Prioritized sweeping: Reinforcement learning with lessdata and less time. Machine Learning 13(1), 103–130 (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

Cognitive Robotics Lab., School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Iran
Farbod Farshidian, Zeinab Talebpour & Majid Nili Ahmadabadi

Authors

Farbod Farshidian
View author publications
You can also search for this author in PubMed Google Scholar
Zeinab Talebpour
View author publications
You can also search for this author in PubMed Google Scholar
Majid Nili Ahmadabadi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Texas A&M University at Qatar, Education City, P.O. Box 23874, Doha, Qatar
Tingwen Huang
Department of Control Science and Engineering, Huazhong University of Science and Technology, 1037 Luoyu Road, 430074, Wuhan, Hubei, China
Zhigang Zeng
College of Computer Science, Chongqing University, 174 Shazhengjie Street, 400044, Chongqing, China
Chuandong Li
Department of Electronic Engineering, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong, China
Chi Sing Leung

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Farshidian, F., Talebpour, Z., Ahmadabadi, M.N. (2012). Budgeted Knowledge Transfer for State-Wise Heterogeneous RL Agents. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34475-6_53

Download citation

DOI: https://doi.org/10.1007/978-3-642-34475-6_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34474-9
Online ISBN: 978-3-642-34475-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Budgeted Knowledge Transfer for State-Wise Heterogeneous RL Agents

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

S\(^{2}\)ES: a stationary and scalable knowledge transfer approach for multiagent reinforcement learning

Learning in the Presence of Multiple Agents

Modeling and reinforcement learning in partially observable many-agent systems

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Budgeted Knowledge Transfer for State-Wise Heterogeneous RL Agents

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

S\(^{2}\)ES: a stationary and scalable knowledge transfer approach for multiagent reinforcement learning

Learning in the Presence of Multiple Agents

Modeling and reinforcement learning in partially observable many-agent systems

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation