Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

Zang, Hongyu; Li, Xin; Zhang, Leiji; Liu, Yang; Sun, Baigui; Islam, Riashat; Combes, Remi Tachet des; Laroche, Romain

Computer Science > Machine Learning

arXiv:2310.17139 (cs)

[Submitted on 26 Oct 2023]

Title:Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

Authors:Hongyu Zang, Xin Li, Leiji Zhang, Yang Liu, Baigui Sun, Riashat Islam, Remi Tachet des Combes, Romain Laroche

View PDF

Abstract:While bisimulation-based approaches hold promise for learning robust state representations for Reinforcement Learning (RL) tasks, their efficacy in offline RL tasks has not been up to par. In some instances, their performance has even significantly underperformed alternative methods. We aim to understand why bisimulation methods succeed in online settings, but falter in offline tasks. Our analysis reveals that missing transitions in the dataset are particularly harmful to the bisimulation principle, leading to ineffective estimation. We also shed light on the critical role of reward scaling in bounding the scale of bisimulation measurements and of the value error they induce. Based on these findings, we propose to apply the expectile operator for representation learning to our offline RL setting, which helps to prevent overfitting to incomplete data. Meanwhile, by introducing an appropriate reward scaling strategy, we avoid the risk of feature collapse in representation space. We implement these recommendations on two state-of-the-art bisimulation-based algorithms, MICo and SimSR, and demonstrate performance gains on two benchmark suites: D4RL and Visual D4RL. Codes are provided at \url{this https URL}.

Comments:	NeurIPS 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.17139 [cs.LG]
	(or arXiv:2310.17139v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.17139

Submission history

From: Hongyu Zang [view email]
[v1] Thu, 26 Oct 2023 04:20:55 UTC (734 KB)

Computer Science > Machine Learning

Title:Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators