[2206.01079] When does return-conditioned supervised learning work for offline reinforcement learning?