[2205.05212] A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning