[2307.04571] Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation