[2206.05357] Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning