[2312.08710] Gradient Informed Proximal Policy Optimization