[2311.05846] Clipped-Objective Policy Gradients for Pessimistic Policy Optimization