[2312.16267] Maximizing the Success Probability of Policy Allocations in Online Systems