[2401.01879] Theoretical guarantees on the best-of-n alignment policy