[2407.14477] Data-Centric Human Preference Optimization with Rationales