[2405.20541] Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models