[2404.08683] Text clustering applied to data augmentation in legal contexts