@inproceedings{garneau-lamontagne-2023-guided,
title = "Guided Beam Search to Improve Generalization in Low-Resource Data-to-Text Generation",
author = "Garneau, Nicolas and
Lamontagne, Luc",
editor = "Keet, C. Maria and
Lee, Hung-Yi and
Zarrie{\ss}, Sina",
booktitle = "Proceedings of the 16th International Natural Language Generation Conference",
month = sep,
year = "2023",
address = "Prague, Czechia",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.inlg-main.1",
doi = "10.18653/v1/2023.inlg-main.1",
pages = "1--14",
abstract = "In this paper, we introduce a new beam search algorithm that improves the generalization of neural generators to unseen examples, especially in low-resource data-to-text settings. Our algorithm aims to reduce the number of omissions and hallucinations during the decoding process. For this purpose, it relies on two regression models to explicitly characterize factual errors. We explain how to create a new dataset to train these models given an original training set of less than a thousand data points. We apply our approach in the low-resource, legal setting using the French Plum2Text dataset, as well as in English using WebNLG. We observe in our experiment that this combination improves the faithfulness of pre-trained neural text generators using both human and automatic evaluation. Moreover, our approach offers a level of interpretability by predicting the number of omissions and hallucinations present in a given generation with respect to the input data. Finally, we visualize our algorithm{'}s exploration of the hypothesis space at different steps during the decoding process.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="garneau-lamontagne-2023-guided">
    <titleInfo>
      <title>Guided Beam Search to Improve Generalization in Low-Resource Data-to-Text Generation</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Nicolas</namePart>
      <namePart type="family">Garneau</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Luc</namePart>
      <namePart type="family">Lamontagne</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2023-09</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 16th International Natural Language Generation Conference</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">C</namePart>
        <namePart type="given">Maria</namePart>
        <namePart type="family">Keet</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Hung-Yi</namePart>
        <namePart type="family">Lee</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Sina</namePart>
        <namePart type="family">Zarrieß</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Prague, Czechia</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>In this paper, we introduce a new beam search algorithm that improves the generalization of neural generators to unseen examples, especially in low-resource data-to-text settings. Our algorithm aims to reduce the number of omissions and hallucinations during the decoding process. For this purpose, it relies on two regression models to explicitly characterize factual errors. We explain how to create a new dataset to train these models given an original training set of less than a thousand data points. We apply our approach in the low-resource, legal setting using the French Plum2Text dataset, as well as in English using WebNLG. We observe in our experiment that this combination improves the faithfulness of pre-trained neural text generators using both human and automatic evaluation. Moreover, our approach offers a level of interpretability by predicting the number of omissions and hallucinations present in a given generation with respect to the input data. Finally, we visualize our algorithm’s exploration of the hypothesis space at different steps during the decoding process.</abstract>
    <identifier type="citekey">garneau-lamontagne-2023-guided</identifier>
    <identifier type="doi">10.18653/v1/2023.inlg-main.1</identifier>
    <location>
      <url>https://aclanthology.org/2023.inlg-main.1</url>
    </location>
    <part>
      <date>2023-09</date>
      <extent unit="page">
        <start>1</start>
        <end>14</end>
      </extent>
    </part>
  </mods>
</modsCollection>
%0 Conference Proceedings
%T Guided Beam Search to Improve Generalization in Low-Resource Data-to-Text Generation
%A Garneau, Nicolas
%A Lamontagne, Luc
%Y Keet, C. Maria
%Y Lee, Hung-Yi
%Y Zarrieß, Sina
%S Proceedings of the 16th International Natural Language Generation Conference
%D 2023
%8 September
%I Association for Computational Linguistics
%C Prague, Czechia
%F garneau-lamontagne-2023-guided
%X In this paper, we introduce a new beam search algorithm that improves the generalization of neural generators to unseen examples, especially in low-resource data-to-text settings. Our algorithm aims to reduce the number of omissions and hallucinations during the decoding process. For this purpose, it relies on two regression models to explicitly characterize factual errors. We explain how to create a new dataset to train these models given an original training set of less than a thousand data points. We apply our approach in the low-resource, legal setting using the French Plum2Text dataset, as well as in English using WebNLG. We observe in our experiment that this combination improves the faithfulness of pre-trained neural text generators using both human and automatic evaluation. Moreover, our approach offers a level of interpretability by predicting the number of omissions and hallucinations present in a given generation with respect to the input data. Finally, we visualize our algorithm’s exploration of the hypothesis space at different steps during the decoding process.
%R 10.18653/v1/2023.inlg-main.1
%U https://aclanthology.org/2023.inlg-main.1
%U https://doi.org/10.18653/v1/2023.inlg-main.1
%P 1-14
Markdown (Informal)
[Guided Beam Search to Improve Generalization in Low-Resource Data-to-Text Generation](https://aclanthology.org/2023.inlg-main.1) (Garneau & Lamontagne, INLG-SIGDIAL 2023)
ACL
Nicolas Garneau and Luc Lamontagne. 2023. Guided Beam Search to Improve Generalization in Low-Resource Data-to-Text Generation. In Proceedings of the 16th International Natural Language Generation Conference, pages 1–14, Prague, Czechia. Association for Computational Linguistics.
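
The abstract describes the decoding mechanism only at a high level: two regression models estimate the number of omissions and hallucinations in a hypothesis, and those estimates guide the beam toward more faithful generations. Below is a minimal Python sketch of how such guidance could be wired into one beam search step. The function names (`predict_omissions`, `predict_hallucinations`), the linear penalty weighting via `alpha` and `beta`, and the overall structure are illustrative assumptions, not the authors' implementation from the paper.

```python
# Illustrative sketch of one step of a guided beam search. The two
# regression models and the linear penalty weighting are assumptions
# for illustration; they are NOT the paper's exact formulation.
import heapq
from typing import Callable, List, Tuple

Hypothesis = Tuple[float, List[int]]  # (guided score, token ids so far)

def guided_beam_step(
    beams: List[Hypothesis],
    step_logprobs: Callable[[List[int]], List[Tuple[int, float]]],
    predict_omissions: Callable[[List[int]], float],       # regression model 1
    predict_hallucinations: Callable[[List[int]], float],  # regression model 2
    beam_width: int = 5,
    alpha: float = 1.0,  # weight on predicted omissions (assumed)
    beta: float = 1.0,   # weight on predicted hallucinations (assumed)
) -> List[Hypothesis]:
    """Expand each hypothesis with the generator's next-token log-probs,
    then rerank by log-probability minus penalties proportional to the
    predicted number of factual errors in the partial generation."""
    candidates: List[Hypothesis] = []
    for score, tokens in beams:
        for token, logprob in step_logprobs(tokens):
            new_tokens = tokens + [token]
            guided = (
                score + logprob
                - alpha * predict_omissions(new_tokens)
                - beta * predict_hallucinations(new_tokens)
            )
            candidates.append((guided, new_tokens))
    # Keep the top-k hypotheses under the guided score.
    return heapq.nlargest(beam_width, candidates, key=lambda c: c[0])
```

Under this reading, the error predictors act as rescoring terms at each step rather than as a post-hoc filter, which is also what lets the approach report interpretable per-hypothesis omission and hallucination estimates.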