Towards Understanding Omission in Dialogue Summarization

Zou, Yicheng; Song, Kaitao; Tan, Xu; Fu, Zhongkai; Zhang, Qi; Li, Dongsheng; Gui, Tao

Computer Science > Computation and Language

arXiv:2211.07145 (cs)

[Submitted on 14 Nov 2022 (v1), last revised 11 May 2023 (this version, v2)]

Title:Towards Understanding Omission in Dialogue Summarization

Authors:Yicheng Zou, Kaitao Song, Xu Tan, Zhongkai Fu, Qi Zhang, Dongsheng Li, Tao Gui

View PDF

Abstract:Dialogue summarization aims to condense the lengthy dialogue into a concise summary, and has recently achieved significant progress. However, the result of existing methods is still far from satisfactory. Previous works indicated that omission is a major factor in affecting the quality of summarization, but few of them have further explored the omission problem, such as how omission affects summarization results and how to detect omission, which is critical for reducing omission and improving summarization quality. Moreover, analyzing and detecting omission relies on summarization datasets with omission labels (i.e., which dialogue utterances are omitted in the summarization), which are not available in the current literature. In this paper, we propose the OLDS dataset, which provides high-quality Omission Labels for Dialogue Summarization. By analyzing this dataset, we find that a large improvement in summarization quality can be achieved by providing ground-truth omission labels for the summarization model to recover omission information, which demonstrates the importance of omission detection for omission mitigation in dialogue summarization. Therefore, we formulate an omission detection task and demonstrate our proposed dataset can support the training and evaluation of this task well. We also call for research action on omission detection based on our proposed datasets. Our dataset and codes are publicly available.

Comments:	ACL 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2211.07145 [cs.CL]
	(or arXiv:2211.07145v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2211.07145

Submission history

From: Yicheng Zou [view email]
[v1] Mon, 14 Nov 2022 06:56:59 UTC (265 KB)
[v2] Thu, 11 May 2023 13:26:02 UTC (265 KB)

Computer Science > Computation and Language

Title:Towards Understanding Omission in Dialogue Summarization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Towards Understanding Omission in Dialogue Summarization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators