Alexpaca: Learning Factual Clarification Question Generation Without Examples

Toles, Matthew; Huang, Yukun; Yu, Zhou; Gravano, Luis

Computer Science > Computation and Language

arXiv:2310.11571 (cs)

[Submitted on 17 Oct 2023 (v1), last revised 11 Oct 2024 (this version, v3)]

Title:Alexpaca: Learning Factual Clarification Question Generation Without Examples

Authors:Matthew Toles, Yukun Huang, Zhou Yu, Luis Gravano

View PDF HTML (experimental)

Abstract:Real-life tasks such as giving legal or technical advice often lack complete context at the outset and can have disparate answers depending thereon. The ability to derive missing factual information by asking clarifying questions (ACQ) is an important element of real-life collaboration on such reasoning tasks. Existing factual clarification question challenges evaluate generations based on word overlap or human evaluations. Recent work explores generating a response to the clarifying question then evaluating its utility directly. So far, these tasks are limited to disambiguating the user's intent rather than concrete facts about the situation. The factual domain presents unique challenges since responses to clarification questions must be factually true for accurate evaluation. To enable evaluation of factual domain clarification question generation, We present a new task that focuses on the ability to elicit missing information in multi-hop reasoning tasks. The task, HotpotQA-FLM, can be evaluated automatically, making it convenient for benchmarking language models. We observe that humans outperform GPT-4 by a large margin, while Llama 3 8B Instruct does not even beat the dummy baseline in some metrics. Finally, we find by fine-tuning Llama 3 8B Instruct on its own generations, filtered via rejection sampling, we can improve information recovery by 27.6 percent.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2310.11571 [cs.CL]
	(or arXiv:2310.11571v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.11571

Submission history

From: Matthew Toles [view email]
[v1] Tue, 17 Oct 2023 20:40:59 UTC (490 KB)
[v2] Sun, 7 Jan 2024 21:01:55 UTC (501 KB)
[v3] Fri, 11 Oct 2024 22:37:24 UTC (2,605 KB)

Computer Science > Computation and Language

Title:Alexpaca: Learning Factual Clarification Question Generation Without Examples

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Alexpaca: Learning Factual Clarification Question Generation Without Examples

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators