Self-Contradictory Reasoning Evaluation and Detection

Liu, Ziyi; Sanyal, Soumya; Lee, Isabelle; Du, Yongkang; Gupta, Rahul; Liu, Yang; Zhao, Jieyu

arXiv:2311.09603 (cs)

[Submitted on 16 Nov 2023 (v1), last revised 21 Oct 2024 (this version, v4)]

Abstract: In a plethora of recent work, large language models (LLMs) demonstrated impressive reasoning ability, but many proposed downstream reasoning tasks only focus on final answers. Two fundamental questions persist: 1) how consistent is the reasoning, and 2) can models detect unreliable reasoning? In this paper, we investigate self-contradictory (Self-Contra) reasoning, where the model reasoning does not support its answers. To answer 1), we define and assess the Self-Contra rate across three datasets and delve into finer-grained categories of Self-Contra reasoning. We find that LLMs often contradict themselves in reasoning tasks involving contextual information understanding or commonsense. The model may generate correct answers by taking shortcuts in reasoning or overlooking contextual evidence, leading to compromised reasoning. For 2), we task the state-of-the-art model GPT-4 with identifying Self-Contra reasoning and finer-grained fallacies. We find that finer-grained categories enhanced detection can improve GPT-4's ability to detect Self-Contra. However, it is only able to detect Self-Contra with a 52.2% F1 score, much lower compared to 66.7% for humans. Our results indicate that current LLMs lack the robustness necessary for reliable reasoning and we emphasize the urgent need for establishing best practices in comprehensive reasoning evaluations beyond pure performance-based metrics.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2311.09603 [cs.CL]
	(or arXiv:2311.09603v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09603

Submission history

From: Ziyi Liu
[v1] Thu, 16 Nov 2023 06:22:17 UTC (7,846 KB)
[v2] Mon, 19 Feb 2024 18:01:56 UTC (8,630 KB)
[v3] Sat, 5 Oct 2024 04:17:27 UTC (8,902 KB)
[v4] Mon, 21 Oct 2024 04:16:09 UTC (8,902 KB)
