Analyzing Explainer Robustness via Probabilistic Lipschitzness of Prediction Functions

Khan, Zulqarnain; Hill, Davin; Masoomi, Aria; Bone, Joshua; Dy, Jennifer

Computer Science > Machine Learning

arXiv:2206.12481 (cs)

[Submitted on 24 Jun 2022 (v1), last revised 16 Apr 2024 (this version, v3)]

Title:Analyzing Explainer Robustness via Probabilistic Lipschitzness of Prediction Functions

Authors:Zulqarnain Khan, Davin Hill, Aria Masoomi, Joshua Bone, Jennifer Dy

View PDF HTML (experimental)

Abstract:Machine learning methods have significantly improved in their predictive capabilities, but at the same time they are becoming more complex and less transparent. As a result, explainers are often relied on to provide interpretability to these black-box prediction models. As crucial diagnostics tools, it is important that these explainers themselves are robust. In this paper we focus on one particular aspect of robustness, namely that an explainer should give similar explanations for similar data inputs. We formalize this notion by introducing and defining explainer astuteness, analogous to astuteness of prediction functions. Our formalism allows us to connect explainer robustness to the predictor's probabilistic Lipschitzness, which captures the probability of local smoothness of a function. We provide lower bound guarantees on the astuteness of a variety of explainers (e.g., SHAP, RISE, CXPlain) given the Lipschitzness of the prediction function. These theoretical results imply that locally smooth prediction functions lend themselves to locally robust explanations. We evaluate these results empirically on simulated as well as real datasets.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2206.12481 [cs.LG]
	(or arXiv:2206.12481v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.12481

Submission history

From: Zulqarnain Khan [view email]
[v1] Fri, 24 Jun 2022 19:43:33 UTC (2,230 KB)
[v2] Thu, 27 Jul 2023 14:06:50 UTC (6,062 KB)
[v3] Tue, 16 Apr 2024 16:27:15 UTC (3,273 KB)

Computer Science > Machine Learning

Title:Analyzing Explainer Robustness via Probabilistic Lipschitzness of Prediction Functions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Analyzing Explainer Robustness via Probabilistic Lipschitzness of Prediction Functions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators