ExplorerArtificial IntelligenceAI
Research PaperResearchia:202605.08011

Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

Ronaldo Canizales

Abstract

Concept-based explanations offer a promising approach for explaining the predictions of deep neural networks in terms of high-level, human-understandable concepts. However, existing methods either do not establish a causal connection between the concepts and model predictions or are limited in expressivity and only able to infer causal explanations involving single concepts. At the same time, the parallel line of work on formal abductive and contrastive explanations computes the minimal set of i...

Submitted: May 8, 2026Subjects: AI; Artificial Intelligence

Description / Details

Concept-based explanations offer a promising approach for explaining the predictions of deep neural networks in terms of high-level, human-understandable concepts. However, existing methods either do not establish a causal connection between the concepts and model predictions or are limited in expressivity and only able to infer causal explanations involving single concepts. At the same time, the parallel line of work on formal abductive and contrastive explanations computes the minimal set of input features causally relevant for model outcomes but only considers low-level features such as pixels. Merging these two threads, in this work, we propose the notion of concept-based abductive and contrastive explanations that capture the minimal sets of high-level concepts causally relevant for model outcomes. We then present a family of algorithms that enumerate all minimal explanations while using concept erasure procedures to establish causal relationships. By appropriately aggregating such explanations, we are not only able to understand model predictions on individual images but also on collections of images where the model exhibits a user-specified, common behavior. We evaluate our approach on multiple models, datasets, and behaviors, and demonstrate its effectiveness in computing helpful, user-friendly explanations.


Source: arXiv:2605.06640v1 - http://arxiv.org/abs/2605.06640v1 PDF: https://arxiv.org/pdf/2605.06640v1 Original Link: http://arxiv.org/abs/2605.06640v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
May 8, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark
Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models | Researchia