This paper takes the following view of #explainability: First train a model, and at test time, for a particular test instance, estimate the prediction Y, and then estimate the explanation E. Given this view, the paper uses the #PotentialOutcomes framework of #causality to understand the relationship between #prediction and explainability. Going from low to high performance of Y, the influence of Y on E is high, low, then high again.
#MachineLearning

https://arxiv.org/abs/2212.06925

On the Relationship Between Explanation and Prediction: A Causal View

Being able to provide explanations for a model's decision has become a central requirement for the development, deployment, and adoption of machine learning models. However, we are yet to understand what explanation methods can and cannot do. How do upstream factors such as data, model prediction, hyperparameters, and random initialization influence downstream explanations? While previous work raised concerns that explanations (E) may have little relationship with the prediction (Y), there is a lack of conclusive study to quantify this relationship. Our work borrows tools from causal inference to systematically assay this relationship. More specifically, we study the relationship between E and Y by measuring the treatment effect when intervening on their causal ancestors, i.e., on hyperparameters and inputs used to generate saliency-based Es or Ys. Our results suggest that the relationships between E and Y is far from ideal. In fact, the gap between 'ideal' case only increase in higher-performing models -- models that are likely to be deployed. Our work is a promising first step towards providing a quantitative measure of the relationship between E and Y, which could also inform the future development of methods for E with a quantitative metric.

arXiv.org

#introduction

Hi,

I'm Joseph Bulbulia (Joe).

I teach #QuantitativePsychology at Victoria University of Wellington.

I serve on the leadership team of the New Zealand Attitudes and Values Study.

I supervise graduate students interested in #longitudinal methods for national-scale #paneldata.

My substantive research interests are in the psychology of religion, cultural evolution, moral psychology, well-being, and more recently #climatepsychology.

Big fan of the #potentialoutcomes framework for #causalinference.

Not a big fan of #prediction and #associations in #psychology.

I migrated to qoto.org because of its nice interface and science community.*

-- Joe

*Also, I think I can write things like

\[E(Y^1) - E(Y^0) \neq 0\]