Publication

InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

Martin Jaggi, Vinitra Swamy, Jibril Albachir Frej, Julian Thomas Blackwell
2024
Journal paper

Abstract

Interpretability for neural networks is a trade-off between three key requirements: 1) faithfulness of the explanation (i.e., how perfectly it explains the prediction), 2) understandability of the explanation by humans, and 3) model performance. Most existing methods compromise one or more of these requirements; e.g., post-hoc approaches provide limited faithfulness, automatically identified feature masks compromise understandability, and intrinsically interpretable methods such as decision trees limit model performance. These shortcomings are unacceptable for sensitive applications such as education and healthcare, which require trustworthy explanations, actionable interpretations, and accurate predictions. In this work, we present InterpretCC (interpretable conditional computation), a family of interpretable-by-design neural networks that guarantee human-centric interpretability, while maintaining comparable performance to state-of-the-art models by adaptively and sparsely activating features before prediction. We extend this idea into an interpretable, global mixture-of-experts (MoE) model that allows humans to specify topics of interest, discretely separates the feature space for each data point into topical subnetworks, and adaptively and sparsely activates these topical subnetworks for prediction. We apply variations of the InterpretCC architecture for text, time series and tabular data across several real-world benchmarks, demonstrating comparable performance with non-interpretable baselines, outperforming interpretable-by-design baselines, and showing higher actionability and usefulness according to a user study.

Official source

https://infoscience.epfl.ch/record/311556?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Martin Jaggi, Vinitra Swamy, Jibril Albachir Frej, Julian Thomas Blackwell
2024
Journal paper

Abstract

Official source

https://infoscience.epfl.ch/record/311556?ln=en

About this result

Ontological neighbourhood

Information engineering

Machine learning: Artificial neural networks

Related concepts (32)

Related publications (40)

Related MOOCs (23)

InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

Graph Chatbot

Chat with Graph Search

Task-driven neural network models predict neural dynamics of proprioception: Neural network model weights

Enabling Uncertainty Estimation in Iterative Neural Networks

Supervised learning and inference of spiking neural networks with temporal coding

Task-driven neural network models predict neural dynamics of proprioception: Neural network model weights

Enabling Uncertainty Estimation in Iterative Neural Networks

Supervised learning and inference of spiking neural networks with temporal coding