Granger-causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks (1802.02195v6)

Published 6 Feb 2018 in cs.LG, cs.AI, and cs.NE

Abstract: Knowledge of the importance of input features towards decisions made by machine-learning models is essential to increase our understanding of both the models and the underlying data. Here, we present a new approach to estimating feature importance with neural networks based on the idea of distributing the features of interest among experts in an attentive mixture of experts (AME). AMEs use attentive gating networks trained with a Granger-causal objective to learn to jointly produce accurate predictions as well as estimates of feature importance in a single model. Our experiments show (i) that the feature importance estimates provided by AMEs compare favourably to those provided by state-of-the-art methods, (ii) that AMEs are significantly faster at estimating feature importance than existing methods, and (iii) that the associations discovered by AMEs are consistent with those reported by domain experts.
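
To make the architecture concrete, below is a minimal sketch of an attentive mixture of experts for feature importance, loosely following the abstract's description: one expert per input feature, an attentive gating network whose softmax weights double as importance estimates, and an auxiliary term that nudges those weights towards each expert's error reduction. The layer sizes, the `lambda_aux` weight, and the exact form of the auxiliary term are illustrative assumptions, not the authors' implementation.

```python
# Hedged sketch of an attentive mixture of experts (AME); hyperparameters
# and the auxiliary loss are assumptions, not the paper's reference code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AME(nn.Module):
    def __init__(self, num_features: int, hidden: int = 16):
        super().__init__()
        # One small expert network per input feature.
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(1, hidden), nn.ReLU(), nn.Linear(hidden, hidden))
            for _ in range(num_features)
        ])
        # Attentive gating: scores each expert's hidden state.
        self.attention = nn.Linear(hidden, 1)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):
        # x: (batch, num_features); each expert sees exactly one feature.
        hidden = torch.stack(
            [e(x[:, i:i + 1]) for i, e in enumerate(self.experts)], dim=1
        )                                            # (batch, F, hidden)
        scores = self.attention(hidden).squeeze(-1)  # (batch, F)
        alpha = torch.softmax(scores, dim=1)         # feature-importance estimates
        mixed = (alpha.unsqueeze(-1) * hidden).sum(dim=1)
        return self.head(mixed).squeeze(-1), alpha, hidden

def ame_loss(model, x, y, lambda_aux=0.1):
    """Prediction loss plus an auxiliary term aligning the attention weights
    with each expert's contribution to error reduction (a rough stand-in for
    the paper's Granger-causal objective, included for illustration only)."""
    pred, alpha, hidden = model(x)
    pred_loss = F.mse_loss(pred, y)
    with torch.no_grad():
        full_err = (pred - y) ** 2
        deltas = []
        for i in range(hidden.shape[1]):
            # Withhold expert i, renormalize the remaining attention weights.
            mask = torch.ones_like(alpha)
            mask[:, i] = 0.0
            a = mask * alpha
            a = a / a.sum(dim=1, keepdim=True).clamp_min(1e-8)
            without = model.head((a.unsqueeze(-1) * hidden).sum(dim=1)).squeeze(-1)
            deltas.append(((without - y) ** 2 - full_err).clamp_min(0.0))
        target = torch.stack(deltas, dim=1)
        target = target / target.sum(dim=1, keepdim=True).clamp_min(1e-8)
    aux_loss = F.mse_loss(alpha, target)
    return pred_loss + lambda_aux * aux_loss
```

After training, averaging `alpha` over a held-out set gives per-feature importance scores from the same forward pass that produces the prediction, which is why the abstract reports a speed advantage over post-hoc attribution methods.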

Authors (3)
  1. Patrick Schwab (27 papers)
  2. Djordje Miladinovic (2 papers)
  3. Walter Karlen (15 papers)
Citations (53)
