
LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multi-modal Foundation Models (2406.14862v4)

Published 21 Jun 2024 in cs.LG, cs.CL, and cs.CV

Abstract: Deep generative models like VAEs and diffusion models have advanced various generation tasks by leveraging latent variables to learn data distributions and generate high-quality samples. Despite the field of explainable AI making strides in interpreting machine learning models, understanding latent variables in generative models remains challenging. This paper introduces LatentExplainer, a framework for automatically generating semantically meaningful explanations of latent variables in deep generative models. LatentExplainer tackles three main challenges: inferring the meaning of latent variables, aligning explanations with inductive biases, and handling varying degrees of explainability. Our approach perturbs latent variables, interprets the resulting changes in generated data, and uses multimodal large language models (MLLMs) to produce human-understandable explanations. We evaluate our proposed method on several real-world and synthetic datasets, and the results demonstrate superior performance in generating high-quality explanations for latent variables. The results highlight the effectiveness of incorporating inductive biases and uncertainty quantification, significantly enhancing model interpretability.
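The perturbation step described in the abstract can be illustrated with a minimal sketch: sweep one latent dimension while holding the others fixed, decode each variant, and collect the outputs that would be passed to an MLLM for explanation. The function names and the toy decoder below are hypothetical stand-ins, not the paper's actual implementation.

```python
import numpy as np

def perturb_latent(z, dim, deltas):
    """Return copies of latent vector z with dimension `dim` shifted by each delta."""
    variants = []
    for d in deltas:
        z_new = z.copy()
        z_new[dim] += d
        variants.append(z_new)
    return variants

def sweep_dimension(decoder, z, dim, deltas=(-2.0, -1.0, 0.0, 1.0, 2.0)):
    """Decode a sweep along one latent dimension. In LatentExplainer-style
    pipelines, the decoded samples (plus a prompt encoding the model's
    inductive bias) would be sent to a multimodal LLM for explanation."""
    return [decoder(zv) for zv in perturb_latent(z, dim, deltas)]

# Toy decoder standing in for a trained VAE or diffusion decoder (hypothetical).
decoder = lambda z: np.tanh(z)

z0 = np.zeros(4)
samples = sweep_dimension(decoder, z0, dim=1)
```

Only dimension 1 varies across the decoded samples, so any consistent visual change in them can be attributed to that latent variable.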

Authors (6)
  1. Mengdan Zhu (6 papers)
  2. Raasikh Kanjiani (2 papers)
  3. Jiahui Lu (3 papers)
  4. Andrew Choi (9 papers)
  5. Qirui Ye (1 paper)
  6. Liang Zhao (353 papers)
