Variantional autoencoder with decremental information bottleneck for disentanglement (2303.12959v2)
Abstract: One major challenge of disentanglement learning with variational autoencoders is the trade-off between disentanglement and reconstruction fidelity. Previous studies, which increase the information bottleneck during training, tend to lose the constraint of disentanglement, leading to the information diffusion problem. In this paper, we present a novel framework for disentangled representation learning, DeVAE, which utilizes hierarchical latent spaces with decreasing information bottlenecks across these spaces. The key innovation of our approach lies in connecting the hierarchical latent spaces through disentanglement-invariant transformations, allowing the sharing of disentanglement properties among spaces while maintaining an acceptable level of reconstruction performance. We demonstrate the effectiveness of DeVAE in achieving a balance between disentanglement and reconstruction through a series of experiments and ablation studies on dSprites and Shapes3D datasets. Code is available at https://github.com/erow/disentanglement_lib/tree/pytorch#devae.
- Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8):1798–1828, 2013.
- 3d shapes dataset. https://github.com/deepmind/3dshapes-dataset/, 2018.
- Understanding disentangling in β𝛽\betaitalic_β-vae. In International Conference on Machine Learning (ICML), 2018.
- Isolating sources of disentanglement in variational autoencoders. In Neural Information Processing Systems (NeurIPS), 2018.
- Pierre Comon. Independent component analysis, a new concept? Signal processing, 36(3):287–314, 1994.
- Theory and evaluation metrics for learning disentangled representations. In International Conference on Learning Representations (ICLR), 2020.
- Cian Eastwood and Christopher K. I. Williams. A framework for the quantitative evaluation of disentangled representations. In International Conference on Learning Representations (ICLR), 2018.
- beta-vae: Learning basic visual concepts with a constrained variational framework. In International Conference on Learning Representations (ICLR), 2017.
- Towards a definition of disentangled representations. arXiv preprint arXiv:1812.02230, 2018.
- Learning discrete and continuous factors of data via alternating disentanglement. In International Conference on Machine Learning (ICML), 2019.
- Disentangling by factorising. In International Conference on Machine Learning (ICML), 2018.
- Adam: A method for stochastic optimization. In 3rd International Conference on Learning Representations (ICLR), 2015.
- Auto-encoding variational bayes. In International Conference on Learning Representations (ICLR), 2014.
- Improved variational inference with inverse autoregressive flow. Advances in neural information processing systems, 29, 2016.
- Deep learning face attributes in the wild. In ICCV, pages 3730–3738. IEEE Computer Society, 2015.
- dsprites: Disentanglement testing sprites dataset, 2017.
- Variational inference with normalizing flows. In International conference on machine learning, pages 1530–1538. PMLR, 2015.
- Jürgen Schmidhuber. Learning factorial codes by predictability minimization. Neural Computation, 4(6):863–879, 1992. ISSN 08997667.
- Claude Elwood Shannon. A mathematical theory of communication. Bell system technical journal, 27(3):379–423, 1948.
- Rethinking controllable variational autoencoders. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 19250–19259, 2022.
- The information bottleneck method. In In Proceedings of the 37-th Annual Allerton Conference on Communication, Control and Computing, 1999.
- Satosi Watanabe. Information theoretical analysis of multivariate correlation. IBM Journal of research and development, 4:66–82, 1960.
- Principal component analysis. Chemometrics and intelligent laboratory systems, 2(1-3):37–52, 1987.
- DEFT: distilling entangled factors by preventing information diffusion. Mach. Learn., 111(6):2275–2295, 2022.
- Unsupervised feature learning via non-parametric instance discrimination. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.