Matting by Generation (2407.21017v1)
Abstract: This paper introduces an innovative approach for image matting that redefines the traditional regression-based task as a generative modeling challenge. Our method harnesses the capabilities of latent diffusion models, enriched with extensive pre-trained knowledge, to regularize the matting process. We present architectural innovations that empower our model to produce mattes with superior resolution and detail. The proposed method is versatile and can perform both guidance-free and guidance-based image matting, accommodating a variety of additional cues. Our comprehensive evaluation across three benchmark datasets demonstrates the superior performance of our approach, both quantitatively and qualitatively. The results not only reflect our method's robust effectiveness but also highlight its ability to generate visually compelling mattes that approach photorealistic quality. The project page for this paper is available at https://lightchaserx.github.io/matting-by-generation/