Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 165 tok/s
Gemini 2.5 Pro 47 tok/s Pro
GPT-5 Medium 25 tok/s Pro
GPT-5 High 26 tok/s Pro
GPT-4o 81 tok/s Pro
Kimi K2 189 tok/s Pro
GPT OSS 120B 445 tok/s Pro
Claude Sonnet 4.5 35 tok/s Pro
2000 character limit reached

Perceptual Similarity guidance and text guidance optimization for Editing Real Images using Guided Diffusion Models (2312.06680v1)

Published 9 Dec 2023 in cs.CV

Abstract: When using a diffusion model for image editing, there are times when the modified image can differ greatly from the source. To address this, we apply a dual-guidance approach to maintain high fidelity to the original in areas that are not altered. First, we employ text-guided optimization, using text embeddings to direct latent space and classifier-free guidance. Second, we use perceptual similarity guidance, optimizing latent vectors with posterior sampling via Tweedie formula during the reverse process. This method ensures the realistic rendering of both the edited elements and the preservation of the unedited parts of the original image.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (16)
  1. Ilvr: Conditioning method for denoising diffusion probabilistic models, 2021.
  2. Diffedit: Diffusion-based semantic image editing with mask guidance, 2022.
  3. High-fidelity and arbitrary face editing, 2021.
  4. Prompt-to-prompt image editing with cross attention control, 2022.
  5. Clipscore: A reference-free evaluation metric for image captioning, 2022.
  6. Classifier-free diffusion guidance, 2022.
  7. Enhancing diffusion-based image synthesis with robust classifier guidance, 2023.
  8. Details or artifacts: A locally discriminative learning approach to realistic image super-resolution, 2022.
  9. Repaint: Inpainting using denoising diffusion probabilistic models, 2022.
  10. Sdedit: Guided image synthesis and editing with stochastic differential equations, 2022.
  11. Null-text inversion for editing real images using guided diffusion models, 2022.
  12. Lanit: Language-driven image-to-image translation for unlabeled data, 2023.
  13. High-resolution image synthesis with latent diffusion models, 2022.
  14. Knn-diffusion: Image generation via large-scale retrieval, 2022.
  15. Adaint: Learning adaptive intervals for 3d lookup tables on real-time image enhancement, 2022.
  16. The unreasonable effectiveness of deep features as a perceptual metric, 2018.

Summary

We haven't generated a summary for this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (1)

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube