PromptIR: Prompting for All-in-One Blind Image Restoration (2306.13090v1)
Abstract: Image restoration involves recovering a high-quality clean image from its degraded version. Deep learning-based methods have significantly improved image restoration performance, however, they have limited generalization ability to different degradation types and levels. This restricts their real-world application since it requires training individual models for each specific degradation and knowing the input degradation type to apply the relevant model. We present a prompt-based learning approach, PromptIR, for All-In-One image restoration that can effectively restore images from various types and levels of degradation. In particular, our method uses prompts to encode degradation-specific information, which is then used to dynamically guide the restoration network. This allows our method to generalize to different degradation types and levels, while still achieving state-of-the-art results on image denoising, deraining, and dehazing. Overall, PromptIR offers a generic and efficient plugin module with few lightweight prompts that can be used to restore images of various types and levels of degradation with no prior information on the corruptions present in the image. Our code and pretrained models are available here: https://github.com/va1shn9v/PromptIR
- Contour detection and hierarchical image segmentation. TPAMI.
- Layer normalization. arXiv:1607.06450.
- Language models are few-shot learners. arXiv:2005.14165.
- Dehazenet: An end-to-end system for single image haze removal. IEEE Transactions on Image Processing 25(11), 5187–5198.
- End-to-end object detection with transformers. In ECCV.
- Pre-trained image processing transformer. In CVPR, pp. 12299–12310.
- Pre-trained image processing transformer. In CVPR.
- Simple baselines for image restoration. In ECCV.
- Activating more pixels in image super-resolution transformer. arxiv 2022. arXiv preprint arXiv:2205.04437.
- Blind image super-resolution with spatially variant degradations. ACM Transactions on Graphics (TOG) 38(6), 1–13.
- Color image denoising via sparse 3d collaborative filtering with grouping constraint in luminance-chrominance space. In 2007 IEEE International Conference on Image Processing, Volume 1, pp. I–313. IEEE.
- Multi-scale boosted dehazing network with dense feature fusion. In CVPR.
- Image deblurring and super-resolution by adaptive sparse domain selection and adaptive regularization. TIP.
- Fd-gan: Generative adversarial networks with fusion-discriminator for single image dehazing. In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 34, pp. 10729–10736.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929.
- A general decoupled learning framework for parameterized image operators. IEEE transactions on pattern analysis and machine intelligence 43(1), 33–47.
- Dynamic scene deblurring with parameter selective sharing and nested skip connections. In CVPR, pp. 3848–3856.
- Visual prompt tuning for test-time domain adaptation. arXiv preprint arXiv:2210.04831.
- Single image haze removal using dark channel prior. TPAMI.
- Hyperprompt: Prompt-based task-conditioning of transformers. In ICML, pp. 8678–8690. PMLR.
- Parameter-efficient transfer learning for nlp. In ICML, pp. 2790–2799. PMLR.
- Single image super-resolution from transformed self-exemplars. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5197–5206.
- Visual prompt tuning. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXIII, pp. 709–727. Springer.
- Multi-scale progressive fusion network for single image deraining. In CVPR, pp. 8346–8355.
- Transformers in vision: A survey. ACM computing surveys (CSUR) 54(10s), 1–41.
- Maple: Multi-modal prompt learning. CVPR.
- Single-image super-resolution using sparse regression and natural image prior. TPAMI.
- Deep photo: Model-based photograph enhancement and viewing. ACM TOG.
- All-in-one image restoration for unknown corruption. In CVPR, pp. 17452–17462.
- Aod-net: All-in-one dehazing network. In ICCV, pp. 4770–4778.
- Benchmarking single-image dehazing and beyond. IEEE Transactions on Image Processing 28(1), 492–505.
- All in one bad weather removal using architectural search. In CVPR, pp. 3175–3185.
- On efficient transformer and image pre-training for low-level vision. arXiv preprint arXiv:2112.10175.
- Prefix-tuning: Optimizing continuous prompts for generation. arXiv preprint arXiv:2101.00190.
- SwinIR: Image restoration using swin transformer. In ICCV Workshops.
- Trident dehazing network. In CVPR Workshops.
- Tape: Task-agnostic prior embedding for image restoration. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XVIII, pp. 447–464. Springer.
- Swin transformer: Hierarchical vision transformer using shifted windows. arXiv:2103.14030.
- Learning the degradation distribution for blind image super-resolution. In CVPR, pp. 6063–6072.
- Waterloo exploration database: New challenges for image quality assessment models. TIP.
- A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In ICCV.
- Nonparametric blind super-resolution. In ICCV.
- Clean images are hard to reblur: Exploiting the ill-posed inverse task for dynamic scene deblurring. In ICLR.
- Enhanced pix2pix dehazing network. In CVPR, pp. 8160–8168.
- Adaptive consistency prior based deep network for image denoising. In CVPR.
- Single image dehazing via multi-scale convolutional neural networks. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, pp. 154–169. Springer.
- Single image dehazing via multi-scale convolutional neural networks with holistic edges. IJCV.
- Multitask prompted training enables zero-shot task generalization. arXiv preprint arXiv:2110.08207.
- Coda-prompt: Continual decomposed attention-based prompting for rehearsal-free continual learning. arXiv preprint arXiv:2211.13218.
- Visual prompt tuning for generative transfer learning. arXiv preprint arXiv:2210.00990.
- Image denoising using deep cnn with batch renormalization. Neural Networks.
- Anchored neighborhood regression for fast example-based super-resolution. In ICCV.
- Training data-efficient image transformers & distillation through attention. In ICML.
- BANet: A blur-aware attention network for dynamic scene deblurring. IEEE Transactions on Image Processing.
- MAXIM: Multi-axis MLP for image processing. In CVPR, pp. 5769–5780.
- Transweather: Transformer-based restoration of images degraded by adverse weather conditions. In CVPR, pp. 2353–2363.
- Attention is all you need. In NeurIPS.
- Uformer: A general u-shaped transformer for image restoration. arXiv:2106.03106.
- Multitask prompt tuning enables parameter-efficient transfer learning. arXiv preprint arXiv:2303.02861.
- Dualprompt: Complementary prompting for rehearsal-free continual learning. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXVI, pp. 631–648. Springer.
- Learning to prompt for continual learning. In CVPR, pp. 139–149.
- Semi-supervised transfer learning for image rain removal. In CVPR.
- Segformer: Simple and efficient design for semantic segmentation with transformers. arXiv:2105.15203.
- Learning texture transformer network for image super-resolution. In CVPR.
- Yasarla, R. and V. M. Patel (2019). Uncertainty guided multi-scale residual learning-using a cycle spinning cnn for single image de-raining. In CVPR.
- Tokens-to-token vit: Training vision transformers from scratch on imagenet. arXiv:2101.11986.
- Restormer: Efficient transformer for high-resolution image restoration. In CVPR.
- CycleISP: Real image restoration via improved data synthesis. In CVPR.
- Learning enriched features for real image restoration and enhancement. In ECCV.
- Multi-stage progressive image restoration. In CVPR.
- Zhang, H. and V. M. Patel (2018). Density-aware single image de-raining using a multi-stream dense network. In CVPR.
- Designing a practical degradation model for deep blind image super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 4791–4800.
- Deblurring by realistic blurring. In CVPR, pp. 2737–2746.
- Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE transactions on image processing 26(7), 3142–3155.
- Learning deep CNN denoiser prior for image restoration. In CVPR.
- Ffdnet: Toward a fast and flexible solution for cnn-based image denoising. IEEE Transactions on Image Processing 27(9), 4608–4622.
- Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In CVPR.
- Learning to prompt for vision-language models. International Journal of Computer Vision (IJCV).
- Deformable detr: Deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159.
- Vaishnav Potlapalli (3 papers)
- Syed Waqas Zamir (20 papers)
- Salman Khan (244 papers)
- Fahad Shahbaz Khan (225 papers)