2000 character limit reached
Transformer-based Variable-rate Image Compression with Region-of-interest Control (2305.10807v3)
Published 18 May 2023 in eess.IV and cs.CV
Abstract: This paper proposes a transformer-based learned image compression system. It is capable of achieving variable-rate compression with a single model while supporting the region-of-interest (ROI) functionality. Inspired by prompt tuning, we introduce prompt generation networks to condition the transformer-based autoencoder of compression. Our prompt generation networks generate content-adaptive tokens according to the input image, an ROI mask, and a rate parameter. The separation of the ROI mask and the rate parameter allows an intuitive way to achieve variable-rate and ROI coding simultaneously. Extensive experiments validate the effectiveness of our proposed method and confirm its superiority over the other competing methods.
- Chia-Hao Kao (7 papers)
- Ying-Chieh Weng (2 papers)
- Yi-Hsin Chen (37 papers)
- Wei-Chen Chiu (54 papers)
- Wen-Hsiao Peng (39 papers)