Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Transformer-based Variable-rate Image Compression with Region-of-interest Control (2305.10807v3)

Published 18 May 2023 in eess.IV and cs.CV

Abstract: This paper proposes a transformer-based learned image compression system. It is capable of achieving variable-rate compression with a single model while supporting the region-of-interest (ROI) functionality. Inspired by prompt tuning, we introduce prompt generation networks to condition the transformer-based autoencoder of compression. Our prompt generation networks generate content-adaptive tokens according to the input image, an ROI mask, and a rate parameter. The separation of the ROI mask and the rate parameter allows an intuitive way to achieve variable-rate and ROI coding simultaneously. Extensive experiments validate the effectiveness of our proposed method and confirm its superiority over the other competing methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Chia-Hao Kao (7 papers)
  2. Ying-Chieh Weng (2 papers)
  3. Yi-Hsin Chen (37 papers)
  4. Wei-Chen Chiu (54 papers)
  5. Wen-Hsiao Peng (39 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.