Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Image Cropping under Design Constraints (2310.08892v1)

Published 13 Oct 2023 in cs.CV

Abstract: Image cropping is essential in image editing for obtaining a compositionally enhanced image. In display media, image cropping is a prospective technique for automatically creating media content. However, image cropping for media contents is often required to satisfy various constraints, such as an aspect ratio and blank regions for placing texts or objects. We call this problem image cropping under design constraints. To achieve image cropping under design constraints, we propose a score function-based approach, which computes scores for cropped results whether aesthetically plausible and satisfies design constraints. We explore two derived approaches, a proposal-based approach, and a heatmap-based approach, and we construct a dataset for evaluating the performance of the proposed approaches on image cropping under design constraints. In experiments, we demonstrate that the proposed approaches outperform a baseline, and we observe that the proposal-based approach is better than the heatmap-based approach under the same computation cost, but the heatmap-based approach leads to better scores by increasing computation cost. The experimental results indicate that balancing aesthetically plausible regions and satisfying design constraints is not a trivial problem and requires sensitive balance, and both proposed approaches are reasonable alternatives.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. Optuna: A Next-generation Hyperparameter Optimization Framework. In Proc. of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
  2. Saliency based image cropping. In Proc. International Conference on Image Analysis and Processing (ICIAP). Springer, 773–782.
  3. CropNet: Real-time thumbnailing. In Proc. ACM International Conference on Multimedia (ACMMM). 81–89.
  4. Quantitative Analysis of Automatic Image Cropping Algorithms: A Dataset and Comparative Study. In Proc. IEEE Winter Conference on Applications of Computer Vision (WACV). 226–234.
  5. Learning to Compose with Professional Photographs on the Web. In Proc. ACM Conference on Multimedia (ACMMM). 37–45.
  6. Learning to photograph. In Proc. ACM International Conference on Multimedia (ACMMM). 291–300.
  7. Fast-At: Fast Automatic Thumbnail Generation Using Deep Neural Networks. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4622–4630.
  8. Automatic Image Cropping Using Visual Composition, Boundary Simplicity and Content Preservation Models. In Proc. ACM International Conference on Multimedia (ACMMM). 1105–1108.
  9. Composing Photos Like a Photographer. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7057–7066.
  10. Repurposing existing deep networks for caption and aesthetic-guided image cropping. Pattern Recognition 126 (2022), 108485.
  11. Automatic Thumbnail Generation Based on Visual Representativeness and Foreground Recognizability. In Proc. IEEE International Conference on Computer Vision (ICCV). 253–261.
  12. Rethinking Image Cropping: Exploring Diverse Compositions From Global Views. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2446–2455.
  13. Automatic image cropping with aesthetic map and gradient energy map. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 1982–1986.
  14. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
  15. A2-RL: Aesthetics aware reinforcement learning for image cropping. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 8193–8201.
  16. Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping. IEEE Transactions on Image Processing (TIP) 28, 10 (2019), 5105–5120.
  17. Learning to learn cropping models for different aspect ratio requirements. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 12685–12694.
  18. TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing 494 (2022), 455–467.
  19. A framework for visual saliency detection with applications to image thumbnailing. In Proc. IEEE International Conference on Computer Vision (ICCV). 2232–2239.
  20. AVA: A large-scale database for aesthetic visual analysis. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2408–2415.
  21. Learning to photograph: A compositional perspective. IEEE Transactions on Multimedia 15, 5 (2013), 1138–1151.
  22. Aesthetic quality classification of photographs based on color harmony. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 33–40.
  23. Sensation-Based Photo Cropping. In Proc. ACM International Conference on Multimedia (ACMMM). 669–672.
  24. TransView: Inside, Outside, and Across the Cropping View Boundaries. In Proc. IEEE/CVF International Conference on Computer Vision (ICCV). 4218–4227.
  25. Gaze-based interaction for semi-automatic photo cropping. In Proc. SIGCHI Conference on Human Factors in Computing Systems (CHI). 771–780.
  26. Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
  27. Automatic thumbnail cropping and its effectiveness. In Proc. ACM symposium on User interface software and technology (UIST). 95–104.
  28. Wenguan Wang and Jianbing Shen. 2017. Deep cropping via attention box prediction and aesthetics assessment. In Proc. IEEE International Conference on Computer Vision (ICCV). 2186–2194.
  29. A Deep Network Solution for Attention and Aesthetics Aware Photo Cropping. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 41, 7 (2019), 1531–1544.
  30. Good view hunting: Learning photo composition from dense view pairs. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 5437–5446.
  31. Reliable and Efficient Image Cropping: A Grid Anchor based Approach. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 5949–5957.
  32. Grid anchor based image cropping: A new benchmark and an efficient model. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 44, 3 (2020), 1304–1319.
  33. Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features. In Proc. European Conference on Computer Vision (ECCV). 181–197.
  34. Aesthetic-guided outward image cropping. ACM Transactions on Graphics (TOG) 40, 6 (2021), 1–13.
  35. ClipCrop: Conditioned Cropping Driven by Vision-Language Model. arXiv preprint arXiv:2211.11492 (2022).

Summary

We haven't generated a summary for this paper yet.