Towards a Semantic Perceptual Image Metric (1808.00447v1)

Published 1 Aug 2018 in cs.CV

Abstract: We present a full-reference, perceptual image metric based on VGG-16, an artificial neural network trained on object classification. We fit the metric to a new database based on 140k unique images annotated with ground truth by human raters who received minimal instruction. The resulting metric shows competitive performance on TID 2013, a database widely used to assess image quality assessment methods. More interestingly, it shows strong responses to objects potentially carrying semantic relevance such as faces and text, which we demonstrate using a visualization technique and ablation experiments. In effect, the metric appears to model a higher influence of semantic context on judgments, which we observe particularly in untrained raters. As the vast majority of users of image processing systems are unfamiliar with Image Quality Assessment (IQA) tasks, these findings may have significant impact on real-world applications of perceptual metrics.

Authors (11)
  1. Troy Chinen (4 papers)
  2. Johannes Ballé (29 papers)
  3. Chunhui Gu (6 papers)
  4. Sung Jin Hwang (10 papers)
  5. Sergey Ioffe (10 papers)
  6. Nick Johnston (17 papers)
  7. Thomas Leung (10 papers)
  8. David Minnen (19 papers)
  9. Sean O'Malley (1 paper)
  10. Charles Rosenberg (12 papers)
  11. George Toderici (22 papers)
Citations (11)
