Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Visual Validation versus Visual Estimation: A Study on the Average Value in Scatterplots (2307.09330v3)

Published 18 Jul 2023 in cs.CV and cs.GR

Abstract: We investigate the ability of individuals to visually validate statistical models in terms of their fit to the data. While visual model estimation has been studied extensively, visual model validation remains under-investigated. It is unknown how well people are able to visually validate models, and how their performance compares to visual and computational estimation. As a starting point, we conducted a study across two populations (crowdsourced and volunteers). Participants had to both visually estimate (i.e, draw) and visually validate (i.e., accept or reject) the frequently studied model of averages. Across both populations, the level of accuracy of the models that were considered valid was lower than the accuracy of the estimated models. We find that participants' validation and estimation were unbiased. Moreover, their natural critical point between accepting and rejecting a given mean value is close to the boundary of its 95% confidence interval, indicating that the visually perceived confidence interval corresponds to a common statistical standard. Our work contributes to the understanding of visual model validation and opens new research opportunities.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. C. R. Blyth. On simpson’s paradox and the sure-thing principle. Journal of the American Statistical Association, 67(338):364–366, 1972. doi: 10 . 1080/01621459 . 1972 . 10482387
  2. Experimental methods: Between-subject and within-subject design. Journal of Economic Behavior & Organization, 81(1):1–8, 2012. doi: 10 . 1016/j . jebo . 2011 . 08 . 009
  3. A survey of surveys on the use of visualization for interpreting machine learning models. Information Visualization, 19(3):207–233, 2020. doi: 10 . 1177/1473871620904671
  4. The state of the art in enhancing trust in machine learning models with the use of visualizations. Computer Graphics Forum, 39(3):713–756, 2020. doi: 10 . 1111/cgf . 14034
  5. M. Correll and J. Heer. Regression by eye: Estimating trends in bivariate visualizations. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1387––1396. ACM, New York, 2017. doi: 10 . 1145/3025453 . 3025922
  6. Looks good to me: Visualizations as sanity checks. IEEE Transactions on Visualization and Computer Graphics, 25(1):830–839, 2019. doi: 10 . 1109/TVCG . 2018 . 2864907
  7. Global forecasting confirmed and fatal cases of covid-19 outbreak using autoregressive integrated moving average model. Frontiers in Public Health, 8:580327:1–580327:11, 2020. doi: 10 . 3389/fpubh . 2020 . 580327
  8. Familiarity vs trust: A comparative study of domain scientists’ trust in visual analytics and conventional analysis methods. IEEE Transactions on Visualization and Computer Graphics, 23(1):271–280, 2017. doi: 10 . 1109/TVCG . 2016 . 2598544
  9. Visualizing the central tendency of ensembles of shapes. In SIGGRAPH ASIA 2016 Symposium on Visualization, pp. 3:1–3:8. ACM, New York, 2016. doi: 10 . 1145/3002151 . 3002165
  10. C. Dubé. Central tendency representation and exemplar matching in visual short-term memory. Memory & Cognition, 47:589–602, 2019. doi: 10 . 3758/s13421-019-00900-0
  11. How do we measure trust in visual data communication? In 2022 IEEE Evaluation and Beyond - Methodological Approaches for Visualization (BELIV), pp. 85–92. IEEE, New York, 2022. doi: 10 . 1109/BELIV57783 . 2022 . 00014
  12. Perception of average value in multiclass scatterplots. IEEE Transactions on Visualization and Computer Graphics, 19(12):2316–2325, 2013. doi: 10 . 1109/TVCG . 2013 . 183
  13. Ranking visualizations of correlation using weber’s law. IEEE Transactions on Visualization and Computer Graphics, 20(12):1943–1952, 2014. doi: 10 . 1109/TVCG . 2014 . 2346979
  14. The weighted average illusion: Biases in perceived mean position in scatterplots. IEEE Transactions on Visualization and Computer Graphics, 28(1):987–997, 2022. doi: 10 . 1109/TVCG . 2021 . 3114783
  15. M. Kay and J. Heer. Beyond weber’s law: A second look at ranking visualizations of correlation. IEEE Transactions on Visualization and Computer Graphics, 22(1):469–478, 2016. doi: 10 . 1109/TVCG . 2015 . 2467671
  16. J. Matejka and G. Fitzmaurice. Same stats, different graphs: Generating datasets with varied appearance and identical statistics through simulated annealing. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1290––1294. ACM, New York, 2017. doi: 10 . 1145/3025453 . 3025912
  17. Trust in information visualization. In EuroVis Workshop on Trustworthy Visualization (TrustVis), pp. 25–29. Eurographics Association, Goslar, 2019. doi: 10 . 2312/trvis . 20191187
  18. Sample size determination for logistic regression. Journal of Computational and Applied Mathematics, 255:743–752, 2014. doi: 10 . 1016/j . cam . 2013 . 06 . 031
  19. T. Munzner. Visualization Analysis and Design. A K Peters/CRC Press, Florida, 2014.
  20. T. Mühlbacher and H. Piringer. A partition-based framework for building and validating regression models. IEEE Transactions on Visualization and Computer Graphics, 19(12):1962–1971, 2013. doi: 10 . 1109/TVCG . 2013 . 125
  21. Fitting bell curves to data distributions using visualization. IEEE Transactions on Visualization and Computer Graphics, pp. 1–12, 2022. doi: 10 . 1109/TVCG . 2022 . 3210763
  22. It takes two to lie: One to lie, and one to listen. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3811–3854. ACL, Stroudsburg, 2020. doi: 10 . 18653/v1/2020 . acl-main . 353
  23. Prolific. Prolific · quickly find research participants you can trust. https://www.prolific.co. Location: London, UK. Accessed: 2023-03-20.
  24. R. A. Rensink and G. Baldridge. The perception of correlation in scatterplots. Computer Graphics Forum, 29(3):1203–1210, 2010. doi: 10 . 1111/j . 1467-8659 . 2009 . 01694 . x
  25. How not to lie with visualization. Computers in Physics and IEEE Computational Science & Engineering, 10(3):268–273, 1996. doi: 10 . 1063/1 . 4822401
  26. E. H. Simpson. The interpretation of interaction in contingency tables. Journal of the Royal Statistical Society: Series B (Methodological), 13(2):238–241, 1951. doi: 10 . 1111/j . 2517-6161 . 1951 . tb00088 . x
  27. Four types of ensemble coding in data visualizations. Journal of Vision, 16(5):11:1–11:19, 2016. doi: 10 . 1167/16 . 5 . 11
  28. W. M. Vagias. Likert-type scale response anchors. Clemson International Institute for Tourism & Research Development, Department of Parks, Recreation and Tourism Management. Clemson University, 2006.
  29. Seeing what you believe or believing what you see? belief biases correlation estimation. IEEE Transactions on Visualization and Computer Graphics, 29(1):493–503, 2023. doi: 10 . 1109/TVCG . 2022 . 3209405
  30. Correlation judgment and visualization features: A comparative study. IEEE Transactions on Visualization and Computer Graphics, 25(3):1474–1488, 2019. doi: 10 . 1109/TVCG . 2018 . 2810918
  31. Perceptual proxies for extracting averages in data visualizations. Psychonomic Bulletin & Review, 26(2):669–676, 2019. doi: 10 . 3758/s13423-018-1525-7
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Daniel Braun (92 papers)
  2. Ashley Suh (18 papers)
  3. Remco Chang (31 papers)
  4. Michael Gleicher (44 papers)
  5. Tatiana von Landesberger (9 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.