Visual Validation versus Visual Estimation: A Study on the Average Value in Scatterplots (2307.09330v3)
Abstract: We investigate the ability of individuals to visually validate statistical models in terms of their fit to the data. While visual model estimation has been studied extensively, visual model validation remains under-investigated. It is unknown how well people are able to visually validate models, and how their performance compares to visual and computational estimation. As a starting point, we conducted a study across two populations (crowdsourced and volunteers). Participants had to both visually estimate (i.e, draw) and visually validate (i.e., accept or reject) the frequently studied model of averages. Across both populations, the level of accuracy of the models that were considered valid was lower than the accuracy of the estimated models. We find that participants' validation and estimation were unbiased. Moreover, their natural critical point between accepting and rejecting a given mean value is close to the boundary of its 95% confidence interval, indicating that the visually perceived confidence interval corresponds to a common statistical standard. Our work contributes to the understanding of visual model validation and opens new research opportunities.
- C. R. Blyth. On simpson’s paradox and the sure-thing principle. Journal of the American Statistical Association, 67(338):364–366, 1972. doi: 10 . 1080/01621459 . 1972 . 10482387
- Experimental methods: Between-subject and within-subject design. Journal of Economic Behavior & Organization, 81(1):1–8, 2012. doi: 10 . 1016/j . jebo . 2011 . 08 . 009
- A survey of surveys on the use of visualization for interpreting machine learning models. Information Visualization, 19(3):207–233, 2020. doi: 10 . 1177/1473871620904671
- The state of the art in enhancing trust in machine learning models with the use of visualizations. Computer Graphics Forum, 39(3):713–756, 2020. doi: 10 . 1111/cgf . 14034
- M. Correll and J. Heer. Regression by eye: Estimating trends in bivariate visualizations. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1387––1396. ACM, New York, 2017. doi: 10 . 1145/3025453 . 3025922
- Looks good to me: Visualizations as sanity checks. IEEE Transactions on Visualization and Computer Graphics, 25(1):830–839, 2019. doi: 10 . 1109/TVCG . 2018 . 2864907
- Global forecasting confirmed and fatal cases of covid-19 outbreak using autoregressive integrated moving average model. Frontiers in Public Health, 8:580327:1–580327:11, 2020. doi: 10 . 3389/fpubh . 2020 . 580327
- Familiarity vs trust: A comparative study of domain scientists’ trust in visual analytics and conventional analysis methods. IEEE Transactions on Visualization and Computer Graphics, 23(1):271–280, 2017. doi: 10 . 1109/TVCG . 2016 . 2598544
- Visualizing the central tendency of ensembles of shapes. In SIGGRAPH ASIA 2016 Symposium on Visualization, pp. 3:1–3:8. ACM, New York, 2016. doi: 10 . 1145/3002151 . 3002165
- C. Dubé. Central tendency representation and exemplar matching in visual short-term memory. Memory & Cognition, 47:589–602, 2019. doi: 10 . 3758/s13421-019-00900-0
- How do we measure trust in visual data communication? In 2022 IEEE Evaluation and Beyond - Methodological Approaches for Visualization (BELIV), pp. 85–92. IEEE, New York, 2022. doi: 10 . 1109/BELIV57783 . 2022 . 00014
- Perception of average value in multiclass scatterplots. IEEE Transactions on Visualization and Computer Graphics, 19(12):2316–2325, 2013. doi: 10 . 1109/TVCG . 2013 . 183
- Ranking visualizations of correlation using weber’s law. IEEE Transactions on Visualization and Computer Graphics, 20(12):1943–1952, 2014. doi: 10 . 1109/TVCG . 2014 . 2346979
- The weighted average illusion: Biases in perceived mean position in scatterplots. IEEE Transactions on Visualization and Computer Graphics, 28(1):987–997, 2022. doi: 10 . 1109/TVCG . 2021 . 3114783
- M. Kay and J. Heer. Beyond weber’s law: A second look at ranking visualizations of correlation. IEEE Transactions on Visualization and Computer Graphics, 22(1):469–478, 2016. doi: 10 . 1109/TVCG . 2015 . 2467671
- J. Matejka and G. Fitzmaurice. Same stats, different graphs: Generating datasets with varied appearance and identical statistics through simulated annealing. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1290––1294. ACM, New York, 2017. doi: 10 . 1145/3025453 . 3025912
- Trust in information visualization. In EuroVis Workshop on Trustworthy Visualization (TrustVis), pp. 25–29. Eurographics Association, Goslar, 2019. doi: 10 . 2312/trvis . 20191187
- Sample size determination for logistic regression. Journal of Computational and Applied Mathematics, 255:743–752, 2014. doi: 10 . 1016/j . cam . 2013 . 06 . 031
- T. Munzner. Visualization Analysis and Design. A K Peters/CRC Press, Florida, 2014.
- T. Mühlbacher and H. Piringer. A partition-based framework for building and validating regression models. IEEE Transactions on Visualization and Computer Graphics, 19(12):1962–1971, 2013. doi: 10 . 1109/TVCG . 2013 . 125
- Fitting bell curves to data distributions using visualization. IEEE Transactions on Visualization and Computer Graphics, pp. 1–12, 2022. doi: 10 . 1109/TVCG . 2022 . 3210763
- It takes two to lie: One to lie, and one to listen. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3811–3854. ACL, Stroudsburg, 2020. doi: 10 . 18653/v1/2020 . acl-main . 353
- Prolific. Prolific · quickly find research participants you can trust. https://www.prolific.co. Location: London, UK. Accessed: 2023-03-20.
- R. A. Rensink and G. Baldridge. The perception of correlation in scatterplots. Computer Graphics Forum, 29(3):1203–1210, 2010. doi: 10 . 1111/j . 1467-8659 . 2009 . 01694 . x
- How not to lie with visualization. Computers in Physics and IEEE Computational Science & Engineering, 10(3):268–273, 1996. doi: 10 . 1063/1 . 4822401
- E. H. Simpson. The interpretation of interaction in contingency tables. Journal of the Royal Statistical Society: Series B (Methodological), 13(2):238–241, 1951. doi: 10 . 1111/j . 2517-6161 . 1951 . tb00088 . x
- Four types of ensemble coding in data visualizations. Journal of Vision, 16(5):11:1–11:19, 2016. doi: 10 . 1167/16 . 5 . 11
- W. M. Vagias. Likert-type scale response anchors. Clemson International Institute for Tourism & Research Development, Department of Parks, Recreation and Tourism Management. Clemson University, 2006.
- Seeing what you believe or believing what you see? belief biases correlation estimation. IEEE Transactions on Visualization and Computer Graphics, 29(1):493–503, 2023. doi: 10 . 1109/TVCG . 2022 . 3209405
- Correlation judgment and visualization features: A comparative study. IEEE Transactions on Visualization and Computer Graphics, 25(3):1474–1488, 2019. doi: 10 . 1109/TVCG . 2018 . 2810918
- Perceptual proxies for extracting averages in data visualizations. Psychonomic Bulletin & Review, 26(2):669–676, 2019. doi: 10 . 3758/s13423-018-1525-7
- Daniel Braun (92 papers)
- Ashley Suh (18 papers)
- Remco Chang (31 papers)
- Michael Gleicher (44 papers)
- Tatiana von Landesberger (9 papers)