A new set of tools for goodness-of-fit validation (2209.07295v2)
Abstract: We introduce two new tools to assess the validity of statistical distributions. These tools are based on components derived from a new statistical quantity, the comparison curve. The first tool is a graphical representation of these components on a bar plot (B plot), which can provide a detailed appraisal of the validity of the statistical model, in particular when supplemented by acceptance regions related to the model. The knowledge gained from this representation can sometimes suggest an existing goodness-of-fit test to supplement this visual assessment with control of the type I error. Otherwise, an adaptive test may be preferable, and the second tool is the combination of these components to produce a powerful $\chi^2$-type goodness-of-fit test. Because the number of these components can be large, we introduce a new selection rule to decide, in a data-driven fashion, how many of them to take into consideration. In a simulation, our goodness-of-fit tests are seen to be competitive in power with the best solutions that have been recommended, both in the context of a fully specified model and when some parameters must be estimated. Practical examples show how to use these tools to derive principled information about where the model departs from the data.
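To make the idea of "combining components into a $\chi^2$-type statistic with a data-driven choice of their number" concrete, here is a minimal Python sketch of a related classical construction: a Neyman-type smooth test for uniformity on [0, 1] with a Schwarz (BIC-type) selection rule in the spirit of Ledwina (1994). It is not the paper's method. The components below are built from orthonormal Legendre polynomials rather than from the comparison curve, and the selection rule is the classical one, not the new rule the abstract announces. The function names `smooth_components` and `data_driven_statistic` and the default `d=10` are hypothetical choices made for illustration.

```python
import numpy as np
from numpy.polynomial import legendre


def smooth_components(u, d):
    """Return the d components n^{-1/2} * sum_i b_j(u_i), j = 1..d.

    b_j is the degree-j Legendre polynomial rescaled to be orthonormal
    on [0, 1]. This basis is an assumption of the sketch, not the
    comparison-curve components described in the abstract.
    """
    u = np.asarray(u, dtype=float)
    x = 2.0 * u - 1.0  # map [0, 1] onto [-1, 1]
    comps = np.empty(d)
    for j in range(1, d + 1):
        coef = np.zeros(j + 1)
        coef[j] = 1.0  # pick out the degree-j Legendre polynomial
        b_j = np.sqrt(2 * j + 1) * legendre.legval(x, coef)
        comps[j - 1] = b_j.sum() / np.sqrt(u.size)
    return comps


def data_driven_statistic(u, d=10):
    """Chi-square-type statistic with a Schwarz-type choice of dimension.

    Returns (k, statistic, components), where k is the smallest index
    maximizing N_k - k * log(n) and N_k is the sum of the first k
    squared components.
    """
    n = np.size(u)
    comps = smooth_components(u, d)
    partial = np.cumsum(comps ** 2)  # N_1, ..., N_d
    penalized = partial - np.arange(1, d + 1) * np.log(n)
    k = int(np.argmax(penalized)) + 1  # argmax returns the first maximizer
    return k, float(partial[k - 1]), comps


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sample = rng.uniform(size=200)  # data already transformed by the null CDF
    k, stat, comps = data_driven_statistic(sample)
    print("selected number of components:", k)
    print("statistic:", stat)
```

For a fully specified continuous null distribution, the observations would first be transformed by the null CDF so that they are uniform on [0, 1] under the null. Plotting `comps` as bars gives a rough analogue of a component plot, and under the null the selected statistic in this classical construction is asymptotically $\chi^2$ with one degree of freedom, although simulated critical values are usually preferred in finite samples.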