Finite-sample expansions for the optimal error probability in asymmetric binary hypothesis testing (2404.09605v2)
Abstract: The problem of binary hypothesis testing between two probability measures is considered. New sharp bounds are derived for the best achievable error probability of such tests based on independent and identically distributed observations. Specifically, the asymmetric version of the problem is examined, where different requirements are placed on the two error probabilities. Accurate nonasymptotic expansions with explicit constants are obtained for the error probability, using tools from large deviations and Gaussian approximation. Examples are shown indicating that, in the asymmetric regime, the approximations suggested by the new bounds are significantly more accurate than the approximations provided by either of the two main earlier approaches -- normal approximation and error exponents.
- Z. Bar-Yossef. The complexity of massive data set computations. PhD thesis, Department of Computer Sciecen, University of California, Berkeley, Berkeley, CA, 2002.
- R.E. Blahut. Hypothesis testing and information theory. IEEE Trans. Inform. Theory, 20(4):405–417, July 1974.
- A.A. Borovkov. Mathematical statistics. Gordon and Breach Science Publishers, Amsterdam, The Netherlands, 1998.
- C.L. Canonne. A survey on distribution testing: Your data is big. But is it blue? Number 9 in Graduate Surveys. Theory of Computing Library, 2020.
- H. Chernoff. A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations. Ann. Math. Statist., 23(4):493–507, 1952.
- Elements of information theory. John Wiley & Sons, New York, NY, second edition, 2012.
- I. Csiszár and J. Körner. Information theory: Coding theorems for discrete memoryless systems. Cambridge University Press, Cambridge, U.K., second edition, 2011.
- I. Csiszár and G. Longo. On the error exponent for source coding and for testing simple statistical hypotheses. Studia Sci. Math. Hungar., 6:181–191, 1971.
- W. Feller. An introduction to probability theory and its applications. Vol. II. John Wiley & Sons, New York, NY, second edition, 1971.
- L. Gavalakis and I. Kontoyiannis. Fundamental limits of lossless data compression with side information. IEEE Trans. Inform. Theory, 67(5):2680–2692, May 2021.
- T.S. Han. Information-spectrum methods in information theory. Springer, Berlin, 2003.
- W. Hoeffding. Asymptotically optimal tests for multinomial distributions. Ann. Math. Statist., 36(2):369–408, April 1965.
- J.L. Jensen. Saddlepoint approximations. Oxford University Press, Oxford, U.K., 1995.
- I. Kontoyiannis and S. Verdú. Optimal lossless data compression: Non-asymptotics and asymptotics. IEEE Trans. Inform. Theory, 60(2):777–795, February 2014.
- On the upper bound for the absolute constant in the Berry–Esséen inequality. Theory Probab. Appl., 54(4):638–658, 2010.
- S. Kullback. Information theory and statistics. Dover Publications, Mineola, NY, 1997. Reprint of the second (1968) edition.
- V. Lungu and I. Kontoyiannis. The optimal finite-sample error probability in asymmetric binary hypothesis testing. In 2024 IEEE International Symposium on Information Theory (ISIT), Athens, Greece, July 2024.
- J. Neyman and E.S. Pearson. On the problem of the most efficient tests of statistical hypotheses. Philos. Trans. Roy. Soc. London A, 231(694-706):289–337, 1933.
- V.V. Petrov. Limit theorems of probability theory. The Clarendon Press, Oxford University Press, New York, NY, 1995.
- Channel coding rate in the finite blocklength regime. IEEE Trans. Inform. Theory, 56(5):2307–2359, May 2010.
- V. Strassen. Asymptotische Abschätzungen in Shannons Informationstheorie. In Trans. Third Prague Conf. Information Theory, Statist. Decision Functions, Random Processes (Liblice, 1962), pages 689–723. Publ. House Czech. Acad. Sci., Prague, 1964.
- V.Y.F. Tan. Asymptotic estimates in information theory with non-vanishing error probabilities. Foundations and Trends in Communications and Information Theory, 11(1-2):1–184, September 2014.
- Saddlepoint approximation of the error probability of binary hypothesis testing. In 2018 IEEE International Symposium on Information Theory (ISIT), pages 2306–2310, Vail, CO, June 2018.
Collections
Sign up for free to add this paper to one or more collections.