Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Note on the Prediction-Powered Bootstrap (2405.18379v3)

Published 28 May 2024 in stat.ML, cs.LG, and stat.ME

Abstract: We introduce PPBoot: a bootstrap-based method for prediction-powered inference. PPBoot is applicable to arbitrary estimation problems and is very simple to implement, essentially only requiring one application of the bootstrap. Through a series of examples, we demonstrate that PPBoot often performs nearly identically to (and sometimes better than) the earlier PPI(++) method based on asymptotic normality$\unicode{x2013}$when the latter is applicable$\unicode{x2013}$without requiring any asymptotic characterizations. Given its versatility, PPBoot could simplify and expand the scope of application of prediction-powered inference to problems where central limit theorems are hard to prove.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (15)
  1. Prediction-powered inference. Science, 382(6671):669–674, 2023a.
  2. PPI++: Efficient prediction-powered inference. arXiv preprint arXiv:2311.01453, 2023b.
  3. Clustering predicted structures at the scale of the known protein universe. Nature, 622(7983):637–645, 2023.
  4. The structural context of posttranslational modifications at a proteome-wide scale. PLoS biology, 20(5):e3001636, 2022.
  5. Autoeval done right: Using synthetic data for model evaluation. arXiv preprint arXiv:2403.07008, 2024.
  6. An introduction to the bootstrap. Chapman and Hall/CRC, 1994.
  7. From narratives to numbers: Valid inference using language model predictions from verbal autopsy narratives. arXiv preprint arXiv:2404.02438, 2024.
  8. Highly accurate protein structure prediction with alphafold. Nature, 596(7873):583–589, 2021.
  9. The evolution, evolvability and engineering of gene regulatory dna. Nature, 603(7901):455–463, 2022.
  10. Methods for correcting inference based on outcomes predicted by machine learning. Proceedings of the National Academy of Sciences, 117(48):30266–30275, 2020.
  11. Galaxy zoo 2: detailed morphological classifications for 304 122 galaxies from the sloan digital sky survey. Monthly Notices of the Royal Astronomical Society, 435(4):2835–2860, 2013.
  12. Combining multiple observational data sources to estimate causal effects. Journal of the American Statistical Association, 2019.
  13. The sloan digital sky survey: Technical summary. The Astronomical Journal, 120(3):1579, 2000.
  14. Judging LLM-as-a-judge with MT-Bench and Chatbot Arena. Advances in Neural Information Processing Systems, 36, 2024.
  15. Cross-prediction-powered inference. Proceedings of the National Academy of Sciences (PNAS), 121(15), 2024.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com