Statistical comparison of classifiers through Bayesian hierarchical modelling (1609.08905v3)

Published 28 Sep 2016 in cs.LG, stat.ME, and stat.ML

Abstract: Usually one compares the accuracy of two competing classifiers via null hypothesis significance tests (nhst). Yet the nhst tests suffer from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We propose a Bayesian hierarchical model which jointly analyzes the cross-validation results obtained by two classifiers on multiple data sets. It returns the posterior probability of the accuracies of the two classifiers being practically equivalent or significantly different. A further strength of the hierarchical model is that, by jointly analyzing the results obtained on all data sets, it reduces the estimation error compared to the usual approach of averaging the cross-validation results obtained on a given data set.

Citations (52)

View on Semantic Scholar

Collections

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Paper Prompts

Explore 10 Community Prompts

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Statistical comparison of classifiers through Bayesian hierarchical modelling (1609.08905v3)

Collections

Summary

Paper Prompts

Follow-up Questions

Related Papers

Authors (5)