
Distributed Evaluations: Ending Neural Point Metrics (1806.03790v1)

Published 11 Jun 2018 in cs.IR

Abstract: With the rise of neural models across the field of information retrieval, numerous publications have incrementally pushed the envelope of performance for a multitude of IR tasks. However, these networks often sample data in random order, are initialized randomly, and their success is determined by a single evaluation score. These issues are aggravated by neural models achieving incremental improvements over previous neural baselines, leading to multiple near-state-of-the-art models that are difficult to reproduce and quickly become deprecated. As neural methods are starting to be incorporated into low-resource and noisy collections that further exacerbate this issue, we propose evaluating neural models both over multiple random seeds and over a set of hyperparameters within $\epsilon$ distance of the chosen configuration for a given metric.
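The proposed protocol can be illustrated with a minimal sketch: instead of reporting a single point metric, train and evaluate across several random seeds and across hyperparameter settings within $\epsilon$ of the chosen configuration, then report the resulting distribution. The `train_and_evaluate` function below is a hypothetical stand-in (not from the paper) for training an IR model and computing a metric such as MAP or nDCG; the seeds, learning rate, and $\epsilon$ values are illustrative assumptions.

```python
import random
import statistics

def train_and_evaluate(seed, learning_rate):
    # Hypothetical stand-in for training a neural IR model and scoring it
    # with a retrieval metric (e.g. MAP); a seeded pseudo-random score
    # mimics run-to-run variance from initialization and data ordering.
    rng = random.Random(seed)
    return 0.30 + 0.02 * rng.random() - abs(learning_rate - 1e-3) * 5

def distributed_evaluation(seeds, base_lr, epsilon):
    # Evaluate over several random seeds AND over hyperparameter settings
    # within epsilon (relative) distance of the chosen configuration,
    # returning the distribution rather than a single point metric.
    lrs = [base_lr * (1 - epsilon), base_lr, base_lr * (1 + epsilon)]
    scores = [train_and_evaluate(s, lr) for s in seeds for lr in lrs]
    return statistics.mean(scores), statistics.stdev(scores)

mean, std = distributed_evaluation(seeds=range(5), base_lr=1e-3, epsilon=0.1)
print(f"metric = {mean:.3f} +/- {std:.3f}")
```

Reporting the mean and spread in this way makes it visible whether an incremental improvement over a baseline is larger than the model's own seed-to-seed variance.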

Citations (5)