Analyzing the Current State of Neural Network Pruning
The paper "What is the State of Neural Network Pruning?" by Davis Blalock, Jose Javier Gonzalez Ortiz, Jonathan Frankle, and John Guttag undertakes a comprehensive review of neural network pruning literature to assess its current state. The paper examines methodologies from 81 research contributions to identify prevailing practices, limitations, and insights into the effectiveness of pruning techniques. This essay aims to summarize the paper's key findings, evaluate its methodological contributions, and discuss its implications for future AI research.
Key Contributions
The paper makes several notable contributions:
- Comprehensive Meta-Analysis: It compiles and analyzes data from 81 pruning papers, offering a broad perspective on the field.
- Identification of Issues: It systematically identifies the lack of standardized benchmarks, common experimental practices, and consistent methodologies as significant barriers to progress in the field.
- ShrinkBench: It introduces ShrinkBench, an open-source framework designed to standardize the evaluation of pruning methods.
Findings and Observations
Efficacy of Pruning
The paper establishes several consistent findings:
- Pruning, even with simple magnitude-based criteria, can substantially compress networks with little or no loss of accuracy (a minimal sketch of this approach follows this list).
- Sparse models generally outperform dense ones when matched for parameter count.
- Pruned models can sometimes achieve higher accuracy than their unpruned counterparts, especially at modest levels of compression.
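To make the first finding concrete, below is a minimal sketch of global magnitude pruning in PyTorch. It assumes a trained model and a target sparsity; the function name and structure are illustrative, not the paper's or ShrinkBench's implementation.

```python
import torch
import torch.nn as nn

def global_magnitude_prune(model: nn.Module, sparsity: float) -> None:
    """Zero out the smallest-magnitude weights across all Conv2d/Linear layers.

    `sparsity` is the fraction of prunable weights to remove (e.g., 0.9 removes 90%).
    """
    prunable = [m for m in model.modules() if isinstance(m, (nn.Conv2d, nn.Linear))]
    all_weights = torch.cat([m.weight.detach().abs().flatten() for m in prunable])
    k = int(sparsity * all_weights.numel())
    if k == 0:
        return
    threshold = torch.kthvalue(all_weights, k).values  # k-th smallest magnitude overall

    with torch.no_grad():
        for m in prunable:
            mask = (m.weight.abs() > threshold).to(m.weight.dtype)
            m.weight.mul_(mask)  # weights at or below the threshold become zero
```

In practice the pruned model is then fine-tuned for some number of epochs, with the mask re-applied (or gradients masked) so the removed weights stay at zero.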
Experimental Inconsistencies
A central theme in the paper is the fragmentation in experimental practices:
- Dataset and Architecture Fragmentation: Among the 81 papers analyzed, there were 49 different datasets and 132 different architectures used. No dataset-architecture pair was used in more than a third of the studies, complicating direct comparisons.
- Metrics Variability: Evaluation metrics vary widely across papers; even the most common ones, such as parameter reduction and FLOP reduction, are computed and reported inconsistently, making cross-paper comparisons difficult (see the example after this list).
- Comparison Deficiencies: Few papers compare directly against prior methods. Many recent techniques are never evaluated against earlier baselines, including foundational pruning methods from the 1990s, so claims of superiority are difficult to substantiate.
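To illustrate why metric definitions matter, the snippet below computes two commonly reported quantities, compression ratio and theoretical speedup, under one reasonable convention (total vs. remaining parameters, and dense vs. pruned FLOPs). These definitions are an assumption for illustration, not a standard shared by all 81 papers.

```python
def compression_ratio(total_params: int, nonzero_params: int) -> float:
    """Compression ratio = original parameter count / remaining (nonzero) parameters."""
    return total_params / nonzero_params

def theoretical_speedup(total_flops: float, pruned_flops: float) -> float:
    """Theoretical speedup = FLOPs of the dense model / FLOPs after pruning.

    Actual wall-clock speedup also depends on hardware and sparsity support,
    which is one reason reported numbers are hard to compare across papers.
    """
    return total_flops / pruned_flops

# Example: a network pruned from 25M to 5M nonzero parameters
print(compression_ratio(25_000_000, 5_000_000))  # 5.0x compression
print(theoretical_speedup(4.0e9, 1.6e9))         # 2.5x theoretical speedup
```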
Confounding Variables
The paper highlights several confounding factors impacting the evaluation of pruning methods:
- Differences in model architectures, hyperparameters, and training regimes.
- Lack of consistency in the initial model state (pretrained weights) from which pruning begins, leading to unreliable comparisons (see the sketch after this list).
- Variations in the libraries, data loaders, and computational environments used across studies.
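One way to control the initial-state confound is to fix random seeds and checkpoint the exact pre-pruning weights so that every method under comparison starts from the same point. The helpers below are a minimal sketch; the function and file names are illustrative.

```python
import random
import numpy as np
import torch

def fix_seeds(seed: int = 42) -> None:
    """Seed all relevant RNGs so data order and initialization are repeatable."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)

def save_initial_state(model: torch.nn.Module, path: str = "pretrained_init.pt") -> None:
    """Checkpoint the exact weights every pruning method should start from."""
    torch.save(model.state_dict(), path)

def load_initial_state(model: torch.nn.Module, path: str = "pretrained_init.pt") -> None:
    """Restore that checkpoint before applying a pruning method."""
    model.load_state_dict(torch.load(path))
```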
ShrinkBench: Addressing Standardization
ShrinkBench is introduced as a standardized framework for evaluating pruning methods. It emphasizes reproducibility and comparability by providing the following (a simplified evaluation loop in this spirit is sketched after the list):
- Consistent initial model states and hyperparameters.
- Uniform evaluation metrics across multiple datasets and architectures.
- Detailed and reproducible experiment setups.
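A simplified picture of what such a harness enforces, fixed starting weights, a fixed evaluation protocol, and uniform metrics, is sketched below in plain PyTorch. This is an illustrative loop, not the actual ShrinkBench API; the `strategy(model, compression=...)` interface is an assumption, and fine-tuning after pruning is omitted for brevity.

```python
import torch

def top1_accuracy(model, loader):
    """Top-1 accuracy on a validation loader."""
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for x, y in loader:
            correct += (model(x).argmax(dim=1) == y).sum().item()
            total += y.numel()
    return correct / total

def evaluate_strategy(strategy, model_fn, val_loader, compressions):
    """Evaluate one pruning strategy under fixed, shared conditions.

    `model_fn` rebuilds the same pretrained model each time, so every strategy
    starts from identical weights; `strategy(model, compression)` is assumed to
    zero weights in place to reach the target compression ratio.
    """
    results = []
    for c in compressions:
        model = model_fn()                 # identical starting checkpoint for every run
        strategy(model, compression=c)     # pruning method under test
        results.append({
            "compression": c,
            "top1": top1_accuracy(model, val_loader),
            "nonzero_params": sum(int((p != 0).sum()) for p in model.parameters()),
        })
    return results
```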
ShrinkBench's results illustrate the importance of standardized methodologies. For example, the paper shows that different pruning strategies (e.g., magnitude-based vs. gradient-based scoring) yield different accuracy-compression trade-offs depending on the dataset and architecture used.
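For context, one common family of gradient-based criteria scores each weight by |w · ∂L/∂w|, a first-order estimate of how much the loss would change if the weight were removed, rather than by |w| alone. The sketch below shows only this alternative scoring step; it is a generic example, not the specific methods compared in the paper.

```python
import torch
import torch.nn as nn

def taylor_scores(model: nn.Module, loss: torch.Tensor) -> dict:
    """Score each parameter tensor by |w * dL/dw| (first-order Taylor criterion).

    Lower scores mark weights whose removal is estimated to change the loss least;
    these would be pruned first, with the rest of the pipeline unchanged.
    """
    model.zero_grad()
    loss.backward()
    return {
        name: (p.detach() * p.grad).abs()
        for name, p in model.named_parameters()
        if p.grad is not None
    }
```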
Implications and Future Directions
The paper argues that, without standardized benchmarks, meaningful progress in pruning research is difficult to make or to measure. Absent direct, controlled comparisons, claims that a new method advances the state of the art are often unsubstantiated. Key implications include:
- Development of Standardized Benchmarks: The introduction and adoption of tools like ShrinkBench can provide a foundation for comparing pruning methods robustly.
- Rigorous Comparison Practices: Future research should implement consistent controls and compare against a broad spectrum of existing methods.
- Detailed Reporting: Researchers should publish complete experimental setups, including hyperparameters, library versions, and random-seed initialization, as sketched below.
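In the spirit of the last recommendation, a minimal way to record the experimental setup alongside the results is sketched below; the exact fields and file name are illustrative, not a prescribed format.

```python
import json
import platform
import torch

def log_experiment_metadata(path: str, hyperparams: dict, seed: int) -> None:
    """Write the details needed to reproduce a pruning experiment."""
    record = {
        "hyperparameters": hyperparams,      # learning rate, batch size, epochs, ...
        "seed": seed,
        "python_version": platform.python_version(),
        "torch_version": torch.__version__,
        "cuda_version": torch.version.cuda,  # None if running on CPU
    }
    with open(path, "w") as f:
        json.dump(record, f, indent=2)

log_experiment_metadata(
    "run_metadata.json",
    {"lr": 0.01, "batch_size": 128, "finetune_epochs": 20},
    seed=42,
)
```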
Conclusion
The paper by Blalock et al. represents an effort to synthesize the fragmented field of neural network pruning and address inconsistencies through the introduction of ShrinkBench. The key takeaway is the need for standardized benchmarks, comprehensive comparisons, and rigorous reporting to advance the understanding and development of efficient pruning techniques. As the field progresses, adherence to these practices will be crucial in developing robust, scalable, and efficient neural networks.