Solving Mixed Integer Programs Using Neural Networks (2012.13349v3)

Published 23 Dec 2020 in math.OC, cs.AI, cs.DM, cs.LG, and cs.NE

Abstract: Mixed Integer Programming (MIP) solvers rely on an array of sophisticated heuristics developed with decades of research to solve large-scale MIP instances encountered in practice. Machine learning offers to automatically construct better heuristics from data by exploiting shared structure among instances in the data. This paper applies learning to the two key sub-tasks of a MIP solver, generating a high-quality joint variable assignment, and bounding the gap in objective value between that assignment and an optimal one. Our approach constructs two corresponding neural network-based components, Neural Diving and Neural Branching, to use in a base MIP solver such as SCIP. Neural Diving learns a deep neural network to generate multiple partial assignments for its integer variables, and the resulting smaller MIPs for un-assigned variables are solved with SCIP to construct high quality joint assignments. Neural Branching learns a deep neural network to make variable selection decisions in branch-and-bound to bound the objective value gap with a small tree. This is done by imitating a new variant of Full Strong Branching we propose that scales to large instances using GPUs. We evaluate our approach on six diverse real-world datasets, including two Google production datasets and MIPLIB, by training separate neural networks on each. Most instances in all the datasets combined have $10^3$-$10^6$ variables and constraints after presolve, which is significantly larger than previous learning approaches. Comparing solvers with respect to primal-dual gap averaged over a held-out set of instances, the learning-augmented SCIP is 2x to 10x better on all datasets except one on which it is $10^5$x better, at large time limits. To the best of our knowledge, ours is the first learning approach to demonstrate such large improvements over SCIP on both large-scale real-world application datasets and MIPLIB.

Overview of "Solving Mixed Integer Programs Using Neural Networks"

The paper by Nair et al. presents an innovative approach to improving the efficiency and effectiveness of Mixed Integer Programming (MIP) solvers through the use of neural networks. The authors propose a method consisting of two main components: Neural Diving and Neural Branching. Both components are designed to integrate with existing MIP solvers like SCIP and are aimed at optimizing the solver's performance on large-scale MIP problems.

Core Components and Methodology

1. Neural Diving:

Neural Diving focuses on generating high-quality partial variable assignments. A deep neural network, trained on feasible assignments, predicts values for a subset of the integer variables in a MIP; fixing these variables yields a smaller, more tractable sub-problem that the base solver can handle quickly. This component guides the solver toward promising regions of the solution space, accelerating the discovery of high-quality feasible solutions.
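
The sketch below illustrates this workflow under stated assumptions: a hypothetical trained predictor `predict_assignment` supplies values for a confident subset of integer variables, which are fixed before the reduced problem is handed to SCIP via PySCIPOpt. It is an illustration of the diving idea, not the authors' implementation.

```python
# Minimal sketch of the Neural Diving workflow (illustration only, not the
# authors' code). `predict_assignment` is a hypothetical trained model that
# maps a MIP instance to {variable name: value} for a subset of its
# integer variables.
from pyscipopt import Model

def neural_dive(instance_path, predict_assignment, time_limit=60.0):
    model = Model()
    model.readProblem(instance_path)

    # Values predicted for a confident subset of the integer variables.
    partial = predict_assignment(instance_path)

    # Fix each predicted integer variable by tightening both bounds,
    # leaving a smaller sub-MIP over the remaining variables.
    for var in model.getVars():
        if var.name in partial and var.vtype() in ("BINARY", "INTEGER"):
            val = partial[var.name]
            model.chgVarLb(var, val)
            model.chgVarUb(var, val)

    # Let SCIP complete the assignment on the reduced problem.
    model.setParam("limits/time", time_limit)
    model.optimize()
    if model.getNSols() > 0:
        return model.getBestSol(), model.getObjVal()
    return None, None  # the fixed sub-MIP may be infeasible
```

In the paper the network produces multiple partial assignments (varying how many variables are fixed), and the resulting sub-MIPs are solved independently; the sketch shows a single dive.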

2. Neural Branching:

Neural Branching addresses the challenge of variable selection in the branch-and-bound process. It uses imitation learning to approximate Full Strong Branching, a powerful but computationally expensive expert policy, so that well-informed branching decisions can be made cheaply at every node. A graph neural network learns to predict variable importance, which ultimately yields smaller search trees and faster convergence.
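
As a concrete illustration of the imitation setup, the sketch below trains a toy candidate scorer (a stand-in for the paper's graph neural network) with a cross-entropy loss to pick the same branching variable as a strong-branching expert on a logged branch-and-bound node. The names `CandidateScorer` and `imitation_loss`, and the flat per-candidate feature vectors, are assumptions made for brevity rather than the authors' code.

```python
# Sketch of imitation learning for branching (not the authors' exact setup).
import torch
import torch.nn as nn

class CandidateScorer(nn.Module):
    """Toy stand-in for the paper's graph neural network: maps per-candidate
    feature vectors to scalar branching scores."""
    def __init__(self, n_features, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_features, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, cand_features):                # (n_candidates, n_features)
        return self.net(cand_features).squeeze(-1)   # (n_candidates,)

def imitation_loss(scorer, cand_features, expert_choice):
    """Cross-entropy between the scorer's distribution over candidates and
    the index of the variable the strong-branching expert selected."""
    logits = scorer(cand_features).unsqueeze(0)      # (1, n_candidates)
    target = torch.tensor([expert_choice])           # (1,)
    return nn.functional.cross_entropy(logits, target)

# Usage on one logged node (features and expert label are assumed inputs):
scorer = CandidateScorer(n_features=20)
feats = torch.randn(15, 20)          # 15 branching candidates at this node
loss = imitation_loss(scorer, feats, expert_choice=3)
loss.backward()
```

In the paper the expert is a GPU-friendly variant of Full Strong Branching, which makes generating such labels feasible even on large instances.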

Numerical Results

The empirical evaluation demonstrates significant improvements in the primal-dual gap and solution times across several real-world datasets, including two from Google's production systems and the heterogeneous MIPLIB benchmark. For instance, the learning-augmented solver achieves up to $10^4$ times better average primal-dual gap on certain datasets, and in some cases it reduces the time to reach a 10% gap by up to five times relative to using the primal heuristic alone.
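
For reference, the headline metric can be computed roughly as follows; this is one common normalization of the relative primal-dual gap, and the paper's exact definition may differ in its handling of edge cases.

```python
def primal_dual_gap(primal_bound, dual_bound, eps=1e-9):
    """Relative primal-dual gap: 0 when the bounds meet, 1 when a bound is
    missing or the bounds disagree in sign (a common convention; the paper's
    exact normalization may differ)."""
    if primal_bound is None or dual_bound is None:
        return 1.0
    if primal_bound * dual_bound < 0:   # bounds on opposite sides of zero
        return 1.0
    num = abs(primal_bound - dual_bound)
    den = max(abs(primal_bound), abs(dual_bound), eps)
    return num / den

# Example: primal 105.0, dual 100.0 -> gap = 5 / 105, roughly 4.8%.
print(primal_dual_gap(105.0, 100.0))
```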

Implications and Future Directions

The integration of deep learning techniques with traditional optimization solvers marks a promising direction for solving complex discrete optimization problems. By automatically tailoring heuristics to specific problem distributions, the approach can significantly reduce dependency on handcrafted methodologies and expert tuning.

Practical Implications:

  • Scalability: The proposed method demonstrates scalability to MIPs with up to $10^6$ variables and constraints after presolve, a scale beyond previous learning-based methods.
  • Generality: The evaluation shows potential applicability beyond homogeneous datasets, evidenced by improvements on the diverse MIPLIB benchmark.

Theoretical Implications:

  • Modeling Complexity: There is potential for deeper exploration into the design of neural architectures that can capture the inherent structure in MIPs more effectively, potentially expanding to other types of optimization problems.
  • Learning Theories: The findings motivate further studies into reinforcement and unsupervised learning methodologies that may surpass traditional imitation strategies, potentially leading to even greater performance gains.

Conclusion

The paper effectively demonstrates the capability of neural networks to complement and enhance traditional MIP solvers. The work makes a substantial contribution to bridging the gap between artificial intelligence and mathematical optimization, paving the way for more intelligent, adaptive, and efficient optimization systems in industry-scale applications. Building upon these foundations, future research may explore the integration of neural network-based architectures across a broader spectrum of optimization frameworks, potentially disrupting traditional paradigms. The presented approach underscores a shift towards data-driven optimization strategies in solving combinatorial problems.

Authors (19)
  1. Vinod Nair (8 papers)
  2. Sergey Bartunov (12 papers)
  3. Felix Gimeno (8 papers)
  4. Ingrid von Glehn (4 papers)
  5. Pawel Lichocki (4 papers)
  6. Ivan Lobov (3 papers)
  7. Brendan O'Donoghue (30 papers)
  8. Nicolas Sonnerat (10 papers)
  9. Christian Tjandraatmadja (9 papers)
  10. Pengming Wang (7 papers)
  11. Ravichandra Addanki (3 papers)
  12. Tharindi Hapuarachchi (1 paper)
  13. Thomas Keck (10 papers)
  14. James Keeling (5 papers)
  15. Pushmeet Kohli (116 papers)
  16. Ira Ktena (14 papers)
  17. Yujia Li (54 papers)
  18. Oriol Vinyals (116 papers)
  19. Yori Zwols (10 papers)
Citations (219)