OMLT: Optimization & Machine Learning Toolkit (2202.02414v2)

Published 4 Feb 2022 in stat.ML, cs.AI, cs.LG, and math.OC

Abstract: The optimization and machine learning toolkit (OMLT) is an open-source software package incorporating neural network and gradient-boosted tree surrogate models, which have been trained using machine learning, into larger optimization problems. We discuss the advances in optimization technology that made OMLT possible and show how OMLT seamlessly integrates with the algebraic modeling language Pyomo. We demonstrate how to use OMLT for solving decision-making problems in both computer science and engineering.

Citations (53)

View on Semantic Scholar

Summary

The paper demonstrates automated conversion of machine learning models into optimization formulations, reducing manual coding and errors.
It employs Pyomo with ONNX and Keras interfaces to seamlessly integrate neural networks and gradient-boosted trees into optimization frameworks.
Application cases include verifying neural network robustness and modeling engineering processes, highlighting broad research impact.

OMLT: Optimization & Machine Learning Toolkit

The paper, "OMLT: Optimization & Machine Learning Toolkit," presents an open-source software package that bridges trained neural network (NN) and gradient-boosted tree (GBT) models with optimization frameworks. The toolkit demonstrates how surrogate models trained via machine learning can be effectively integrated into larger optimization problems. This integration showcases applicability in diverse fields within both computer science and engineering.

Software Integration and Functionality

OMLT uses the algebraic modeling language Pyomo as its foundational structure. This integration is pivotal because Pyomo provides a flexible, Python-based platform for mathematical optimization, allowing researchers to employ various solver interfaces without manual coding for specific interactions. The toolkit supports GBTs via an ONNX interface and NNs via ONNX and Keras interfaces. This choice of interoperability is noteworthy as it leverages the strengths of various deep learning frameworks like TensorFlow and PyTorch, given ONNX's broad compatibility.

The core advantage of OMLT lies in its ability to automate the conversion of high-level machine learning models into optimization problem formulations, including decision variables, objectives, and constraints. This automation not only expedites the optimization modeling process but also reduces the likelihood of human errors during formulation changes. The toolkit supports multiple optimization formulations, allowing users to experiment with dense and convolutional layers embedded in neural networks.

Use Cases and Applications

The utility of OMLT is illustrated across computer science and engineering domains. In computer science, it can aid in verifying neural networks by formalizing verification problems into optimization tasks. The paper mentions applications like verifying adversarial robustness in neural networks as part of its computational use cases. In the field of engineering, it focuses on optimization problems such as the auto-thermal reforming process model, where NNs are used to create surrogate models that improve optimization accuracy or replace direct simulation models with more efficient equation-oriented representations.

Library Structure

OMLT's design showcases several distinct elements:

Input Interfaces: It primarily uses ONNX for input interfacing, enabling a broad range of machine learning models to be transformed into Pyomo constructs.
Optimization Formulations: These are built into Pyomo blocks, permitting users to focus on the high-level input-output functionality without exploring the internal complexities of NN or GBT embeddings.
Surrogate Model Formulations: By offering both full-space and reduced-space formulations, OMLT users can capture models as either constraints or objectives depending on the problem structure.

Research Implications and Prospective Developments

OMLT's development epitomizes the evolving interconnection between machine learning models and optimization pathways. It expands practical functionalities by enabling users to seamlessly switch between various optimization formulations, traditionally viewed as competitive approaches in the literature. This flexibility could foster broader applications across different domains where both machine learning and optimization are relevant.

Theoretical implications of OMLT involve its potential to streamline surrogate modeling processes in optimization, encouraging future investigations into the balance between model complexity and optimization efficiency. Practically, the toolkit could serve as a template for integrating emerging machine learning architectures with optimization frameworks, presenting a versatile solution to complex decision-making problems.

In summary, OMLT positions itself as a versatile toolkit, empowering researchers to leverage machine learning surrogates in optimization problems without the need to manually encode every formulation. As machine learning and optimization continually evolve, toolkits like OMLT will be instrumental in accommodating and exploring new frontiers in these converging fields.