🔎 Research Insights:
Optimization meets machine learning for process systems engineering - another successful interdisciplinary collaboration by SmartProSys researchers!
Researchers lead by Caroline Ganzer and Sebastian Sager from the Max Planck Institute for Dynamics of Complex Technical Systems and the Faculty of Mathematics at Otto-von-Guericke-Universität Magdeburg have worked together to improve optimization over embedded neural networks.
📖 Read the preprint: https://arxiv.org/abs/2502.03016
An analysis of optimization problems involving ReLU neural networks
Solving mixed-integer optimization problems with embedded neural networks with ReLU activation functions is challenging. Big-M coefficients that arise in relaxing binary decisions related to these functions grow exponentially with the number of layers. We survey and propose different approaches to analyze and improve the run time behavior of mixed-integer programming solvers in this context. Among them are clipped variants and regularization techniques applied during training as well as optimization-based bound tightening and a novel scaling for given ReLU networks. We numerically compare these approaches for three benchmark problems from the literature. We use the number of linear regions, the percentage of stable neurons, and overall computational effort as indicators. As a major takeaway we observe and quantify a trade-off between the often desired redundancy of neural network models versus the computational costs for solving related optimization problems.










