Bridging the Gap between Operations Research and Machine Learning with Decision Trees and Neural Nets

Date
2024-04-19
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

This thesis focuses on bridging the overlap between the fields of Operations Research and Machine Learning. We do so by providing efficient Mixed Integer Linear Optimization (MILO) formulations that solve the optimal decision tree, both the univariate and multivariate cases, and binarized neural nets. We provide four MILO formulations for designing optimal binary classification trees: two flow-based formulations and two cut-based formulations. Given that an optimal binary can be obtained by solving a biobjective optimization problem that seeks to (i) maximize the number of correctly classified datapoints and (ii) minimize the number of branching vertices we also are the first to introduce using a biobjective approach that avoids the numerical issues associated with tuning hyperparameters of weighted objective functions. We use a unique fractional separation procedure to speed up our cut-based models given that MILO solvers often employ reductions only to flow-based models. A binarized neural net, which cannot be trained using gradient descent based backpropagation, can be implemented using Boolean operations and is fundamentally a discrete optimization problem. In particular the fixed network structures, discrete edge weights, and our definition of decision variables allow us to transfer some of the techniques used for decision trees into binarized neural nets. We efficiently model the non-linear properties of neural nets by choosing activation factions to be sign(X). We provide computational results of our proposed models against benchmark methods from the literature.

Description
Degree
Doctor of Philosophy
Type
Thesis
Keywords
machine learning, mixed integer programming, combinatorial optimization, graph theory, decision tree, neural network
Citation

Alston, Brandon. Bridging the Gap between Operations Research and Machine Learning with Decision Trees and Neural Nets. (2024). https://hdl.handle.net/1911/116142

Has part(s)
Forms part of
Published Version
Rights
Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
Link to license
Citable link to this page