Mixed Integer Linear Optimization Formulations for Learning Optimal Binary Classification Trees

dc.contributor.advisorHicks, Illya V.
dc.creatorAlston, Brandon
dc.date.accessioned2022-10-11T19:20:34Z
dc.date.available2022-10-11T19:20:34Z
dc.date.created2021-08
dc.date.issued2021-11-10
dc.date.submittedAugust 2021
dc.date.updated2022-10-11T19:20:34Z
dc.description.abstractDecision trees are powerful tools for classification and regression that attract many researchers working in the burgeoning area of machine learning. A classification decision tree has two types of vertices: (i) branching vertices at which datapoints are tested on a selection of discrete features, and (ii) leaf vertices at which datapoints are assigned classes. An optimal binary classification tree is a special type of classification tree in which each branching vertex has exactly two children and can be obtained by solving a biobjective mixed integer linear optimization problem that seeks to minimize the (i) number of misclassified datapoints and (ii) number of branching vertices. In this thesis we present two new multicommodity flow formulations and a new cut-based formulation to learn such optimal binary classification trees. We then provide a comparison of the formulations' strength, valid inequalities to strengthen all formulations, and accompanying computational results.
dc.format.mimetypeapplication/pdf
dc.identifier.citationAlston, Brandon. "Mixed Integer Linear Optimization Formulations for Learning Optimal Binary Classification Trees." (2021) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/113687">https://hdl.handle.net/1911/113687</a>.
dc.identifier.urihttps://hdl.handle.net/1911/113687
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectMILO
dc.subjectclassification
dc.subjectdecision trees
dc.subjectmixed integer programming, machine learning
dc.titleMixed Integer Linear Optimization Formulations for Learning Optimal Binary Classification Trees
dc.typeThesis
dc.type.materialText
thesis.degree.departmentComputational and Applied Mathematics
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Arts
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ALSTON-DOCUMENT-2021.pdf
Size:
7.14 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: