Mixed Integer Linear Optimization Formulations for Learning Optimal Binary Classification Trees

dc.contributor.advisorHicks, Illya V.en_US
dc.creatorAlston, Brandonen_US
dc.date.accessioned2022-10-11T19:20:34Zen_US
dc.date.available2022-10-11T19:20:34Zen_US
dc.date.created2021-08en_US
dc.date.issued2021-11-10en_US
dc.date.submittedAugust 2021en_US
dc.date.updated2022-10-11T19:20:34Zen_US
dc.description.abstractDecision trees are powerful tools for classification and regression that attract many researchers working in the burgeoning area of machine learning. A classification decision tree has two types of vertices: (i) branching vertices at which datapoints are tested on a selection of discrete features, and (ii) leaf vertices at which datapoints are assigned classes. An optimal binary classification tree is a special type of classification tree in which each branching vertex has exactly two children and can be obtained by solving a biobjective mixed integer linear optimization problem that seeks to minimize the (i) number of misclassified datapoints and (ii) number of branching vertices. In this thesis we present two new multicommodity flow formulations and a new cut-based formulation to learn such optimal binary classification trees. We then provide a comparison of the formulations' strength, valid inequalities to strengthen all formulations, and accompanying computational results.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationAlston, Brandon. "Mixed Integer Linear Optimization Formulations for Learning Optimal Binary Classification Trees." (2021) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/113687">https://hdl.handle.net/1911/113687</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/113687en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectMILOen_US
dc.subjectclassificationen_US
dc.subjectdecision treesen_US
dc.subjectmixed integer programmingen_US
dc.subjectmachine learningen_US
dc.titleMixed Integer Linear Optimization Formulations for Learning Optimal Binary Classification Treesen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputational and Applied Mathematicsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Artsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ALSTON-DOCUMENT-2021.pdf
Size:
7.14 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: