Convergence of K-indicators Clustering with Alternating Projection Algorithms

dc.contributor.advisorZhang, Yinen_US
dc.contributor.committeeMemberSchaefer, Andrew J.en_US
dc.contributor.committeeMemberHand, Paul Een_US
dc.creatorYang, Yuchenen_US
dc.date.accessioned2019-05-16T20:10:54Zen_US
dc.date.available2019-05-16T20:10:54Zen_US
dc.date.created2017-12en_US
dc.date.issued2017-11-21en_US
dc.date.submittedDecember 2017en_US
dc.date.updated2019-05-16T20:10:54Zen_US
dc.description.abstractData clustering is a fundamental unsupervised machine learning problem, and the most widely used method of data clustering over the decades is k-means. Recently, a newly proposed algorithm called KindAP, based on the idea of subspace matching and a semi-convex relaxation scheme, outperforms k-means in many aspects, such as no random replication and insensitivity to initialization. Unlike k-means, empirical evidence suggests that KindAP can correctly identify well-separated globular clusters even when the number of clusters is large, but a rigorous theoretical analysis is necessary. This study improves the algorithm design and establishes the first-step theory for KindAP. KindAP is actually a two-layered alternating projection procedure applied to two non-convex sets. The inner loop solves an intermediate model via a semi-convex relaxation scheme that relaxes one more complicated non-convex set while keeping the other intact. We first derive a convergence result for this inner loop. Then under the “ideal data” assumption where n data points are exactly located at k positions, we prove that KindAP converges globally to the global minimum with the help of outer loop. Further work is ongoing to extend this analysis from the ideal data case to more general cases.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationYang, Yuchen. "Convergence of K-indicators Clustering with Alternating Projection Algorithms." (2017) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/105482">https://hdl.handle.net/1911/105482</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/105482en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectData clusteringen_US
dc.subjectAlternating Projectionen_US
dc.subjecten_US
dc.titleConvergence of K-indicators Clustering with Alternating Projection Algorithmsen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputational and Applied Mathematicsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Artsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
YANG-DOCUMENT-2017.pdf
Size:
3.52 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: