Multilevel classification: Classification of populations from measurements on members

dc.contributor.advisorCox, Dennis D.en_US
dc.creatorYamal, Jose-Miguelen_US
dc.date.accessioned2009-06-03T21:06:48Zen_US
dc.date.available2009-06-03T21:06:48Zen_US
dc.date.issued2007en_US
dc.description.abstractMultilevel classification is a problem in statistics which has gained increasing importance in many real-world problems, but it has not yet received the same statistical understanding as the general problem of classification. An example we consider here is to develop a method to detect cervical neoplasia (pre-cancer) using quantitative cytology, which involves measurements on the cells obtained in a Papanicolou smear. The multilevel structure comes from the embedded cells within a patient, where we have quantitative measurements on the cells, yet we want to classify the patients, not the cells. An additional challenge comes from the fact that we have a high-dimensional feature vector of measurements on each cell. The problem has historically been approached in two ways: (a) ignore this multilevel structure of the data and perform classification at the microscopic (cellular) level, and then use ad-hoc methods to classify at the macroscopic (patient) level, or (b) summarize the microscopic level data using a few statistics and then use these to compare the subjects at the macroscopic level. We consider a more rigorous statistical approach, the Cumulative Log-Odds (CLO) Method, which models the posterior log-odds of disease for a patient given the cell-level measured feature vectors for that patient. Combining the CLO method with a latent variable model (Latent-Class CLO Method) helps to account for between-patient heterogeneity. We apply many different approaches and evaluate their performance using out of sample prediction. We find that our best methods classify with substantial greater accuracy than the subjective Papanicolou Smear interpretation by a clinical pathologist.en_US
dc.format.extent187 p.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.callnoTHESIS STAT. 2007 YAMALen_US
dc.identifier.citationYamal, Jose-Miguel. "Multilevel classification: Classification of populations from measurements on members." (2007) Diss., Rice University. <a href="https://hdl.handle.net/1911/20669">https://hdl.handle.net/1911/20669</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/20669en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectStatisticsen_US
dc.titleMultilevel classification: Classification of populations from measurements on membersen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentStatisticsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3256765.PDF
Size:
6.7 MB
Format:
Adobe Portable Document Format