Sparse Factor Analysis for Learning and Content Analytics

dc.contributor.advisorBaraniuk, Richard G.
dc.contributor.committeeMemberVeeraraghavan, Ashok
dc.contributor.committeeMemberAllen, Genevera
dc.creatorLan, Shiting
dc.date.accessioned2014-09-22T19:55:24Z
dc.date.available2014-09-22T19:55:24Z
dc.date.created2014-05
dc.date.issued2014-04-23
dc.date.submittedMay 2014
dc.date.updated2014-09-22T19:55:24Z
dc.description.abstractWe develop a new model and algorithms for machine learning-based learning analytics, which estimate a learner’s knowledge of the concepts underlying a domain, and content analytics, which estimate the relationships among a collection of questions and those concepts. Our model represents the probability that a learner provides the correct response to a question in terms of three factors: their understanding of a set of underlying concepts, the concepts involved in each question, and each question’s intrinsic difficulty. We estimate these factors given the graded responses to a collection of questions. The underlying estimation problem is ill-posed in general, especially when only a subset of the questions are answered. The key observation that enables a well-posed solution is the fact that typical educational domains of interest involve only a small number of key concepts. Leveraging this observation, we develop a bi-convex maximum-likelihood solution to the resulting SPARse Factor Analysis (SPARFA) problem. We also incorporate instructor-defined tags on questions and question text to facilitate the interpretability of the estimated factors. Experiments with synthetic and real-world data demonstrate the efficacy of our approach.
dc.format.mimetypeapplication/pdf
dc.identifier.citationLan, Shiting. "Sparse Factor Analysis for Learning and Content Analytics." (2014) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/77215">https://hdl.handle.net/1911/77215</a>.
dc.identifier.urihttps://hdl.handle.net/1911/77215
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectFactor analysis
dc.subjectSparse probit regression
dc.subjectSparse logistic regression
dc.subjectBayesian latent factor analysis
dc.subjectPersonalized learning
dc.titleSparse Factor Analysis for Learning and Content Analytics
dc.typeThesis
dc.type.materialText
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LAN-DOCUMENT-2014.pdf
Size:
1.08 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description: