Scott, David W.2009-06-042009-06-041998Salch, John David. "Practical methods for data mining with massive data sets." (1998) Diss., Rice University. <a href="https://hdl.handle.net/1911/19307">https://hdl.handle.net/1911/19307</a>.https://hdl.handle.net/1911/19307The increasing size of data sets has necessitated advancement in exploratory techniques. Methods that are practical for moderate to small data sets become infeasible when applied to massive data sets. Advanced techniques such as binned kernel density estimation, tours, and mode-based projection pursuit will be explored. Mean-centered binning will be introduced as an improved method for binned density estimation. The density grand tour will be demonstrated as a means of exploring massive high-dimensional data sets. Projection pursuit by clustering components will be described as a means to find interesting lower-dimensional subspaces of data sets.115 p.application/pdfengCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.StatisticsPractical methods for data mining with massive data setsThesisThesis Stat. 1998 Salch