On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior

dc.contributor.advisorRojo, Javieren_US
dc.creatorOtt, Richard Charlesen_US
dc.date.accessioned2009-06-04T08:44:40Zen_US
dc.date.available2009-06-04T08:44:40Zen_US
dc.date.issued2005en_US
dc.description.abstractNew methods for classifying tails of probability distributions based on data are proposed. Some methods apply the nonparametric theories of Rojo [35] and Schuster [36] and differ from classical extreme value theory and other well established methods. All the methods implement the extreme spacing of the data, the difference of the largest and second largest values. The results are then compared based on power properties to the classical technique of a Points Over Threshold model based on the Generalized Pareto Distribution (GPD). The following topics are the foundation of this thesis: Chapter 1. Review of classical extreme value theory and discussion on the class of medium-tailed distributions. Chapter 2. Review of the tail classification schemes of Parzen, Schuster, and Rojo upon which the latter two suggest the usage of the Extreme Spacing (ES) as a possible classifying instrument. Additional subcategorizations are also provided for the schemes of Schuster and Rojo. Chapter 3. Review of estimation methods for the Points Over Threshold GPD parameters for classification purposes. A Monte Carlo study classifying tails of many common distributions using the GPD by way of maximum likelihood is also provided. Chapter 4. Three classification tests based on the ES are provided. The first is a test to decide whether a sample originates from a completely specified distribution such as Exp(1). The second classifies whether data originated from an exponential distribution with unknown parameter. The third classifies an underlying distribution as short-, medium-, or long-tailed. Also discussed, is the potential benefit of blocking the data before applying the above mentioned tests. Chapter 5. Classifying specific data sets by way of the new methods. Some of the new ES methods may be applicable to the data when classical methods are inapplicable, for example when the GPD maximum likelihood numerical algorithm does not converge to yield a shape parameter estimate or when the variance of the shape parameter cannot be estimated since the parameter estimate is close to a parameter space endpoint. Even when classical methods are applicable, these tests can give a more thorough understanding of the tail behavior of the underlying distribution.en_US
dc.format.extent135 p.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.callnoTHESIS STAT. 2005 OTTen_US
dc.identifier.citationOtt, Richard Charles. "On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior." (2005) Diss., Rice University. <a href="https://hdl.handle.net/1911/18792">https://hdl.handle.net/1911/18792</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/18792en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectStatisticsen_US
dc.titleOn the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavioren_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentStatisticsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3168115.PDF
Size:
8.51 MB
Format:
Adobe Portable Document Format