Show simple item record

dc.contributor.advisorZhang, Hao H.en
dc.contributor.authorZeng, Yue
dc.creatorZeng, Yueen
dc.date.accessioned2017-06-30T17:13:32Z
dc.date.available2017-06-30T17:13:32Z
dc.date.issued2017
dc.identifier.urihttp://hdl.handle.net/10150/624579
dc.description.abstractVariable screening techniques are fast and crude techniques to scan high-dimensional data and conduct dimension reduction before a refined variable selection method is applied. Its marginal analysis feature makes the method computationally feasible for ultra-high dimensional problems. However, most existing screening methods for classification problems are designed only for binary classification problems. There is lack of a comprehensive study on variable screening for multi-class classification problems. This research aims to fill the gap by developing variable screening for multi-class problems, to meet the need of high dimensional classification. The work has useful applications in cancer study, medicine, engineering and biology. In this research, we propose and investigate new and effective screening methods for multi-class classification problems. We consider two types of screening methods. The first one conducts screening for multiple binary classification problems separately and then aggregates the selected variables. The second one conducts screening for multi-class classification problems directly. In particular, for each method we investigate important issues such as choices of classification algorithms, variable ranking, and model size determination. We implement various selection criteria and compare their performance. We conduct extensive simulation studies to evaluate and compare the proposed screening methods with existing ones, which show that the new methods are promising. Furthermore, we apply the proposed methods to four cancer studies. R code has been developed for each method.
dc.language.isoen_USen
dc.publisherThe University of Arizona.en
dc.rightsCopyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author.en
dc.subjectClassificationen
dc.subjectHigh Dimensionen
dc.subjectVariable Screeningen
dc.titleVariable Screening Methods in Multi-Category Problems for Ultra-High Dimensional Dataen_US
dc.typetexten
dc.typeElectronic Dissertationen
thesis.degree.grantorUniversity of Arizonaen
thesis.degree.leveldoctoralen
dc.contributor.committeememberZhang, Hao H.en
dc.contributor.committeememberHao, Ningen
dc.contributor.committeememberHu, Chengchengen
dc.contributor.committeememberWoutersen, Tiemenen
thesis.degree.disciplineGraduate Collegeen
thesis.degree.disciplineStatisticsen
thesis.degree.namePh.D.en
refterms.dateFOA2018-09-11T21:01:46Z
html.description.abstractVariable screening techniques are fast and crude techniques to scan high-dimensional data and conduct dimension reduction before a refined variable selection method is applied. Its marginal analysis feature makes the method computationally feasible for ultra-high dimensional problems. However, most existing screening methods for classification problems are designed only for binary classification problems. There is lack of a comprehensive study on variable screening for multi-class classification problems. This research aims to fill the gap by developing variable screening for multi-class problems, to meet the need of high dimensional classification. The work has useful applications in cancer study, medicine, engineering and biology. In this research, we propose and investigate new and effective screening methods for multi-class classification problems. We consider two types of screening methods. The first one conducts screening for multiple binary classification problems separately and then aggregates the selected variables. The second one conducts screening for multi-class classification problems directly. In particular, for each method we investigate important issues such as choices of classification algorithms, variable ranking, and model size determination. We implement various selection criteria and compare their performance. We conduct extensive simulation studies to evaluate and compare the proposed screening methods with existing ones, which show that the new methods are promising. Furthermore, we apply the proposed methods to four cancer studies. R code has been developed for each method.


Files in this item

Thumbnail
Name:
azu_etd_15454_sip1_m.pdf
Size:
728.7Kb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record