Outlier Detection based on Robust Regression via Chance-Constrained Programming
| dc.contributor.advisor | Zhang, Hao Helen | |
| dc.contributor.author | Mohammadi Fathabad, Abolhassan | |
| dc.creator | Mohammadi Fathabad, Abolhassan | |
| dc.date.accessioned | 2022-01-27T02:05:30Z | |
| dc.date.available | 2022-01-27T02:05:30Z | |
| dc.date.issued | 2021 | |
| dc.identifier.citation | Mohammadi Fathabad, Abolhassan. (2021). Outlier Detection based on Robust Regression via Chance-Constrained Programming (Master's thesis, University of Arizona, Tucson, USA). | |
| dc.identifier.uri | http://hdl.handle.net/10150/663120 | |
| dc.description.abstract | Outlier detection is a critical step in data pre-processing to identify heterogeneous points in data. For high dimensional and extremely noisy data, many challenges are posed in outlier detection, including estimating the number of outliers, providing probabilistic confidence statement on identified outliers, fitting a model robust against outliers in the data set, and achieving high breakdown points with guarantee. In this paper, we propose a novel chance-constrained outlier detection (CCOD) model that not only finds a robust fit to the data set without guessing the proportion of outliers, but also automatically offers a diagnostic criteria (i.e., the relative outlying probabilities) to detect outliers with confidence. The main idea is to first model a probabilistic least quantile of squares (LQS) problem using chance-constrained optimization, then reformulate the problem using kernel density estimation. Since the resulting kernel-based LQS is nonlinear and nonconvex, we further propose a tractable convex approximation, the so-called CCOD model, and use its optimization to develop two outlier detection algorithms. Through numerical results, we show that our CCOD model outperforms the state-of-art LQS methodologies in terms of estimation accuracy, robustness, and computational time, and it provides robust fits to large-scale data that were otherwise intractable via other methodologies. | |
| dc.language.iso | en | |
| dc.publisher | The University of Arizona. | |
| dc.rights | Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction, presentation (such as public display or performance) of protected items is prohibited except with permission of the author. | |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
| dc.subject | Chance-constrained Programming | |
| dc.subject | Kernel density estimation | |
| dc.subject | Least quantile of squares | |
| dc.subject | Outlier Detection | |
| dc.subject | Robust Regression | |
| dc.title | Outlier Detection based on Robust Regression via Chance-Constrained Programming | |
| dc.type | text | |
| dc.type | Electronic Thesis | |
| thesis.degree.grantor | University of Arizona | |
| thesis.degree.level | masters | |
| dc.contributor.committeemember | Zhou, Qiang | |
| dc.contributor.committeemember | Cheng, Jianqiang | |
| thesis.degree.discipline | Graduate College | |
| thesis.degree.discipline | Statistics | |
| thesis.degree.name | M.S. | |
| dc.description.admin-note | Replaced with revised and approved PDF on 21-Apr-2022 per Graduate College and student request, Kimberly | |
| refterms.dateFOA | 2022-01-27T02:05:30Z |
