Impacts of an Adversary Attacking Filter-Based Feature Selection Algorithms
Author
Gupta, SrishtiIssue Date
2021Keywords
Adversarial Machine LearningArtificial Intelligence
Feature Selection
Information-Theory
Security
Advisor
Ditzler, Gregory
Metadata
Show full item recordPublisher
The University of Arizona.Rights
Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction, presentation (such as public display or performance) of protected items is prohibited except with permission of the author.Abstract
Applying complex mathematical calculations to big data, extracting insightful information, adapting new data independently, and providing scalable solutions have attracted various industries including healthcare, financial, computer-vision, cyber-security, automation, etc. The ubiquitous use of Machine Learning (ML) has become almost ordinary. ML has not only lured businesses but has also interested the archenemies of society. Due to the multi-faceted applications of ML, practicing ML with malicious intent can cause severe deleterious effects on individuals, society, organizations, and the environment. With the recent spread of awareness for the ethical use of ML, we are centuries away from its noble only applications. Adversarial Machine Learning (AML) is a branch of ML that aims to make ML models robust and secure against adversaries. Since most of the feature selection and ML algorithms in a data science pipeline were developed in an adversary-unaware environment, studies have shown that these algorithms are vulnerable to attacks and can be easily compromised in the presence of an intelligent adversary. In the last decade, a tremendous amount of work has been done to develop carefully crafted attacks that can subvert the predictions of state-of-the-art ML models along with their suitable countermeasures. However, majority of these works are limited to the robustness of classifiers and their secure predictions. Unfortunately, with an intent to wreck an ML model, the adversary can seed an attack anywhere in a data science pipeline. Adversarial Feature Selection (AFS) is a novel sub-field of AML that intends to make feature selection algorithms guarded against adversaries. The study of AFS is ever more important due to the nature of damage an attack can do at the feature selection stage. For example, if an intelligently crafted adversarial input perturbation has been planted in the raw data right before the feature selection, a feature selector ends up selecting wrong features for training and testing data which may lead to bad learning of a classifier. The faulty classifier may even predict apparently well on testing data giving a false sense of legitimacy. Therefore, it is equally important to protect feature selectors as classifiers. Due to the novelty of the field, we do not have many targeted attacks for feature selection algorithms. In most literature, attacks studied in AML only refers to classifier attacks to the point it is used interchangeably. In this work, we demarcate classifier attacks from feature selection attacks, both being a sub-field of AML. The main focus of this thesis is to study the transferability of existing feature selection and classifier attacks on filter feature selection algorithms. The motivation of this study is to understand the behavior of filter algorithms in an adversary-aware environment even when the attacks are not directed towards the filter methods. Filter algorithms are studied because of their widespread usage due to their computationally inexpensive and classifier-independent characteristics. First, we show that feature selection attack designed for LASSO is transferable to filter algorithms. Then, we expand the study and show that classifier attacks are also transferable to filter feature selection algorithms, even when these attacks are not originally crafted for feature selection stage. The degree of impact varied among different feature selectors.Type
textElectronic Thesis
Degree Name
M.S.Degree Level
mastersDegree Program
Graduate CollegeElectrical & Computer Engineering
