Show simple item record

dc.contributor.advisorRychlik, Marek
dc.contributor.authorCoatney, Ryan Dean
dc.creatorCoatney, Ryan Dean
dc.date.accessioned2020-08-06T20:33:11Z
dc.date.available2020-08-06T20:33:11Z
dc.date.issued2020
dc.identifier.urihttp://hdl.handle.net/10150/642087
dc.description.abstractClustering algorithms are an important part of modern data analysis. The K-means and EM clustering algorithm both use an iterative process to find latent (or hidden) variables in a mixture distribution. These hidden variables may be interpreted as class label for the data points of a sample. In connection with these algorithms, I consider a family of nonlinear mappings called responsibility maps. The responsibility map is obtained as a gradient of the log likelihood of N independent samples, drawn from a mixture of K distributions. I look at the discrete dynamics of this family of maps and give a proof that iteration of responsibility converges to an estimate of the mixing coefficients. I also show that the convergence is consistent in the sense that the fixed point acts as a maximizer of the log likelihood. I call the process of determining class weight by iteration dynamic responsibility and show that it converges to a unique set of weights under mild assumptions. Dynamic responsibility (DR) is inspired by the expectation step of the expectation maximization (EM) algorithm and has a useful association with Bayesian methods. Like EM, dynamic responsibility is an iterative algorithm, but DR will converge to a unique maximum under reasonable conditions. The weights determined by DR can also be found using gradient descent but DR guarantees non-negative weights and gradient descent does not. I present a new algorithm which I call responsible softmax for doing classification with neural networks. This algorithm is intended to handle imbalanced training sets and is accomplished via multiplication by per class weights. These weights may be interpreted as class probabilities for a generalized mixture model, and are determined through DR rather than by empirical observation of the training set and heuristically selecting the underlying probability distributions. I compare the performance of responsible softmax with other standard techniques, including standard softmax, and weighted softmax using empirical class probabilities. I use generated Gaussian mixture model data and the MNIST data set for proof of concept. I show that in general, responsible softmax produces more useful classifiers than softmax when presented with imbalanced training data. It will also be seen that responsible softmax approximates the performance of empirically weighted softmax, and in some cases may do better.
dc.language.isoen
dc.publisherThe University of Arizona.
dc.rightsCopyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction, presentation (such as public display or performance) of protected items is prohibited except with permission of the author.
dc.subjectDeep Learning
dc.subjectDynamical Systems
dc.subjectExpectation Maximization
dc.subjectK-means
dc.subjectNeural Networks
dc.subjectResponsible Softmax
dc.titleA Responsible Softmax Layer in Deep Learning
dc.typetext
dc.typeElectronic Dissertation
thesis.degree.grantorUniversity of Arizona
thesis.degree.leveldoctoral
dc.contributor.committeememberMaier, Robert S.
dc.contributor.committeememberGlickenstein, David A.
dc.contributor.committeememberMorrison, Clayton T.
thesis.degree.disciplineGraduate College
thesis.degree.disciplineMathematics
thesis.degree.namePh.D.
refterms.dateFOA2020-08-06T20:33:11Z


Files in this item

Thumbnail
Name:
azu_etd_18153_sip1_m.pdf
Size:
1.834Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record