Shrivastava, Anshumali2022-10-052022-10-052022-052022-02-18May 2022Daghaghi, Shabnam. "Generalized Zero-Shot Learning through Similarity Distribution Matching." (2022) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/113538">https://hdl.handle.net/1911/113538</a>.https://hdl.handle.net/1911/113538Recent advances in supervised learning methods in vision, specifically deep learning frameworks, are primarily built on the abundance of labeled images. However, image labeling is a laborious task, therefore many visual categories are unlabeled or even unavailable. Zero-Shot Learning (ZSL) deals with classifying unseen visual categories that have no samples during training phase. In particular, ZSL is a classification task where some classes referred to as unseen classes have no training images. Instead, we only have side information about seen and unseen classes, often in the form of semantic or descriptive attributes. Lack of training images from a set of classes restricts the use of standard classification techniques and losses, including the widespread cross-entropy loss. We introduce a novel Similarity Distribution Matching Network (SDM-Net) which is a standard fully connected neural-network architecture with non-trainable penultimate layer consisting of class attributes. The output layer of SDM-Net consists of both seen and unseen classes. To enable zero-shot learning, during training, we regularize the model such that the predicted distribution of unseen class is close in KL divergence to the distribution of similarities between the correct seen class and all the unseen classes. We evaluate the proposed model on five benchmark datasets for zero-shot learning, AwA1, AwA2, aPY, SUN and CUB datasets. We show that, despite the simplicity, our approach achieves competitive performance with state-of-the-art methods in Generalized-ZSL setting for all of these datasets.application/pdfengCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.Generalized Zero-Shot LearningImage ClassificationSemantic SimilarityKL DivergenceGeneralized Zero-Shot Learning through Similarity Distribution MatchingThesis2022-10-05