Generalized Zero-Shot Learning through Similarity Distribution Matching
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Recent advances in supervised learning methods in vision, specifically deep learning frameworks, are primarily built on the abundance of labeled images. However, image labeling is a laborious task, therefore many visual categories are unlabeled or even unavailable. Zero-Shot Learning (ZSL) deals with classifying unseen visual categories that have no samples during training phase. In particular, ZSL is a classification task where some classes referred to as unseen classes have no training images. Instead, we only have side information about seen and unseen classes, often in the form of semantic or descriptive attributes. Lack of training images from a set of classes restricts the use of standard classification techniques and losses, including the widespread cross-entropy loss. We introduce a novel Similarity Distribution Matching Network (SDM-Net) which is a standard fully connected neural-network architecture with non-trainable penultimate layer consisting of class attributes. The output layer of SDM-Net consists of both seen and unseen classes. To enable zero-shot learning, during training, we regularize the model such that the predicted distribution of unseen class is close in KL divergence to the distribution of similarities between the correct seen class and all the unseen classes. We evaluate the proposed model on five benchmark datasets for zero-shot learning, AwA1, AwA2, aPY, SUN and CUB datasets. We show that, despite the simplicity, our approach achieves competitive performance with state-of-the-art methods in Generalized-ZSL setting for all of these datasets.
Description
Advisor
Degree
Type
Keywords
Citation
Daghaghi, Shabnam. "Generalized Zero-Shot Learning through Similarity Distribution Matching." (2022) Master’s Thesis, Rice University. https://hdl.handle.net/1911/113538.