Generalized Zero-Shot Learning through Similarity Distribution Matching

Date
2022-02-18
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

Recent advances in supervised learning methods in vision, specifically deep learning frameworks, are primarily built on the abundance of labeled images. However, image labeling is a laborious task, therefore many visual categories are unlabeled or even unavailable. Zero-Shot Learning (ZSL) deals with classifying unseen visual categories that have no samples during training phase. In particular, ZSL is a classification task where some classes referred to as unseen classes have no training images. Instead, we only have side information about seen and unseen classes, often in the form of semantic or descriptive attributes. Lack of training images from a set of classes restricts the use of standard classification techniques and losses, including the widespread cross-entropy loss. We introduce a novel Similarity Distribution Matching Network (SDM-Net) which is a standard fully connected neural-network architecture with non-trainable penultimate layer consisting of class attributes. The output layer of SDM-Net consists of both seen and unseen classes. To enable zero-shot learning, during training, we regularize the model such that the predicted distribution of unseen class is close in KL divergence to the distribution of similarities between the correct seen class and all the unseen classes. We evaluate the proposed model on five benchmark datasets for zero-shot learning, AwA1, AwA2, aPY, SUN and CUB datasets. We show that, despite the simplicity, our approach achieves competitive performance with state-of-the-art methods in Generalized-ZSL setting for all of these datasets.

Description
Degree
Master of Science
Type
Thesis
Keywords
Generalized Zero-Shot Learning, Image Classification, Semantic Similarity, KL Divergence
Citation

Daghaghi, Shabnam. "Generalized Zero-Shot Learning through Similarity Distribution Matching." (2022) Master’s Thesis, Rice University. https://hdl.handle.net/1911/113538.

Has part(s)
Forms part of
Published Version
Rights
Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
Link to license
Citable link to this page