AWE: Attention Word Embedding

Date
2020-09-02
Abstract

Word embedding models learn semantically rich vector representations of words and are widely used to initialize natural language processing (NLP) models. The popular continuous bag-of-words (CBOW) model of word2vec learns a vector embedding by masking a given word in a sentence and then using the other words as context to predict it. A limitation of CBOW is that it weights all context words equally when making a prediction, which is inefficient, since some context words carry more predictive value than others. We address this inefficiency by introducing the Attention Word Embedding (AWE) model, which integrates the attention mechanism into the CBOW model. We also propose AWE-S, a variant that incorporates subword information. We demonstrate that AWE and AWE-S outperform state-of-the-art word embedding models both on a variety of word-similarity datasets and when used to initialize NLP models.
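The contrast between CBOW's uniform context averaging and the attention-weighted context that AWE builds on can be illustrated with a short Python sketch. This is a minimal illustration of the general idea only, assuming per-word key vectors, a single query vector for the masked position, and softmax-normalized dot-product scores; the function names and dimensions are hypothetical, and this is not the exact AWE formulation developed in the thesis.

import numpy as np

def cbow_context(context_vecs):
    # CBOW-style context: every context word contributes with equal weight.
    return np.mean(context_vecs, axis=0)

def attention_context(context_keys, context_vecs, query):
    # Attention-weighted context (sketch of the AWE idea): context words whose
    # keys score higher against the query contribute more to the prediction.
    scores = context_keys @ query              # one score per context word
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                   # softmax over the context words
    return weights @ context_vecs              # weighted sum of context vectors

# Toy usage: 4 context words with 8-dimensional embeddings (all values hypothetical).
rng = np.random.default_rng(0)
keys = rng.normal(size=(4, 8))
vecs = rng.normal(size=(4, 8))
query = rng.normal(size=8)
print(cbow_context(vecs).shape)                      # (8,)
print(attention_context(keys, vecs, query).shape)    # (8,)

In an actual training setup, the resulting context vector would be fed into a softmax (or negative-sampling) objective to predict the masked word, and the embedding, key, and query parameters would be learned jointly.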

Degree
Master of Science
Type
Thesis
Keywords
Natural Language Processing, Machine Learning, Word Embeddings
Citation

Sonkar, Shashank. "AWE: Attention Word Embedding." (2020) Master’s Thesis, Rice University. https://hdl.handle.net/1911/109309.

Rights
Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.