Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks

Michalenko, Joshua James

Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks

dc.contributor.advisor	Patel, Ankit	en_US
dc.contributor.committeeMember	Baraniuk , Richard	en_US
dc.creator	Michalenko, Joshua James	en_US
dc.date.accessioned	2019-05-16T19:27:17Z	en_US
dc.date.available	2019-05-16T19:27:17Z	en_US
dc.date.created	2019-05	en_US
dc.date.issued	2019-04-19	en_US
dc.date.submitted	May 2019	en_US
dc.date.updated	2019-05-16T19:27:17Z	en_US
dc.description.abstract	We investigate the internal representations that a recurrent neural network (RNN) uses while learning to recognize a regular formal language. Specially, we train a RNN on positive and negative examples from a regular language, and ask if there is a simple decoding function that maps states of this RNN to states of the minimal deterministic fnite automaton (MDFA) for the language. Our experiments show that such a decoding function indeed exists, and that it maps states of the RNN not to MDFA states, but to states of an abstraction obtained by clustering small sets of MDFA states into “superstates”. A framework for performing large scale systematic representation analysis between the two language models is discussed. Quantitative analysis surprisingly shows that linear decoding functions are suffcient for the task and an analysis of a range of abstraction functions is given. A qualitative analysis reveals new interpretations of how RNNs implement hierarchical priors during the language recognition task. Overall, the results suggest a strong structural relationship between internal representations used by RNNs and fnite automata, and explain the well-known ability of RNNs to recognize formal grammatical structure.	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Michalenko, Joshua James. "Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks." (2019) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/105421">https://hdl.handle.net/1911/105421</a>.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/105421	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	Language recognition	en_US
dc.subject	Recurrent Neural Networks	en_US
dc.subject	Representation Learning	en_US
dc.subject	deterministic finite automaton	en_US
dc.subject	automaton	en_US
dc.title	Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Electrical and Computer Engineering	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: MICHALENKO-DOCUMENT-2019.pdf
Size:: 4.2 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 5.85 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 2.61 KB
Format:: Plain Text
Description:

Download

Collections

Rice University Theses and Dissertations