Probabilistic graph formalisms for meaning representations

Gilroy, Sorcha

Probabilistic graph formalisms for meaning representations

Simple item page

dc.contributor.advisor

Lopez, Adam

en

dc.contributor.advisor

Fancellu, Federico

en

dc.contributor.author

Gilroy, Sorcha

en

dc.contributor.sponsor

Engineering and Physical Sciences Research Council (EPSRC)

en

dc.date.accessioned

2019-03-26T11:50:07Z

dc.date.available

2019-03-26T11:50:07Z

dc.date.issued

2019-07-01

dc.description.abstract

In recent years, many datasets have become available that represent natural language semantics as graphs. To use these datasets in natural language processing (NLP), we require probabilistic models of graphs. Finite-state models have been very successful for NLP tasks on strings and trees because they are probabilistic and composable. Are there equivalent models for graphs? In this thesis, we survey several graph formalisms, focusing on whether they are probabilistic and composable, and we contribute several new results. In particular, we study the directed acyclic graph automata languages (DAGAL), the monadic second-order graph languages (MSOGL), and the hyperedge replacement languages (HRL). We prove that DAGAL cannot be made probabilistic, we explain why MSOGL also most likely cannot be made probabilistic, and we review the fact that HRL are not composable. We then review a subfamily of HRL and MSOGL: the regular graph languages (RGL; Courcelle 1991), which have not been widely studied, and particularly have not been studied in an NLP context. Although Courcelle (1991) only sketches a proof, we present a full, more NLP-accessible proof that RGL are a subfamily of MSOGL. We prove that RGL are probabilistic and composable, and we provide a novel Earley-style parsing algorithm for them that runs in time linear in the size of the input graph. We compare RGL to two other new formalisms: the restricted DAG languages (RDL; Bj¨orklund et al. 2016) and the tree-like languages (TLL; Matheja et al. 2015). We show that RGL and RDL are incomparable; TLL and RDL are incomparable; and either RGL are incomparable to TLL, or RGL are contained within TLL. This thesis provides a clearer picture of this field from an NLP perspective, and suggests new theoretical and empirical research directions.

en

dc.identifier.uri

http://hdl.handle.net/1842/35606

dc.language.iso

en

dc.publisher

The University of Edinburgh

en

dc.relation.hasversion

Gilroy, S., A. Lopez, and S. Maneth 2017a. Parsing graphs with regular graph grammars. In Proceedings of the Sixth Joint Conference on Lexical and Computational Semantics, *SEM@ACL 2017, Vancouver, Canada, 3-4 August 2017.

en

dc.relation.hasversion

Gilroy, S., A. Lopez, S. Maneth, and P. Simonaitis 2017b. (Re)introducing regular graph languages. In Proceedings of the 15th Meeting on the Mathematics of Language (MoL 15), London, United Kingdom. Association for Computational Linguistics.

en

dc.relation.hasversion

Vasiljeva, I., S. Gilroy, and A. Lopez 2018. The problem with probabilistic dag automata for semantic graphs.

en

dc.subject

graph formalisms

en

dc.subject

directed acyclic graph automata languages

en

dc.subject

DAGAL

en

dc.subject

monadic second-order graph languages

en

dc.subject

MSOGL

en

dc.subject

hyperedge replacement languages

en

dc.subject

HRL

en

dc.title

Probabilistic graph formalisms for meaning representations

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

Doctoral

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Gilroy2019.pdf
Size:: 911.09 KB
Format:: Adobe Portable Document Format

Download

This item appears in the following Collection(s)

Informatics thesis and dissertation collection