Probabilistic graph formalisms for meaning representations
dc.contributor.advisor
Lopez, Adam
en
dc.contributor.advisor
Fancellu, Federico
en
dc.contributor.author
Gilroy, Sorcha
en
dc.contributor.sponsor
Engineering and Physical Sciences Research Council (EPSRC)
en
dc.date.accessioned
2019-03-26T11:50:07Z
dc.date.available
2019-03-26T11:50:07Z
dc.date.issued
2019-07-01
dc.description.abstract
In recent years, many datasets have become available that represent natural language
semantics as graphs. To use these datasets in natural language processing (NLP), we
require probabilistic models of graphs. Finite-state models have been very successful
for NLP tasks on strings and trees because they are probabilistic and composable. Are
there equivalent models for graphs? In this thesis, we survey several graph formalisms,
focusing on whether they are probabilistic and composable, and we contribute several
new results. In particular, we study the directed acyclic graph automata languages
(DAGAL), the monadic second-order graph languages (MSOGL), and the hyperedge
replacement languages (HRL). We prove that DAGAL cannot be made probabilistic,
we explain why MSOGL also most likely cannot be made probabilistic, and we review
the fact that HRL are not composable. We then review a subfamily of HRL and
MSOGL: the regular graph languages (RGL; Courcelle 1991), which have not been
widely studied, and particularly have not been studied in an NLP context. Although
Courcelle (1991) only sketches a proof, we present a full, more NLP-accessible proof
that RGL are a subfamily of MSOGL. We prove that RGL are probabilistic and composable,
and we provide a novel Earley-style parsing algorithm for them that runs in
time linear in the size of the input graph. We compare RGL to two other new formalisms:
the restricted DAG languages (RDL; Bj¨orklund et al. 2016) and the tree-like
languages (TLL; Matheja et al. 2015). We show that RGL and RDL are incomparable;
TLL and RDL are incomparable; and either RGL are incomparable to TLL, or RGL
are contained within TLL. This thesis provides a clearer picture of this field from an
NLP perspective, and suggests new theoretical and empirical research directions.
en
dc.identifier.uri
http://hdl.handle.net/1842/35606
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Gilroy, S., A. Lopez, and S. Maneth 2017a. Parsing graphs with regular graph grammars. In Proceedings of the Sixth Joint Conference on Lexical and Computational Semantics, *SEM@ACL 2017, Vancouver, Canada, 3-4 August 2017.
en
dc.relation.hasversion
Gilroy, S., A. Lopez, S. Maneth, and P. Simonaitis 2017b. (Re)introducing regular graph languages. In Proceedings of the 15th Meeting on the Mathematics of Language (MoL 15), London, United Kingdom. Association for Computational Linguistics.
en
dc.relation.hasversion
Vasiljeva, I., S. Gilroy, and A. Lopez 2018. The problem with probabilistic dag automata for semantic graphs.
en
dc.subject
graph formalisms
en
dc.subject
directed acyclic graph automata languages
en
dc.subject
DAGAL
en
dc.subject
monadic second-order graph languages
en
dc.subject
MSOGL
en
dc.subject
hyperedge replacement languages
en
dc.subject
HRL
en
dc.title
Probabilistic graph formalisms for meaning representations
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Gilroy2019.pdf
- Size:
- 911.09 KB
- Format:
- Adobe Portable Document Format
This item appears in the following Collection(s)

