Graph-based broad-coverage semantic parsing
dc.contributor.advisor
Titov, Ivan
dc.contributor.advisor
Cohen, Shay
dc.contributor.author
Lyu, Chunchuan
dc.date.accessioned
2021-10-06T10:44:03Z
dc.date.available
2021-10-06T10:44:03Z
dc.date.issued
2021-07-31
dc.description.abstract
Many broad-coverage meaning representations can be characterized as directed graphs,
where nodes represent semantic concepts and directed edges represent semantic relations among the concepts. The task of semantic parsing is to generate such a meaning
representation from a sentence. It is quite natural to adopt a graph-based approach for
parsing, where nodes are identified conditioning on the individual words, and edges
are labeled conditioning on the pairs of nodes. However, there are two issues with
applying this simple and interpretable graph-based approach for semantic parsing:
first, the anchoring of nodes to words can be implicit and non-injective in several
formalisms (Oepen et al., 2019, 2020). This means we do not know which nodes
should be generated from which individual word and how many of them. Consequently, it makes a probabilistic formulation of the training objective problematical;
second, graph-based parsers typically predict edge labels independent from each other.
Such an independence assumption, while being sensible from an algorithmic point of
view, could limit the expressiveness of statistical modeling. Consequently, it might fail
to capture the true distribution of semantic graphs.
In this thesis, instead of a pipeline approach to obtain the anchoring, we propose to
model the implicit anchoring as a latent variable in a probabilistic model. We induce
such a latent variable jointly with the graph-based parser in an end-to-end differentiable training. In particular, we test our method on Abstract Meaning Representation
(AMR) parsing (Banarescu et al., 2013). AMR represents sentence meaning with a
directed acyclic graph, where the anchoring of nodes to words is implicit and could be
many-to-one. Initially, we propose a rule-based system that circumvents the many-to-one anchoring by combing nodes in some pre-specified subgraphs in AMR and treats
the alignment as a latent variable. Next, we remove the need for such a rule-based system by treating both graph segmentation and alignment as latent variables. Still, our
graph-based parsers are parameterized by neural modules that require gradient-based
optimization. Consequently, training graph-based parsers with our discrete latent variables can be challenging. By combing deep variational inference and differentiable
sampling, our models can be trained end-to-end. To overcome the limitation of graph-based parsing and capture interdependency in the output, we further adopt iterative
refinement. Starting with an output whose parts are independently predicted, we iteratively refine it conditioning on the previous prediction. We test this method on
semantic role labeling (Gildea and Jurafsky, 2000). Semantic role labeling is the task
of predicting the predicate-argument structure. In particular, semantic roles between
the predicate and its arguments need to be labeled, and those semantic roles are interdependent. Overall, our refinement strategy results in an effective model, outperforming
strong factorized baseline models.
en
dc.identifier.uri
https://hdl.handle.net/1842/38121
dc.identifier.uri
http://dx.doi.org/10.7488/era/1390
dc.language.iso
en
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Lyu, Chunchuan, Shay B. Cohen, and Ivan Titov. 2019. Semantic role labeling with iterative structure refinement. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Confer ence on Natural Language Processing (EMNLP-IJCNLP), pages 1071–1082, Hong Kong, China. Association for Computational Linguistics
en
dc.relation.hasversion
Lyu, Chunchuan, Shay B. Cohen, and Ivan Titov. 2020. A differentiable relaxation of graph segmentation and alignment for amr parsing. ArXiv, abs/2010.12676.
en
dc.relation.hasversion
Lyu, Chunchuan and Ivan Titov. 2018. AMR parsing as graph prediction with la tent alignment. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 397–407, Melbourne, Australia. Association for Computational Linguistics.
en
dc.subject
semantic parsing
en
dc.subject
graph-based parsers
en
dc.subject
hand-crafted pipelines
en
dc.subject
semantic role labeling
en
dc.subject
Abstract Meaning Representation parsing
en
dc.subject
AMR parsing
en
dc.title
Graph-based broad-coverage semantic parsing
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Lyu2021.pdf
- Size:
- 1.34 MB
- Format:
- Adobe Portable Document Format
- Description:
This item appears in the following Collection(s)

