Understanding and generating language with abstract meaning representation

Damonte, Marco

Understanding and generating language with abstract meaning representation

Simple item page

dc.contributor.advisor

Cohen, Shay

en

dc.contributor.advisor

Lopez, Adam

en

dc.contributor.author

Damonte, Marco

en

dc.date.accessioned

2020-02-03T10:52:01Z

dc.date.available

2020-02-03T10:52:01Z

dc.date.issued

2020-01-08

dc.description.abstract

Abstract Meaning Representation (AMR) is a semantic representation for natural language that encompasses annotations related to traditional tasks such as Named Entity Recognition (NER), Semantic Role Labeling (SRL), word sense disambiguation (WSD), and Coreference Resolution. AMR represents sentences as graphs, where nodes represent concepts and edges represent semantic relations between them. Sentences are represented as graphs and not trees because nodes can have multiple incoming edges, called reentrancies. This thesis investigates the impact of reentrancies for parsing (from text to AMR) and generation (from AMR to text). For the parsing task, we showed that it is possible to use techniques from tree parsing and adapt them to deal with reentrancies. To better analyze the quality of AMR parsers, we developed a set of fine-grained metrics and found that state-of-the-art parsers predict reentrancies poorly. Hence we provided a classification of linguistic phenomena causing reentrancies, categorized the type of errors parsers do with respect to reentrancies, and proved that correcting these errors can lead to significant improvements. For the generation task, we showed that neural encoders that have access to reentrancies outperform those who do not, demonstrating the importance of reentrancies also for generation. This thesis also discusses the problem of using AMR for languages other than English. Annotating new AMR datasets for other languages is an expensive process and requires defining annotation guidelines for each new language. It is therefore reasonable to ask whether we can share AMR annotations across languages. We provided evidence that AMR datasets for English can be successfully transferred to other languages: we trained parsers for Italian, Spanish, German, and Chinese to investigate the cross-linguality of AMR. We showed cases where translational divergences between languages pose a problem and cases where they do not. In summary, this thesis demonstrates the impact of reentrancies in AMR as well as providing insights on AMR for languages that do not yet have AMR datasets.

en

dc.identifier.uri

https://hdl.handle.net/1842/36731

dc.identifier.uri

http://dx.doi.org/10.7488/era/38

dc.language.iso

en

dc.publisher

The University of Edinburgh

en

dc.relation.hasversion

Damonte, M. and Cohen, S. B. (2018). Cross-lingual abstract meaning representation parsing. In Proceedings of NAACL.

en

dc.relation.hasversion

Damonte, M. and Cohen, S. B. (2019). Structural neural encoders for amr-totext generation. In Proceedings of NAACL.

en

dc.relation.hasversion

Damonte, M., Cohen, S. B., and Satta, G. (2017). An incremental parser for abstract meaning representation. In Proceedings of EACL.

en

dc.relation.hasversion

Damonte, M., Szubert, I., Cohen, S., and Steedman, M. (2019). The role of reentrancies in abstract meaning representation parsing. arXiv preprint.

en

dc.relation.hasversion

Issa, F., Damonte, M., Cohen, S. B., Yan, X., and Chang, Y. (2018). Abstract meaning representation for paraphrase detection. In Proceedings of NAACL.

en

dc.subject

natural language processing

en

dc.subject

NLP

en

dc.subject

Abstract Meaning Representation

en

dc.subject

AMR

en

dc.subject

algorithms

en

dc.subject

reentrancies

en

dc.subject

parsing

en

dc.title

Understanding and generating language with abstract meaning representation

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

Doctoral

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Damonte2020.pdf
Size:: 670.83 KB
Format:: Adobe Portable Document Format
Description:

Download

This item appears in the following Collection(s)

Informatics thesis and dissertation collection