dc.contributor.advisor | Webber, Bonnie | en |
dc.contributor.advisor | Shimodaira, Hiroshi | en |
dc.contributor.advisor | Van Deemter, Kees | en |
dc.contributor.author | Pareti, Silvia | en |
dc.date.accessioned | 2016-01-26T16:48:40Z | |
dc.date.available | 2016-01-26T16:48:40Z | |
dc.date.issued | 2015-11-26 | |
dc.identifier.uri | http://hdl.handle.net/1842/14170 | |
dc.description.abstract | Our society is overwhelmed with an ever growing amount of information. Effective
management of this information requires novel ways to filter and select the most relevant
pieces of information. Some of this information can be associated with the source
or sources expressing it. Sources and their relation to what they express affect information
and whether we perceive it as relevant, biased or truthful. In news texts in
particular, it is common practice to report third-party statements and opinions. Recognizing
relations of attribution is therefore a necessary step toward detecting statements
and opinions of specific sources and selecting and evaluating information on the basis
of its source.
The automatic identification of Attribution Relations has applications in numerous
research areas. Quotation and opinion extraction, discourse and factuality have
all partly addressed the annotation and identification of Attribution Relations. However,
disjoint efforts have provided a partial and partly inaccurate picture of attribution.
Moreover, these research efforts have generated small or incomplete resources, thus
limiting the applicability of machine learning approaches. Existing approaches to extract
Attribution Relations have focused on rule-based models, which are limited both
in coverage and precision.
This thesis presents a computational approach to attribution that recasts attribution
extraction as the identification of the attributed text, its source and the lexical cue linking
them in a relation. Drawing on preliminary data-driven investigation, I present a
comprehensive lexicalised approach to attribution and further refine and test a previously
defined annotation scheme. The scheme has been used to create a corpus annotated
with Attribution Relations, with the goal of contributing a large and complete
resource than can lay the foundations for future attribution studies.
Based on this resource, I developed a system for the automatic extraction of attribution
relations that surpasses traditional syntactic pattern-based approaches. The system
is a pipeline of classification and sequence labelling models that identify and link each
of the components of an attribution relation. The results show concrete opportunities
for attribution-based applications. | en |
dc.contributor.sponsor | Engineering and Physical Sciences Research Council (EPSRC) | en |
dc.language.iso | en | |
dc.publisher | The University of Edinburgh | en |
dc.relation.hasversion | Cervone, A., Pareti, S., Bell, P., Prodanof, I., and Caselli, T. (2014). Detecting attribution relations in speech. In Basili, R., Lenci, A., and Magnini, B., editors, First Italian Conference on Computational Linguistics CLiC-it 2014, Pisa, Italy. Pisa University Press. | en |
dc.relation.hasversion | O’Keefe, T., Pareti, S., Curran, J., Koprinska, I., and Honnibal, M. (2012). A sequence labelling approach to quote attribution. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Jeju, Korea. | en |
dc.relation.hasversion | Pareti, S. (2011). Annotating attribution relations and their features. In Alonso, O., Kamps, J., and Karlgren, J., editors, ESAIR’11: Proceedings of the CIKM’11 Workshop on Exploiting Semantic Annotations in Information Retrieval. ACM Press. | en |
dc.relation.hasversion | Pareti, S. (2012). A database of attribution relations. In Calzolari, N., Choukri, K., Declerck, T., Do˘gan, M. U., Maegaard, B., Mariani, J., Odijk, J., and Piperidis, S., editors, Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC), Istanbul, Turkey. European Language Resources Association (ELRA). | en |
dc.relation.hasversion | Pareti, S. (2012). The independent encoding of attribution relations. In Proceedings of the Eight Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA- 8), Pisa, Italy. | en |
dc.relation.hasversion | Pareti, S. (2015). Annotating attribution relations across languages and genres. In Proceedings of the Eleventh Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA-11), London, UK. | en |
dc.relation.hasversion | Pareti, S., O’Keefe, T., Konstas, I., Curran, J. R., and Koprinska, I. (2013). Automatically detecting and attributing indirect quotations. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 989– 999, Seattle, Washington, USA. Association for Computational Linguistics. | en |
dc.relation.hasversion | Pareti, S. and Prodanof, I. (2010). Annotating attribution relations: Towards an Italian discourse treebank. In Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., and Tapias, D., editors, Proceedings of LREC, Malta. European Language Resources Association (ELRA). | en |
dc.subject | attribution | en |
dc.subject | quotation | en |
dc.subject | opinion | en |
dc.subject | relation extraction | en |
dc.subject | discourse | en |
dc.title | Attribution: a computational approach | en |
dc.type | Thesis or Dissertation | en |
dc.type.qualificationlevel | Doctoral | en |
dc.type.qualificationname | PhD Doctor of Philosophy | en |