Automatic movie analysis and summarisation
dc.contributor.advisor
Lapata, Maria
en
dc.contributor.advisor
Sarkar, Rik
en
dc.contributor.author
Gorinski, Philip John
en
dc.date.accessioned
2018-06-04T10:07:15Z
dc.date.available
2018-06-04T10:07:15Z
dc.date.issued
2018-07-02
dc.description.abstract
Automatic movie analysis is the task of employing Machine Learning methods to the
field of screenplays, movie scripts, and motion pictures to facilitate or enable various
tasks throughout the entirety of a movie’s life-cycle. From helping with making
informed decisions about a new movie script with respect to aspects such as its originality,
similarity to other movies, or even commercial viability, all the way to offering
consumers new and interesting ways of viewing the final movie, many stages in the
life-cycle of a movie stand to benefit from Machine Learning techniques that promise
to reduce human effort, time, or both. Within this field of automatic movie analysis,
this thesis addresses the task of summarising the content of screenplays, enabling users
at any stage to gain a broad understanding of a movie from greatly reduced data. The
contributions of this thesis are four-fold: (i)We introduce ScriptBase, a new large-scale
data set of original movie scripts, annotated with additional meta-information such as
genre and plot tags, cast information, and log- and tag-lines. To our knowledge, Script-
Base is the largest data set of its kind, containing scripts and information for almost
1,000 Hollywood movies. (ii) We present a dynamic summarisation model for the
screenplay domain, which allows for extraction of highly informative and important
scenes from movie scripts. The extracted summaries allow for the content of the original
script to stay largely intact and provide the user with its important parts, while
greatly reducing the script-reading time. (iii) We extend our summarisation model
to capture additional modalities beyond the screenplay text. The model is rendered
multi-modal by introducing visual information obtained from the actual movie and by
extracting scenes from the movie, allowing users to generate visual summaries of motion
pictures. (iv) We devise a novel end-to-end neural network model for generating
natural language screenplay overviews. This model enables the user to generate short
descriptive and informative texts that capture certain aspects of a movie script, such as
its genres, approximate content, or style, allowing them to gain a fast, high-level understanding
of the screenplay. Multiple automatic and human evaluations were carried
out to assess the performance of our models, demonstrating that they are well-suited
for the tasks set out in this thesis, outperforming strong baselines. Furthermore, the
ScriptBase data set has started to gain traction, and is currently used by a number of
other researchers in the field to tackle various tasks relating to screenplays and their
analysis.
en
dc.identifier.uri
http://hdl.handle.net/1842/31053
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Gorinski, P. J. and Lapata, M. (2015). Movie script summarization as graph-based scene extraction. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1066–1076, Denver, Colorado.
en
dc.subject
summarisation
en
dc.subject
information extraction
en
dc.subject
mulitmodal
en
dc.subject
neural networks
en
dc.subject
ilp
en
dc.subject
natural language processing
en
dc.subject
natural language generation
en
dc.subject
extractive summarisation
en
dc.subject
abstractive summarisation
en
dc.title
Automatic movie analysis and summarisation
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Gorinski2018.pdf
- Size:
- 3.76 MB
- Format:
- Adobe Portable Document Format
This item appears in the following Collection(s)

