Computational methods for RNA integrative biology
dc.contributor.advisor
Sanguinetti, Guido
en
dc.contributor.advisor
Granneman, Sander
en
dc.contributor.author
Selega, Alina
en
dc.contributor.sponsor
Engineering and Physical Sciences Research Council (EPSRC)
en
dc.date.accessioned
2018-04-25T09:12:42Z
dc.date.available
2018-04-25T09:12:42Z
dc.date.issued
2018-07-02
dc.description.abstract
Ribonucleic acid (RNA) is an essential molecule, which carries out a wide variety
of functions within the cell, from its crucial involvement in protein synthesis to
catalysing biochemical reactions and regulating gene expression. Such diverse functional
repertoire is indebted to complex structures that RNA can adopt and its flexibility
as an interacting molecule.
It has become possible to experimentally measure these two crucial aspects of RNA
regulatory role with such technological advancements as next-generation sequencing
(NGS). NGS methods can rapidly obtain the nucleotide sequence of many molecules
in parallel. Designing experiments, where only the desired parts of the molecule (or
specific parts of the transcriptome) are sequenced, allows to study various aspects
of RNA biology. Analysis of NGS data is insurmountable without computational
methods.
One such experimental method is RNA structure probing, which aims to infer RNA
structure from sequencing chemically altered transcripts. RNA structure probing data
is inherently noisy, affected both by technological biases and the stochasticity of the
underlying process. Most existing methods do not adequately address the issue of
noise, resorting to heuristics and limiting the informativeness of their output. In this
thesis, a statistical pipeline was developed for modelling RNA structure probing data,
which explicitly captures biological variability, provides automated bias-correcting
strategies, and generates a probabilistic output based on experimental measurements.
The output of our method agrees with known RNA structures, can be used to constrain
structure prediction algorithms, and remains robust to reduced sequence coverage,
thereby increasing sensitivity of the technology.
Another recent experimental innovation maps RNA-protein interactions at very
high temporal resolution, making it possible to study rapid binding events happening
on a minute time scale. In this thesis, a non-parametric algorithm was developed for
identifying significant changes in RNA-protein binding time-series between different
conditions. The method was applied to novel yeast RNA-protein binding time-course
data to study the role of RNA degradation in stress response. It revealed pervasive
changes in the binding to the transcriptome of the yeast transcription termination
factor Nab3 and the cytoplasmic exoribonuclease Xrn1 under nutrient stress. This
challenged the common assumption of viewing transcriptional changes as the major
driver of changes in RNA expression during stress and highlighted the importance of
degradation. These findings inspired a dynamical model for RNA expression, where
transcription and degradation rates are modelled using RNA-protein binding time-series
data.
en
dc.identifier.uri
http://hdl.handle.net/1842/29630
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Selega, A., Granneman, S., and Sanguinetti, G. (2016). BUMHMM: Computational pipeline for computing probability of modification from structure probing experiment data. Available at http://www.bioconductor.org/packages/BUMHMM.
en
dc.relation.hasversion
Selega, A. and Sanguinetti, G. (2016). Trends and challenges in computational RNA biology. Genome biology, 17(1):253.
en
dc.relation.hasversion
Selega, A., Sirocchi, C., Iosub, I., Granneman, S., and Sanguinetti, G. (2017). Robust statistical modeling improves sensitivity of high-throughput RNA structure probing experiments. Nature methods, 14(1):83–89.
en
dc.relation.hasversion
van Nues, R., Schweikert, G., de Leau, E., Selega, A., Langford, A., Franklin, R., Iosub, I., Wadsworth, P., Sanguinetti, G., and Granneman, S. (2017). Kinetic CRAC uncovers a role for Nab3 in determining gene expression profiles during stress. Nature Communications, 8.
en
dc.subject
RNA
en
dc.subject
next-generation sequencing
en
dc.subject
NGS
en
dc.subject
RNA structure probing
en
dc.subject
automated bias-correcting strategies
en
dc.subject
structure prediction algorithms
en
dc.subject
modelling
en
dc.title
Computational methods for RNA integrative biology
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Selega2018.pdf
- Size:
- 24.24 MB
- Format:
- Adobe Portable Document Format
This item appears in the following Collection(s)

