Statistical inference in population genetics using microsatellites
dc.contributor.advisor
Pemberton, Josephine
en
dc.contributor.advisor
Johnson, Toby
en
dc.contributor.author
Csilléry, Katalin
en
dc.contributor.sponsor
Principal’s Studentship from the School of Biological Sciences (University of Edinburgh) and a travel grant from the James Rennie Bequest
en
dc.date.accessioned
2010-10-06T10:18:32Z
dc.date.available
2010-10-06T10:18:32Z
dc.date.issued
2009
dc.description.abstract
Statistical inference from molecular population genetic data is currently a very active
area of research for two main reasons. First, in the past two decades an enormous
amount of molecular genetic data have been produced and the amount of data is
expected to grow even more in the future. Second, drawing inferences about complex
population genetics problems, for example understanding the demographic and genetic
factors that shaped modern populations, poses a serious statistical challenge.
Amongst the many different kinds of genetic data that have appeared in the past
two decades, the highly polymorphic microsatellites have played an important role.
Microsatellites revolutionized the population genetics of natural populations, and were
the initial tool for linkage mapping in humans and other model organisms. Despite
their important role, and extensive use, the evolutionary dynamics of microsatellites
are still not fully understood, and their statistical methods are often underdeveloped
and do not adequately model microsatellite evolution. In this thesis, I address some
aspects of this problem by assessing the performance of existing statistical tools, and
developing some new ones. My work encompasses a range of statistical methods from
simple hypothesis testing to more recent, complex computational statistical tools. This
thesis consists of four main topics.
First, I review the statistical methods that have been developed for microsatellites
in population genetics applications. I review the different models of the microsatellite
mutation process, and ask which models are the most supported by data, and how
models were incorporated into statistical methods. I also present estimates of mutation
parameters for several species based on published data.
Second, I evaluate the performance of estimators of genetic relatedness using real
data from five vertebrate populations. I demonstrate that the overall performance
of marker-based pairwise relatedness estimators mainly depends on the population
relatedness composition and may only be improved by the marker data quality within
the limits of the population relatedness composition.
Third, I investigate the different null hypotheses that may be used to test for
independence between loci. Using simulations I show that testing for statistical
independence (i.e. zero linkage disequilibrium, LD) is difficult to interpret in
most cases, and instead a null hypothesis should be tested, which accounts for the
“background LD” due to finite population size. I investigate the utility of a novel
approximate testing procedure to circumvent this problem, and illustrate its use on a
real data set from red deer.
Fourth, I explore the utility of Approximate Bayesian Computation, inference
based on summary statistics, to estimate demographic parameters from admixed
populations. Assuming a simple demographic model, I show that the choice of
summary statistics greatly influences the quality of the estimation, and that different
parameters are better estimated with different summary statistics. Most importantly, I
show how the estimation of most admixture parameters can be considerably improved
via the use of linkage disequilibrium statistics from microsatellite data.
en
dc.identifier.uri
http://hdl.handle.net/1842/3865
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.subject
molecular population genetic data
en
dc.subject
statistical inference
en
dc.subject
demographic factors of modern populations
en
dc.subject
genetic factors of modern populations
en
dc.subject
microsatellite evolution
en
dc.subject
statistical methods
en
dc.title
Statistical inference in population genetics using microsatellites
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Csillery2009.pdf
- Size:
- 2.97 MB
- Format:
- Adobe Portable Document Format
- Description:
This item appears in the following Collection(s)

