Phylogenetics I

Trees, tree likelihoods, and models of evolution

Public Health Modeling Unit

2025-08-05

Barney Isaksen Potter

Series overview

  1. Trees, tree likelihoods, and models of evolution
  2. Rate heterogeneity and maximum likelihood
  3. Bayesian phylogenetics, Markov chain Monte Carlo, and summary trees
  4. Phylogeography and Kingman's coalescent

What do the data look like?

Probability vs. likelihood

Some notation

  • $P(X)$: probability of $X$
  • $P(X|Y)$: conditional probability of $X$ given $Y$
  • $\theta$: model parameters
  • $\mathcal{L}(\theta|X)$: likelihood of $\theta$ given data $X$


$\mathcal{L}(\theta | X) = P(X | \theta)$

So what is the probability of a tree?

Two key assumptions:

  1. Each site evolves independently.
  2. Each lineage evolves independently.

Continuous Time Markov Chain model of nucleotide evolution

Substitution models

JC69: Jukes and Cantor (1969)

HKY: Hasegawa, Kishino, and Yano (1985)

GTR: Tavaré (1986)

FIN