-
Notifications
You must be signed in to change notification settings - Fork 3
Description
I've noticed a consistent tree building failure when using cmaple for mpox trees.
The failure is pretty obvious but it'll take a few screenshots to show what I mean - an alternative is to run iqtree on the alignment and compare the result.
While in general tree builders might differ, in this case, I'm pretty convinced that the tree builder is wrong and this might point at a bug hiding somewhere in the code (either likelihood calculation or search heuristics). I'm using EXHAUSTIVE search already but this doesn't seem to help unfortunately. Maybe there needs to be a SUPEREXHAUSTIVE mode? The builder runs for only 22 seconds, I'd be happy to wait extra time if it fixed this issue.
I found this error as clade Ib has a clade that has 3 mutations reverting back to the root genotype and these are the only mutations, indicating that the clade should really be the basal part of clade Ib.
This is inset 1, showing clade Ib and the sibling clade Ia:
The bug is that the branch leading to Ib should not attach where it currently does, but at the green internal node (which shares genotype with the ancestor branch in the outset
The command used to run cmaple was like this:
cmaple \
-aln results/all-clades/masked.fasta \
-st DNA \
--search EXHAUSTIVE \
--out-mul-tree \
--make-consistent \
-nt AUTO \
--overwrite
These are input and output files (remove the txt extension and untar): repro.tar.zst.txt
Mutations were inferred by treetime as cmaple doesn't output reconstructed states (yet).
If you struggle to understand my explanation of the error I'm happy to explain in further detail.
I suppose the ultimate proof would be to compare likelihoods of the wrong and correct trees. I'll see whether I can figure this out somehow.