At least estimation of:
_maxbad.txt
.pos
-annot.h5
-counts.h5
These steps are slow so currently they are recomputed only when they are missing. This means the way to say "recompute these" is to delete files from disk (not great). We could add an overwrite/recompute parameter to control which of these to recompute. Or maybe make it so the inputs to the computation function are cached properly (joblib?) so that recomputation is automatic when the relevant parameters change.