Set up a benchmarking suite for calculating storage size for user studies (probably from reVisit).
A script to traverse the provenance and update the state for each node according to different frequencies of checkpoints (#65) and using the new diff mechanism (#64).