-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Each GeMMA job is run within a project directory:
data/
data/projects.txt
data/starting_clusters/<project_id>/*.faa
It would be really useful to have a reliable way of knowing the current status of this project - ie is this job still running? if not, did it complete successfully?
Effectively, look at a data directory and identify the status from one of the following:
- invalid data (
projects.txt,starting_clusters/, etc) - not started
- running (current progress?)
- finished with error
- success
Each project data directory should only have one parent process running at a time. This would suggest that a lock file would be useful (containing info like parent PID, submit_dir, etc). A complicating factor is when should we remove this lock file? Each process can spawn multiple child processes and these are pretty independent.
Current plan:
prepare_research_data.plproduces a lock file when run- if the lock file exists, this needs a
--forceargument to overwrite - the lock file is only removed if a job has completed successfully
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels