Skip to content

make it clear when a project is still running / has completed successfully #20

@sillitoe

Description

@sillitoe

Each GeMMA job is run within a project directory:

data/
data/projects.txt
data/starting_clusters/<project_id>/*.faa

It would be really useful to have a reliable way of knowing the current status of this project - ie is this job still running? if not, did it complete successfully?

Effectively, look at a data directory and identify the status from one of the following:

  • invalid data (projects.txt, starting_clusters/, etc)
  • not started
  • running (current progress?)
  • finished with error
  • success

Each project data directory should only have one parent process running at a time. This would suggest that a lock file would be useful (containing info like parent PID, submit_dir, etc). A complicating factor is when should we remove this lock file? Each process can spawn multiple child processes and these are pretty independent.

Current plan:

  • prepare_research_data.pl produces a lock file when run
  • if the lock file exists, this needs a --force argument to overwrite
  • the lock file is only removed if a job has completed successfully

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions