Skip to content

Workflow Software

Many researchers develop scalable and reproducible scientific workflows using a workflow tools. There are several workslow tools that you can use on the HPC.

An important factor when choosing a wokflow software for use in HPC systems is that it must have support for the job scheduling software being used. In our case that is PBS. For each “step” or “task” within a workflow you need to be able to launch a PBS job and to be able to specify the CPU and memory requirements for that step.

Common Workflow Language

The Common Workflow Language site can be found here: https://www.commonwl.org
This is a language specification so for implementations of this see https://www.commonwl.org/#Implementations.
Some users in c3 are using cwltool: https://github.com/common-workflow-language/cwltool.
See also https://github.com/common-workflow-language for CWL viewers and utilities.

Cylc

Cylc: https://cylc.github.io/
This looks quite good. It has support for PBS.

Nextflow

Nextflow: https://www.nextflow.io
Many users running genomics pipelines are using this. We have a short page on using Nextflow here: Using Nextflow

References

Wikipedia entry for the Common Workflow Language https://en.wikipedia.org/wiki/Common_Workflow_Language

Tutorial: How to run CWL with Nextflow https://www.biostars.org/p/422848/