Dorado
DoradoΒΆ
Dorado is a high-performance, easy-to-use, open source basecaller for Oxford Nanopore reads. It needs to be run on a partition/node with GPU compute/CUDA support, and is heavily-optimised for Nvidia A100 and H100 GPUs.
Dorado is made available on the cluster as a shared Apptainer container image.
You can use the apptainer/dorado
module to add a convenient alias to running dorado
within the container:
Terminal
module avail dorado
module load apptainer/dorado/0.7.1
# The following is required to use aliases in a non-interactive/SLURM batch script:
shopt -s expand_aliases
dorado ....
Alternatively run with apptainer directly; i.e. to run binaries within the container, prefix any
command with apptainer -s run --nv <$APPTAINER_IMG/apptainer_image.sif>
.
Make sure to specify --nv
to enable NVIDIA GPU support. e.g.
Terminal
apptainer -s run --nv $APPTAINER_IMG/dorado-<version>.sif dorado basecaller /models/dna_r10.4.1_e8.2_400bps_hac@v4.1.0 ~/pod5s/
Also note that -s / --silent
is required here to suppress the verbose container output that may otherwise contaminate standard output.
- To list the available basecaller models, run
Terminal
$APPTAINER_IMG/dorado-<version>.sif ls -1 /models
As with all Apptainer containers, take care to qualify the files and paths in the context of the
container image, i.e. Models are located within the container in /models/
, and any other files
and data stored outside the container needs to be in a folder that is bound by Apptainer into the
container (either by default, such as your $HOME
, /scratch
or /projects
), or
by explicitely specifying a bind mount with Apptainer's --bind
option).
Please refer to the dorado GitHub page for more information regarding running dorado. https://github.com/nanoporetech/dorado