A clade project is run using closely related genomes, e.g., genomes of the same species. For this example we use 10 Dehalococoides genomes downloaded from NCBI.
#!/bin/bash --login
########## SBATCH Lines for Resource Request ##########
#SBATCH --time=02:00:00 # limit of wall clock time - how long the job will run (same as -t)
#SBATCH --nodes=1 # number of different nodes - could be an exact number or a range of nodes (same as -N)
#SBATCH --ntasks=1 # number of tasks - how many tasks (nodes) that you require (same as -n)
#SBATCH --cpus-per-task=12 # number of CPUs (or cores) per task (same as -c)
#SBATCH --mem-per-cpu=2G # memory required per allocated CPU (or core) - amount of memory (in bytes)
#SBATCH --job-name MiGA_clade # you can give your job a name for easier identification (same as -J)
########## Command Lines to Run ##########
rm -rf ~/miga_clade
mkdir ~/miga_clade
cd ~/miga_clade
singularity shell ~/MiGA << EOF
miga new -P ~/miga_clade -t clade
miga add -P . -t genome -i assembly ~/dehalo_genomes/*.fna -m run_mytaxa_scan=false,run_distances=true
time miga daemon start -t -P . --shutdown-when-done
exit
EOF
Entering
less ~/miga_clade/data/10.clades/02.ani/miga-project.class.tsv
presents a table with the genomes in the first column and their clade assignment in the second column: