Running Genomic pre-analysis using SnakePipes in Supercloud
We have snakePipes module set-up in the Galloway Supercloud account. To read more: https://snakepipes.readthedocs.io/en/stable/. This protocol aims to guide the user how to create an alignment script using the SnakePipes module and instructions on running it in Supercloud.
Log in to your SuperCloud from Terminal.
Navigate to
~/galloway_shared/data/NGS_DATA/raw_reads
and confirm the path to the FASTQ files that you intend to analyze.Go back to your local directory and create a bash script using
vim alignment.sh
Type the following lines into the bash script, and don’t forget to modify the directory paths, organism genome , Read_1 and Read_1 extensions , accordingly.
#!/bin/bash # SBATCH --output=snakePipes-%j.txt # Loading the required module source /etc/profile module load anaconda/2023a # Activate local conda module eval "$(conda shell.bash hook)" conda activate /home/gridsan/groups/galloway/conda_envs/snakepipes ## SnakePipes for DNA mapping command ## DNA-mapping -i ~/galloway_shared/data/path/to/FASTQ_files -o ~/galloway_shared/data/path/to/output_files -v --ext ".fastq" --reads '_1_sequence' '_2_sequence' mm10_gencodeM19
Save and exit from script by pressing
:wq
. You can always open and edit the script usingvim alignment.sh
To run the script in Supercloud, you need to submit to SLURM via LLsub (See Supercloud user guidelines ) using the following command:
LLsub -i alignment.sh
Once submitted, you can check the status of the run via
LLstat
. To verify the success/failure of the run, you can check the logfile—alignment.sh.logXXXXX
for more details. Usually the log file contains the progress of the alignment run and/or the details if the alignment failed.