-
Notifications
You must be signed in to change notification settings - Fork 0
Minimal Tutorial
Ken Nakatsu edited this page Aug 3, 2023
·
4 revisions
Running sRNAfrag on an HPC allows for users to run it on a large number of samples. In this example, I will ran sRNAfrag on >200 small RNA-seq samples.
- Load modules, and clone the github repo and install.
module load git
module load Anaconda
module load R_4.2.1
git clone https://github.com/kenminsoo/sRNAfrag
cd sRNAfrag/install
sh easy_install.sh
- Build a bowtie-index for my genome of interest. Obtain a gtf file to label for out-of-space maps.
The ITAS publication has a nice summary of transcripts that are not snoRNA, which is what I wanted for this example.
https://github.com/EpiEpiMSU/ITAS/tree/main/Integrated_annotation/human
cd my/dir/somewhere/ref_genome
bowtie-build hg38.fa hg38_index // add path to the config file
// download annotation from ITAS, add to config
-
Add directories to config file. Change any adapters if needed.
-
Run the pipeline
python sRNA_fragment.py
- TIP: Keep the working directory the first time you run sRNAfrag if you are processing reads. Then
cd working
mkdir ../processed
mv *.fastq.gz ../processed
This will prevent the need to rerun all modules again, and will speed up your analysis time.