read-pipeline

Short read pipeline of CIWARS for taxonomy classification of bacterial pathogens and computation of ARG abundance based on rpoB marker gene normalization

Requirements

Linux operating system
conda

Installation

git clone https://github.com/muhit-emon/read-pipeline.git
cd read-pipeline
bash install.sh
conda env create -f environment.yml

conda environment activation

After installation, a conda environment named read_pipeline will be created.
To activate the environment, run the following command

conda activate read_pipeline

Download bacterial pathogen database (22 GB) and non-prokaryote database (11 GB)

Go inside read-pipeline directory. Download the pathogen DB and non-prokaryote DB compatible with Kraken2 and uncompress them.

wget https://zenodo.org/records/14537567/files/CIWARS_Pathogen_DB.tar.gz
tar -zxvf CIWARS_Pathogen_DB.tar.gz
rm CIWARS_Pathogen_DB.tar.gz

wget https://zenodo.org/records/14537567/files/non-prokaryote-DB.tar.gz
tar -zxvf non-prokaryote-DB.tar.gz
rm non-prokaryote-DB.tar.gz

Usage on metagenomic paired-end short read data

Go inside read-pipeline directory.

To run the short read pipeline on metagenomic paired-end short read data ( * .fastq/ * .fq/ * .fastq.gz/ * .fq.gz), use the following command

nextflow run short-read-pipeline.nf --R1 <absolute/path/to/forward/read/file> --R2 <absolute/path/to/reverse/read/file> --out_fname <prefix of output file name>
rm -r work

The command line options for this script (short-read-pipeline.nf) are:

--R1: The absolute path of the fastq file containing forward read sequences
--R2: The absolute path of the fastq file containing reverse read sequences
--out_fname: The prefix of the output file name

With --out_fname S1, output files named S1.k2report, S1_rpoB_ARG_norm.tsv, and S1_drug_wise_rpoB_norm.tsv will be generated inside read-pipeline directory.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
DB		DB
README.md		README.md
drug_class_wise_norm.py		drug_class_wise_norm.py
environment.yml		environment.yml
install.sh		install.sh
rpoB_abund.py		rpoB_abund.py
short-read-pipeline.nf		short-read-pipeline.nf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

read-pipeline

Requirements

Installation

conda environment activation

Download bacterial pathogen database (22 GB) and non-prokaryote database (11 GB)

Usage on metagenomic paired-end short read data

About

Releases

Packages

Languages

muhit-emon/read-pipeline

Folders and files

Latest commit

History

Repository files navigation

read-pipeline

Requirements

Installation

conda environment activation

Download bacterial pathogen database (22 GB) and non-prokaryote database (11 GB)

Usage on metagenomic paired-end short read data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages