Skip to content

shengyongniu/bulk_ATAC_seq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bulk_ATAC_seq

A bioinformatic analysis pipeline for bulk ATAC-seq

  1. FastQC
  • Files: fastqc.sh
  • Description:
    • This shell script help us understand the sequencing qualities of each samples and generate well-organized visualization of qualities
  1. Trimming
  • Files: trim.sh
  • Description:
    • Check the fastqc result and decide how many bases to be trimmed
  1. Mapping
  • Files: mapping.sh, mapping.qsub
  • Description:
    • Map reads and check reads size by the follwing script
  1. Reads size check
  • Files: flagstat.sh
  • Description:
    • check the reads mapping size for experimental adjustment
  1. Sort and remove duplicates
  • Files: sam2Sortedbam_rmdup.qub, sam2Sortedbam_rmdup.sh
  • Description:
  • sort reads and remove duplicates. Note: Be careful for the memory issues. Need to check the result logs and set tmp dir in argument. Also,
  1. Trim ChrY, ChrM, chrUn, and shift the coordinates
  • Files: trim_Un_chY_chM_shift.qsub, trim_Un_chY_chM_shift.sh
  • Description:
    • This file help trim the unused chromosome reads and shift the coordinates for the experimental reason of ATAC-seq
  1. Delete negative reads
  • Files: delete_negative.sh
  • Description:
    • Delete the reads that are shifted to negative coordination
  1. Convert BED file to BAM format for DiffBind
  • Files: bedtobam.sh
  • Description:
    • This script convert the BED format to BAM format for the following DiffBind analysis
  1. Peaks calling
  • Files: peakCalling_y.sh, peakCalling_y.qsub
  • Description:
    • Call peaks by DiffBind
  1. Filter out black list

About

This repo focus on bulk ATAC-seq analysis

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages