Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prepare release v0.2.0 #112

Merged
merged 76 commits into from
Oct 27, 2023
Merged
Changes from 1 commit
Commits
Show all changes
76 commits
Select commit Hold shift + click to select a range
bc618b9
refactor: fully separate run & init (copy config) commands
kelly-sovacool Oct 11, 2023
184a577
fix: copy assets on init
kelly-sovacool Oct 11, 2023
deacddb
refactor: run tests from tests/ dir
kelly-sovacool Oct 11, 2023
3c02261
docs: document init command
kelly-sovacool Oct 11, 2023
1d2bb82
feat: support local or slurm execution
kelly-sovacool Oct 11, 2023
0ebdad3
refactor!: require python 3.10 for itertools.pairwise()
kelly-sovacool Oct 11, 2023
b985c0c
feat!: modify -profile depending on hpc if run mode is slurm
kelly-sovacool Oct 11, 2023
6682a81
chore: remove nextflow dep
kelly-sovacool Oct 11, 2023
d7079fc
docs: create release guide
kelly-sovacool Oct 11, 2023
61966ae
fix: make sure profile gets set when mode=slurm
kelly-sovacool Oct 12, 2023
9aa5772
docs: delete placeholders & start fresh
kelly-sovacool Oct 12, 2023
c5f6be8
style: sort imports alphabetically
kelly-sovacool Oct 12, 2023
be7358e
feat: run local version or from github
kelly-sovacool Oct 12, 2023
1def175
chore: document CLI updates
kelly-sovacool Oct 12, 2023
49703c9
docs: overhaul readme for biowulf
kelly-sovacool Oct 12, 2023
8b949d1
fix(cli): main.nf path must be a flag, not positional
kelly-sovacool Oct 12, 2023
4bd9c27
fix(cli): correctly pair up nextflow args
kelly-sovacool Oct 12, 2023
8155aa6
docs: describe custom genome usage
kelly-sovacool Oct 12, 2023
0134e17
docs: minor tweaks
kelly-sovacool Oct 12, 2023
540734d
fix: remove baseDir from conf
kelly-sovacool Oct 12, 2023
0c93f75
chore: assets don't need to be executable
kelly-sovacool Oct 12, 2023
d84bb71
ci: test gha pip paths
kelly-sovacool Oct 12, 2023
b0a0569
chore: update version for release candidate
kelly-sovacool Oct 12, 2023
474ede0
docs: need conda py3.11 to install successfully
kelly-sovacool Oct 12, 2023
cde5c94
fix: typo in config filename
kelly-sovacool Oct 12, 2023
95e9a0b
feat: create install script for ccbrpipeliner module
kelly-sovacool Oct 12, 2023
2228d54
chore: Merge branch 'release-v0.2.0' of https://github.com/CCBR/CHAMP…
kelly-sovacool Oct 12, 2023
6958b4f
docs: need to export path & pythonpath after installing
kelly-sovacool Oct 12, 2023
ed6cb1f
chore: Merge branch 'release-v0.2.0' of https://github.com/CCBR/CHAMP…
kelly-sovacool Oct 12, 2023
94bc9e2
docs: use py311 conda env during installation
kelly-sovacool Oct 12, 2023
a4d9ba0
fix: include assets & alphabetize
kelly-sovacool Oct 12, 2023
29b2abd
fix: need wildcard to add subdirs to package data
kelly-sovacool Oct 13, 2023
0a2c417
fix: biowulf requires min memory
kelly-sovacool Oct 13, 2023
0ea9c81
fix: correct nextflow arg parsing and allow biowulf/frce config with …
kelly-sovacool Oct 13, 2023
a336154
fix: typo
kelly-sovacool Oct 13, 2023
ed243e5
ci: don't use pip -t
kelly-sovacool Oct 13, 2023
e99f71b
chore: bump version to 0.2.0
kelly-sovacool Oct 13, 2023
a76ea34
fix: out of memory for samtools sort
kelly-sovacool Oct 13, 2023
c07dd3c
test: switch mem back to 72 GB
kelly-sovacool Oct 13, 2023
f6da4a0
fix: increase picard markdup resources
kelly-sovacool Oct 14, 2023
c3f4855
fix: make sure scripts are executable for nextflow
kelly-sovacool Oct 16, 2023
21db7cf
refactor: rename install script -- it's biowulf-specific
kelly-sovacool Oct 16, 2023
633180a
fix: don't hide errors
kelly-sovacool Oct 16, 2023
09a08b9
fix: download gem read dists file
kelly-sovacool Oct 17, 2023
f60cd43
fix: nf comment
kelly-sovacool Oct 17, 2023
98085ca
fix: assign process labels for peak callers
kelly-sovacool Oct 18, 2023
5f11030
chore: Merge branch 'release-v0.2.0' of https://github.com/CCBR/CHAMP…
kelly-sovacool Oct 18, 2023
7935fa2
fix: bump maxRetries to 2
kelly-sovacool Oct 19, 2023
e54c49c
fix: increase resources for ppqt
kelly-sovacool Oct 19, 2023
cf8b075
chore: remove nf-core/khmers
kelly-sovacool Oct 19, 2023
8e654b0
fix: update cutadapt module w/ corrected PE behavior
kelly-sovacool Oct 19, 2023
bf561fa
fix: include assets subdirs
kelly-sovacool Oct 19, 2023
446f4ef
fix(cli): recursively include data subdirs
kelly-sovacool Oct 19, 2023
867307f
fix: increase memory for process_higher label
kelly-sovacool Oct 19, 2023
d9dea8a
chore: slurm node doesn't have to be interactive
kelly-sovacool Oct 19, 2023
2b95c0e
fix: effective gsize calculation
kelly-sovacool Oct 19, 2023
e35c0b6
chore: Merge branch 'release-v0.2.0' of https://github.com/CCBR/CHAMP…
kelly-sovacool Oct 19, 2023
5109519
refactor: use filter_blacklist subworkflow from CCBR/nf-modules
kelly-sovacool Oct 23, 2023
e1a3652
chore: Merge branch 'release-v0.2.0' of https://github.com/CCBR/CHAMP…
kelly-sovacool Oct 23, 2023
b973dba
chore: update modules
kelly-sovacool Oct 23, 2023
d49f3ef
chore: don't write test dag to assets anymore
kelly-sovacool Oct 23, 2023
1165fc7
feat: install samtools/flagstat module
kelly-sovacool Oct 24, 2023
f74e1dd
feat: update CCBR nf-modules
kelly-sovacool Oct 24, 2023
891f150
fix: udpate filter_blacklist w/ bugfix
kelly-sovacool Oct 24, 2023
9460580
fix: update samtools/sort to output bai files
kelly-sovacool Oct 24, 2023
46ed621
chore: don't use process.scratch for interactive profile
kelly-sovacool Oct 24, 2023
1e34c72
feat: process to filter by alignment quality
kelly-sovacool Oct 24, 2023
0f10d23
refactor: use align_genome submodule instead of process
kelly-sovacool Oct 24, 2023
dad66d4
fix: flagstat now outputs metadata too
kelly-sovacool Oct 24, 2023
7299db6
fix: biowulf repo path
kelly-sovacool Oct 24, 2023
2bfef25
test: run QC & input normalization steps
kelly-sovacool Oct 24, 2023
655d711
chore: Merge branch 'release-v0.2.0' of https://github.com/CCBR/CHAMP…
kelly-sovacool Oct 24, 2023
861c233
style: tweak plot dimensions
kelly-sovacool Oct 25, 2023
dd5d12b
style: plot broad & narrow peaks separately
kelly-sovacool Oct 25, 2023
ee6049d
fix: don't log-scale the peak width plot axes
kelly-sovacool Oct 26, 2023
b5f0214
style: widen peak width plots
kelly-sovacool Oct 26, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
style: plot broad & narrow peaks separately
  • Loading branch information
kelly-sovacool committed Oct 25, 2023
commit dd5d12b220f4bb7bd2aede5613b815497fc18e14
16 changes: 11 additions & 5 deletions assets/multiqc_config.yaml
Original file line number Diff line number Diff line change
@@ -138,10 +138,14 @@ custom_data:
parent_id: peak_qc
id: frip_nbases
section_name: "FRiP vs. number of bases"
peak_widths:
peak_widths_broad:
parent_id: peak_qc
id: peak_widths
section_name: "Distribution of peak widths"
id: peak_widths_broad
section_name: "Broad peak widths"
peak_widths_narrow:
parent_id: peak_qc
id: peak_widths_narrow
section_name: "Narrow peak widths"
jaccard_heatmap:
parent_id: peak_qc
id: jaccard_heatmap
@@ -166,8 +170,10 @@ sp:
fn: "FRiP_samples*.png"
frip_nbases:
fn: "FRiP_nbases*.png"
peak_widths:
fn: "peak_widths*.png"
peak_widths_broad:
fn: "peak_widths_broad*.png"
peak_widths_narrow:
fn: "peak_widths_narrow*.png"
jaccard_heatmap:
fn: "jaccard_heatmap_all*.png"
jaccard_pca_all:
10 changes: 8 additions & 2 deletions bin/plot_frip.R
Original file line number Diff line number Diff line change
@@ -11,7 +11,12 @@ if (length(args) < 1) {
}

frip_filename <- args[1]
frip_dat <- read_tsv(frip_filename)
frip_dat <- read_tsv(frip_filename) %>%
mutate(peak_type = case_when(
bedtool == "sicer" | bedtool == "macs_broad" ~ "broad",
bedtool == "gem" | bedtool == "macs_narrow" ~ "narrow",
TRUE ~ NA_character_
))

sample_frip_plot <- frip_dat %>%
mutate(n_overlap_reads_mil = n_overlap_reads / 10^6) %>%
@@ -23,7 +28,7 @@ sample_frip_plot <- frip_dat %>%
names_to = "metric"
) %>%
ggplot(aes(value, bedsample, color = bedtool)) +
facet_wrap("metric", nrow = 1, scales = "free_x", strip.position = "bottom") +
facet_wrap(peak_type ~ metric, scales = "free_x", strip.position = "bottom") +
geom_jitter(alpha = 0.8, height = 0.1) +
geom_hline(
yintercept = seq(1.5, length(unique(frip_dat %>% pull(bedsample))) - 0.5, 1),
@@ -47,6 +52,7 @@ ggsave(filename = "FRiP_samples.png", plot = sample_frip_plot, device = "png", d
nbasesM_frip_plot <- frip_dat %>%
ggplot(aes(n_basesM, FRiP, color = bedsample, shape = bedtool)) +
geom_point(alpha = 0.8) +
facet_wrap(~peak_type, scales = "free") +
guides(
color = guide_legend(title = "Sample"),
shape = guide_legend(title = "Peak caller")
84 changes: 59 additions & 25 deletions bin/plot_peak_widths.R
Original file line number Diff line number Diff line change
@@ -2,6 +2,7 @@
library(dplyr)
library(ggplot2)
library(readr)
library(scales)

set.seed(20230926)
args <- commandArgs(trailingOnly = TRUE)
@@ -10,30 +11,63 @@ if (length(args) < 1) {
}
tsv_filename <- args[1]
peak_dat <- read_tsv(tsv_filename) %>%
mutate(peak_width = chromEnd - chromStart)
xmin <- peak_dat %>%
pull(peak_width) %>%
min()
xmax <- peak_dat %>%
pull(peak_width) %>%
max()
mutate(
peak_width = chromEnd - chromStart,
peak_type = case_when(
tool == "sicer" | tool == "macs_broad" ~ "broad",
tool == "gem" | tool == "macs_narrow" ~ "narrow",
TRUE ~ NA_character_
)
)

peak_widths_hist <- peak_dat %>%
ggplot(aes(peak_width, fill = tool)) +
geom_histogram(alpha = 0.7, position = "identity") +
scale_x_log10(
limits = c(xmin - 10^2, xmax + 10^2),
labels = scales::label_log(digits = 2)
) +
scale_fill_viridis_d() +
guides(fill = guide_legend(
label.position = "bottom",
title = "Peak caller",
title.position = "top"
)) +
facet_wrap("sample_id") +
labs(x = expression("" * log[10] * " Peak Widths")) +
theme_bw() +
theme(legend.position = "top")
tool_colors <- viridisLite::viridis(n = 4, alpha = 0.7, begin = 0, end = 1, direction = 1, option = "D")
names(tool_colors) <- peak_dat %>%
pull(tool) %>%
unique()

ggsave(filename = "peak_widths_histogram.png", plot = peak_widths_hist, device = "png", dpi = 300, height = 4, width = 5)
plot_hist <- function(peak_dat) {
xmin <- peak_dat %>%
pull(peak_width) %>%
min()
xmax <- peak_dat %>%
pull(peak_width) %>%
max()
peak_dat %>%
ggplot(aes(peak_width, fill = tool)) +
geom_histogram(alpha = 0.7, position = "identity") +
scale_x_log10(
limits = c(xmin - 10^2, xmax + 10^2),
labels = label_log(digits = 2)
) +
scale_fill_manual(
values = tool_colors,
breaks = names(tool_colors)
) +
guides(fill = guide_legend(
label.position = "bottom",
title = "Peak caller",
title.position = "top"
)) +
facet_wrap(~sample_id) +
labs(x = expression("" * log[10] * " Peak Widths")) +
theme_bw() +
theme(legend.position = "top")
}

hist_broad <- peak_dat %>%
filter(peak_type == "broad") %>%
plot_hist() +
scale_y_log10(labels = label_log(digits = 2))
hist_narrow <- peak_dat %>%
filter(peak_type == "narrow") %>%
plot_hist() +
scale_y_log10(labels = label_log(digits = 2))

ggsave(
filename = "peak_widths_broad_histogram.png", plot = hist_broad,
device = "png", dpi = 300, height = 4, width = 5
)
ggsave(
filename = "peak_widths_narrow_histogram.png", plot = hist_narrow,
device = "png", dpi = 300, height = 4, width = 5
)