-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AtacWorks errors on custom data #221
Comments
@asundaresan1 can you share your atacworks command please. I'm taking a look at your logs, will report back soon. |
atacworks train |
It seems like this error is occuring because of |
How should I verify this? I also tested this on a mouse cell line ATAC and get same error. |
The atacworks output directory should contain a If the files are small enough, please upload them so I can try it as well. Could you also share how you generated the peak file you passed in with |
The files are big to be uploaded here. |
Hi @asundaresan1, wondering if you were able to resolve this issue?
Please let us know the output of this command for your |
Hi @avantikalal and @ntadimeti, I was able to train the model only after I preprocessed my samples using https://github.com/zchiang/atacworks_analysis/tree/master/preprocessing. However, after denoising I don’t see much improvement in the quality of my data. I have attached the profiles of all the samples before and after denoising. I can see that the scale has improved, and the profiles are smoother, but I was expecting Sample_3 to profile like Sample_4 for example. Is there anything else that I can try? These are denoised using https://ngc.nvidia.com/catalog/models/nvidia:atac_bulk_lowcov_1m_50m |
Hi @asundaresan1, the model you've used is trained to make low-coverage data look like higher coverage data, so it will enhance the profile at a local scale, but it does not improve aggregate TSS score across the whole genome. |
Thank you for your suggestion. Let me try with that model. |
Thanks for sharing the details @asundaresan1. Based on this, I think the model I suggested above is the best suited for your data. If that too doesn't work, we can discuss whether it may be possible for you to train your own model. |
Hi @asundaresan1 , just checking in on this issue. Were you able to get better results using the second model? |
Hi @avantikalal , yes I got better results with the model you suggested. |
Hi @avantikalal, in cases where noisy samples have coverage >= that of the clean samples, should users always forego training and use your pretrained model, nvidia:atac_bulk_lowqual_20m_20m? If the pretrained model is not successful in reducing noise, are there training parameters that should be considered when constructing a custom model. Thanks for the help—atacworks looks like a gamechanger! |
Hello, I am trying to analyze my bulk ATAC samples but get the attached errors. These are ATAC from human donor tissues, so I am trying to train the model which is where I get the error. I am using Atacworks version 0.3.0. I tested with the test data (available as part of tutorial) and it worked fine without any errors.
The ATAC-seq reads are aligned to the human reference genome (hg19) using BWA. For unique alignments, duplicate reads were filtered out. The resulting uniquely mapped reads were normalized to the same read depth across all samples and converted into bigWig files using BEDTools. (genomecov -bga)
AtacWorks_train.err.txt
ATACWorks_train.out.txt
The text was updated successfully, but these errors were encountered: