Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions about signac #7

Open
HelloWorldLTY opened this issue Aug 20, 2022 · 2 comments
Open

Some questions about signac #7

HelloWorldLTY opened this issue Aug 20, 2022 · 2 comments

Comments

@HelloWorldLTY
Copy link

Hi, I found that most of the datasets including atac information are very large, and it takes me very long time to run the step generating chr-gene relation. Therefore, could you please offer me some suggestions about this problem? Or is it possible to run your model without this relation as an input? Thanks a lot!

@PeterZZQ
Copy link
Owner

Hello, do you mean that the script calc_gact.R runs super slow? The script basically loops through all the regions in the count matrix and finds a gene for each region. The running time depends on the number of regions you have. You can filter some regions in advance before putting the regions into the script. It shouldn't cost more than an hour in most cases if you don't filter the genes.

The relationship is required in scDART. If you wish to train without the relationship, you can remove the corresponding loss in the model. I believe it should also work, but it would be a different framework.

@HelloWorldLTY
Copy link
Author

Ok, thanks for your help. I will reconsider my filter threshold.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants