how can i solve this metasub_utils.pangea.cli #14

Open
wnvcks opened this issue Mar 10, 2021 · 2 comments

Comments


wnvcks commented Mar 10, 2021

  • MetaSUB Utilities version: v1.2.0
  • Python version: 3.8.5
  • Operating System: Linux
  • aws-cli version: 1.19.24

Description

Hi, my name is Juchan Hwang, from Institut Pasteur.
When I tried to find the Seoul_COVID19 data, I could not find the recent project haib20CEM6134.
I can only see files listed under project haib18CEM5453.

I also get an import message when I run the command shown under "What I Did".

What I Did

root@juchan-VirtualBox:~/Desktop# metasub wasabi list nonhuman-reads --city-name seoul > all_seoul_sample_list.txt
Unable to import metasub_utils.pangea.cli

root@juchan-VirtualBox:~/Desktop# 

This is part of the list that the command produced. Project haib20CEM6134, which should hold the Seoul_COVID19 results, does not appear in it:

s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/haib18CEM5453_HNHKFCCXY_SL345999.filter_human_dna.nonhuman_read1.fastq.gz
s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/haib18CEM5453_HNHKFCCXY_SL345999.filter_human_dna.nonhuman_read2.fastq.gz
s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/haib18CEM5453_HNHKFCCXY_SL346000.filter_human_dna.nonhuman_read1.fastq.gz
s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/haib18CEM5453_HNHKFCCXY_SL346000.filter_human_dna.nonhuman_read2.fastq.gz
s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/haib18CEM5453_HNHKFCCXY_SL346001.filter_human_dna.nonhuman_read1.fastq.gz
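
One way to confirm which sequencing projects appear in a saved listing is to pull the project ID out of each path. This is a minimal sketch, assuming the project ID is the path component right after hudson_alpha_library, as in the lines shown above. The function name count_projects is hypothetical and is not part of metasub_utils.

```python
from collections import Counter


def count_projects(lines):
    """Count listed files per project ID.

    Assumes paths shaped like
    s3://metasub/human_filtered_data/hudson_alpha_library/<project>/<flowcell>/<file>
    (the layout seen in the listing above). Lines with another layout,
    such as warning messages, are skipped.
    """
    counts = Counter()
    for line in lines:
        parts = line.strip().split("/")
        if "hudson_alpha_library" in parts:
            idx = parts.index("hudson_alpha_library")
            if idx + 1 < len(parts):
                counts[parts[idx + 1]] += 1
    return counts


# Two lines copied from the listing above, used here as sample input.
sample = [
    "s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/"
    "haib18CEM5453_HNHKFCCXY_SL345999.filter_human_dna.nonhuman_read1.fastq.gz",
    "s3://metasub/human_filtered_data/hudson_alpha_library/haib18CEM5453/HNHKFCCXY/"
    "haib18CEM5453_HNHKFCCXY_SL345999.filter_human_dna.nonhuman_read2.fastq.gz",
]
counts = count_projects(sample)
for project, n_files in sorted(counts.items()):
    print(project, n_files)
print("haib20CEM6134 listed:", "haib20CEM6134" in counts)
```

To check the full listing, pass an open file handle for all_seoul_sample_list.txt to count_projects instead of the sample list.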

Contributor

dcdanko commented Mar 10, 2021

Thank you for the detailed issue. This interface is actually deprecated (my bad for not making that clear). To download MetaSUB data you should use the Pangea API. Hopefully the info below gets you up and running; if not, please keep replying to this thread.

Directions for using the API are here: https://pangeabio.io/docs/how-to-download-data
The data for Seoul is here: https://pangeabio.io/sample-groups/7a01ea3e-df47-4a04-94f8-969ecf5ae2b6

Ultimately, the command you'll need to run is:

pangea-api download sample-results --module-name "cap2::clean_reads" "MetaSUB Consortium" "MetaSUB Seoul"

This will download files one at a time. You may want to add the --urls-only flag to the above command to get a list of URLs instead, which you can then download with wget or similar.
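
The URL-list route can be sketched as follows. This assumes the --urls-only output was redirected to a text file with one URL per line; the exact output format is not confirmed in this thread, so non-URL lines are ignored. The helper names read_urls, local_name, and download_all are hypothetical, and the sketch uses only the Python standard library in place of wget.

```python
import os
import shutil
import urllib.request
from urllib.parse import urlparse


def read_urls(lines):
    """Return the lines that look like http(s) URLs, ignoring anything else."""
    urls = []
    for line in lines:
        text = line.strip()
        if text.startswith(("http://", "https://")):
            urls.append(text)
    return urls


def local_name(url):
    """Derive a local file name from the URL path, dropping any query string."""
    return os.path.basename(urlparse(url).path)


def download_all(urls, out_dir="."):
    """Download each URL in turn, skipping files that already exist locally."""
    os.makedirs(out_dir, exist_ok=True)
    for url in urls:
        name = local_name(url)
        if not name:
            # URL path ends in "/"; there is no file name to save under.
            continue
        target = os.path.join(out_dir, name)
        if os.path.exists(target):
            continue
        with urllib.request.urlopen(url) as response, open(target, "wb") as out:
            shutil.copyfileobj(response, out)
```

With the URL list saved to a file (for example a hypothetical seoul_urls.txt), calling download_all(read_urls(open("seoul_urls.txt"))) fetches the files one by one. Because existing files are skipped, remove any partially downloaded file before rerunning after an interruption.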

@tiana-yuchen

Hi. Thanks for your amazing work! I could not access Pangea to register or to browse the organization/data repo. It always shows "Error: Cannot destructure property 'status' of 'error.response' as it is undefined." Is it because the service has been suspended? Thanks!
