can not access the GDC dataset #12

Closed
JianGuoZhou3 opened this issue Sep 26, 2019 · 9 comments

JianGuoZhou3 commented Sep 26, 2019

> XenaGenerate(subset = XenaHostNames=="gdcHub") %>% 
+   XenaFilter(filterDatasets = "methylation|phenotype") %>% 
+   XenaFilter(filterDatasets = "UCS") -> df_todo
> XenaQuery(df_todo) %>%
+   XenaDownload() -> xe_download
This will check url status, please be patient.
All downloaded files will under directory /tmp/RtmpciLecI.
The 'trans_slash' option is FALSE, keep same directory structure as Xena.
Creating directories for datasets...
'/tmp/RtmpciLecI/TCGA-UCS/Xena_Matrices' already exists
'/tmp/RtmpciLecI/TCGA-UCS/Xena_Matrices' already exists
/tmp/RtmpciLecI/TCGA-UCS/Xena_Matrices/TCGA-UCS.GDC_phenotype.tsv.gz, the file has been download!
/tmp/RtmpciLecI/TCGA-UCS/Xena_Matrices/TCGA-UCS.methylation450.tsv.gz, the file has been download!


JianGuoZhou3 commented Sep 26, 2019

Another question: can the package resume an interrupted download?

> XenaQuery(df_todo) %>%
+   XenaDownload() -> xe_download
This will check url status, please be patient.
All downloaded files will under directory /tmp/RtmpciLecI.
The 'trans_slash' option is FALSE, keep same directory structure as Xena.
Creating directories for datasets...
'/tmp/RtmpciLecI/TCGA.HNSC.sampleMap' already exists
Downloading TCGA.HNSC.sampleMap/HumanMethylation450.gz
trying URL 'https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450.gz'
Content type 'application/gzip' length 516443612 bytes (492.5 MB)
=======================================
downloaded 388.7 MB

downloaded length 407552000 != reported length 516443612
URL 'https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450.gz': status was 'Transferred a partial file'
Can not find fileTCGA.HNSC.sampleMap/HumanMethylation450.gz, this file maybe not compressed.
Try downloading fileTCGA.HNSC.sampleMap/HumanMethylation450...
trying URL 'https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450'
cannot open URL 'https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450': HTTP status was '404 Not Found'
Error in download.file(x[3], destfile = x[5], ...) : 
  cannot open URL 'https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450'

ShixiangWang (Member) commented

@jianguozhouzunyimedicaluniversity Thank you for reporting this issue. I can reproduce it.

It seems the GDC hub uses a different download URL layout from the one the package assumes. I will try to fix this.

https://gdc.xenahubs.net/download/TCGA-UCS.GDC_phenotype.tsv.gz  # Actual link

https://gdc.xenahubs.net/download/TCGA-UCS/Xena_Matrices/TCGA-UCS.GDC_phenotype.tsv.gz  # The link this package tries to download
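
The difference between the two links can be sketched in a few lines of R. This is only an illustration of the mismatch, not the package's actual fix: `build_download_url` and its `flat` argument are hypothetical names, the dataset identifier is inferred from the failing link, and it is an assumption that every GDC hub dataset follows the same flat layout.

```r
# Hypothetical helper (not part of UCSCXenaTools): build a download URL,
# optionally dropping the directory prefix of the dataset identifier.
build_download_url <- function(host_url, dataset, flat = FALSE) {
  # basename() keeps only the final path component, e.g. the file name
  path <- if (flat) basename(dataset) else dataset
  # Strip any trailing slash from the host before appending the path
  paste0(sub("/+$", "", host_url), "/download/", path)
}

# Dataset identifier inferred from the failing link (assumption)
dataset <- "TCGA-UCS/Xena_Matrices/TCGA-UCS.GDC_phenotype.tsv.gz"

nested_url <- build_download_url("https://gdc.xenahubs.net", dataset)
flat_url   <- build_download_url("https://gdc.xenahubs.net", dataset, flat = TRUE)

print(nested_url)  # the link the package tried
print(flat_url)    # the link that actually exists
```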

By the way, an interrupted download cannot currently be resumed. I will look into whether this can be implemented.

Thanks again.
Shixiang

@ShixiangWang ShixiangWang self-assigned this Sep 26, 2019
@ShixiangWang ShixiangWang added the bug Something isn't working label Sep 26, 2019

JianGuoZhou3 commented Sep 26, 2019 via email

ShixiangWang (Member) commented

Could you install the latest version from GitHub and check whether the problem is solved?


JianGuoZhou3 commented Sep 26, 2019 via email

ShixiangWang (Member) commented

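I have tested this: the server currently does not support resuming interrupted downloads, so even if I wrote the code it would not work. I have emailed UCSC Xena to ask about it and will reply to you when I hear back.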
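
One common way to check for resume support is to look for an `Accept-Ranges: bytes` response header. The following is a minimal sketch, assuming the headers are fetched with base R's `curlGetHeaders()`. `supports_resume` is a hypothetical helper, the sample headers are illustrative rather than captured from the Xena server, and the header is only a hint: a server may honor or ignore range requests regardless of what it advertises.

```r
# Hypothetical helper: TRUE if any header line advertises byte-range support.
supports_resume <- function(headers) {
  # curlGetHeaders() returns lines ending in "\r\n", so trim them first
  any(grepl("^accept-ranges:\\s*bytes", trimws(headers), ignore.case = TRUE))
}

# Illustrative headers only (not real server output)
with_ranges <- c("HTTP/1.1 200 OK\r\n",
                 "Accept-Ranges: bytes\r\n",
                 "Content-Type: application/gzip\r\n")
without_ranges <- c("HTTP/1.1 200 OK\r\n",
                    "Content-Type: application/gzip\r\n")

print(supports_resume(with_ranges))
print(supports_resume(without_ranges))

# Against a live server (needs network access):
# url <- "https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450.gz"
# supports_resume(curlGetHeaders(url))
```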


JianGuoZhou3 commented Sep 26, 2019 via email

ShixiangWang (Member) commented

@jianguozhouzunyimedicaluniversity UCSC Xena now supports resuming interrupted downloads.

Here is some example code:

library(UCSCXenaTools)
xe = XenaGenerate(subset = XenaDatasets == "TcgaTargetGtex_expected_count")
xe
xq = XenaQuery(xe)
# The default download method cannot resume an interrupted download
XenaDownload(xq, destdir = "~/test/", force = TRUE)
# Resume with the 'curl' command ("-C -" continues from where the partial file ends)
XenaDownload(xq, destdir = "~/test/", method = "curl", extra = "-C -", force = TRUE)
# Resume with the 'wget' command ("-c" continues a partially downloaded file)
XenaDownload(xq, destdir = "~/test/", method = "wget", extra = "-c", force = TRUE)
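
The `method` and `extra` arguments above appear to be forwarded to base R's `download.file()` (the earlier traceback shows `download.file(x[3], destfile = x[5], ...)`). Under that assumption, the same resume behaviour can be sketched without the package. `resume_flag` and `resume_download` are hypothetical helper names, and the chosen tool (`curl` or `wget`) must be installed and on the `PATH`.

```r
# Hypothetical helper: the command-line flag that asks each tool to resume.
resume_flag <- function(tool = c("curl", "wget")) {
  tool <- match.arg(tool)
  switch(tool,
         curl = "-C -",  # curl: continue at the offset of the partial file
         wget = "-c")    # wget: continue a partially downloaded file
}

# Hypothetical wrapper around base R's download.file().
resume_download <- function(url, destfile, tool = c("curl", "wget")) {
  tool <- match.arg(tool)
  download.file(url, destfile = destfile, method = tool,
                extra = resume_flag(tool))
}

print(resume_flag("curl"))
print(resume_flag("wget"))

# Example call (needs network access; rerun after an interruption to resume):
# resume_download(
#   "https://tcga.xenahubs.net/download/TCGA.HNSC.sampleMap/HumanMethylation450.gz",
#   "HumanMethylation450.gz"
# )
```

Rerunning the same call after a failure picks up the existing partial file at `destfile` instead of starting over, provided the server honors range requests.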


JianGuoZhou3 commented Oct 8, 2019 via email
