You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I get an Ensembl error when trying to download the hgfemale_gene_ensembl dataset.
mmurdjan_gene_ensembl
Querying mmurdjan_gene_ensembl for attributes {'source', 'zfin_id_id', 'ensembl_peptide_id', 'external_gene_name', 'description', 'gene_biotype', 'ensembl_gene_id', 'entrezgene_id', 'chromosome_name', 'external_gene_source'}.
cldingo_gene_ensembl
Querying cldingo_gene_ensembl for attributes {'source', 'ensembl_peptide_id', 'external_gene_name', 'description', 'gene_biotype', 'ensembl_gene_id', 'entrezgene_id', 'chromosome_name', 'external_gene_source'}.
mgallopavo_gene_ensembl
Querying mgallopavo_gene_ensembl for attributes {'source', 'ensembl_peptide_id', 'external_gene_name', 'description', 'gene_biotype', 'ensembl_gene_id', 'external_synonym', 'entrezgene_id', 'chromosome_name', 'external_gene_source'}.
hgfemale_gene_ensembl
Querying hgfemale_gene_ensembl for attributes {'source', 'zfin_id_id', 'ensembl_peptide_id', 'external_gene_name', 'description', 'gene_biotype', 'ensembl_gene_id', 'external_synonym', 'mgi_id', 'chromosome_name', 'external_gene_source', 'entrezgene_id', 'sgd_gene'}.
RuleException:
_BiomartException in file /Users/gaurav/Development/translator/babel/src/snakefiles/datacollect.snakefile, line 190:
Query ERROR: caught BioMart::Exception::Usage: Too many attributes selected for External References
File "/Users/gaurav/Development/translator/babel/src/snakefiles/datacollect.snakefile", line 190, in __rule_get_ensembl
File "/Users/gaurav/Development/translator/babel/src/datahandlers/ensembl.py", line 30, in pull_ensembl
File "/Users/gaurav/Development/translator/babel/venv/lib/python3.11/site-packages/apybiomart/apybiomart.py", line 84, in query
File "/Users/gaurav/Development/translator/babel/venv/lib/python3.11/site-packages/apybiomart/classes.py", line 386, in query
File "/Users/gaurav/.pyenv/versions/3.11.2/lib/python3.11/concurrent/futures/thread.py", line 58, in run
As far as I can tell, this is because this is the one dataset that contains five ID fields (an old post on Bioconductor suggests that three might be their limit for External Sources).
In recent Babel runs I've just skipped it on the assumption that Naked Mole Rats are not very relevant to Translator, but a more complete solution would be to replace the Ensembl retrieval code so that if we find too many attributes we can query for, we break it up into smaller groups, then reassemble the final dataset.
I get an Ensembl error when trying to download the
hgfemale_gene_ensembl
dataset.As far as I can tell, this is because this is the one dataset that contains five ID fields (an old post on Bioconductor suggests that three might be their limit for External Sources).
In recent Babel runs I've just skipped it on the assumption that Naked Mole Rats are not very relevant to Translator, but a more complete solution would be to replace the Ensembl retrieval code so that if we find too many attributes we can query for, we break it up into smaller groups, then reassemble the final dataset.
Babel/src/datahandlers/ensembl.py
Lines 13 to 32 in 8647162
The text was updated successfully, but these errors were encountered: