You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It took me a bit to figure out how to fetch a local file for an extract config, because it's not really clear in the documentation or the library what to use for the working directory. My directory structure looks like this:
After a bit of fooling around, I managed to get what I wanted (load gen_manifest.csv for extraction and use s3_scrape.csv to supplement it), but it was really just after guessing what the paths should be. Here's what I mean:
So using the file protocol I have to go up two directories, but for reading with pandas I don't. Changing how this works in the library probably breaks some previous extract configs, so I assume a change to the documentation is preferable.
The text was updated successfully, but these errors were encountered:
Side question: Is there a reason why you're loading a second file manually and merging it in do_after_read instead of combining them in the transform module?
Side question: Is there a reason why you're loading a second file manually and merging it in do_after_read instead of combining them in the transform module?
No. I haven't done an ingest in a while, so I was referencing the Chung one I did almost a year ago. But I did it there because I was again referencing someone else's ingest package. So it seems like it happens sometimes.
It took me a bit to figure out how to fetch a local file for an extract config, because it's not really clear in the documentation or the library what to use for the working directory. My directory structure looks like this:
After a bit of fooling around, I managed to get what I wanted (load
gen_manifest.csv
for extraction and uses3_scrape.csv
to supplement it), but it was really just after guessing what the paths should be. Here's what I mean:genomic.py
So using the file protocol I have to go up two directories, but for reading with pandas I don't. Changing how this works in the library probably breaks some previous extract configs, so I assume a change to the documentation is preferable.
The text was updated successfully, but these errors were encountered: