This repository has been archived by the owner on Nov 10, 2020. It is now read-only.

Data: Explore > Jobs > Wage & Salary #926

Closed
shawnbot opened this issue Nov 30, 2015 · 5 comments

@shawnbot
Contributor

This data comes from BLS (here) and needs to be downloaded and transformed to match the format we've been using.

@shawnbot
Contributor Author

shawnbot commented Dec 1, 2015

@Isabelle1512 can you help me figure out which field in the data we should be using? I think it's the one labeled "Annual average of monthly employment levels for a given year" in the field layout:

image

Just want to make sure. Thanks!

@shawnbot
Contributor Author

shawnbot commented Dec 1, 2015

Whoops, I meant to reference the 1975-2013 field layout, which has many more columns:

image

@Isabelle1512

@shawnbot that is the one! (annual_avg_emplvl)

@shawnbot
Contributor Author

shawnbot commented Dec 1, 2015

Thanks, @Isabelle1512!

Quick status update: my first impulse was to stream the zip downloads in Node so that we could programmatically grab the one file we need from each 140 MB (!!) zip, but unzip and unzip2 both have issues with BLS's zip files. So now I'm writing a Makefile to download each zip and extract the CSVs we care about, then putting those into data/_input/bls. After that's done, I'll add those files to git and write a script to stream them all in and convert them into the tabular format that we want, with just the county-level data.
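
For anyone following along, here is a rough sketch of the "pull only the CSVs we care about out of each zip" step. It is not the Makefile described above: it swaps in Python's standard `zipfile` module, and the function name `extract_matching` and the `wanted` predicate are made up for illustration. Which member names to keep is left to the caller, since the thread doesn't spell out the file names inside the BLS archives.

```python
import shutil
import zipfile
from pathlib import Path
from typing import Callable, List


def extract_matching(zip_source, dest_dir, wanted: Callable[[str], bool]) -> List[Path]:
    """Extract only the archive members whose name satisfies `wanted`.

    `zip_source` can be a path or a seekable file-like object.
    Hypothetical helper, not part of the project's actual tooling.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    written: List[Path] = []
    with zipfile.ZipFile(zip_source) as archive:
        for info in archive.infolist():
            if info.is_dir() or not wanted(info.filename):
                continue
            # Drop any directory prefix stored in the archive so the
            # extracted files land directly in dest_dir.
            target = dest / Path(info.filename).name
            # Copy in chunks so a large member is never held fully in memory.
            with archive.open(info) as src, open(target, "wb") as out:
                shutil.copyfileobj(src, out)
            written.append(target)
    return written
```

Something like `extract_matching("some.zip", "data/_input/bls", lambda name: name.endswith(".csv"))` would keep every CSV; a tighter predicate would be needed to grab just one file per archive.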

For now I'm going to focus on getting the NAICS code 21 data only, which is all mining/extractives. This means that we won't have natural resource type breakdowns initially, but we can bring them in during a second pass.
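
A minimal sketch of what that filtering pass might look like, in Python rather than the Node script mentioned above. Only `annual_avg_emplvl` is confirmed in this thread; the `area_fips`, `industry_code`, and `year` column names are assumptions about the CSV layout, the "five digits, not ending in 000" county test is a heuristic, and the output keys (`year`, `fips`, `jobs`) are placeholders for whatever format the site actually uses.

```python
import csv
from typing import Dict, Iterable, Iterator


def county_mining_rows(
    rows: Iterable[Dict[str, str]], naics_code: str = "21"
) -> Iterator[Dict[str, str]]:
    """Yield one trimmed record per county-level row for the given NAICS code.

    Column names other than annual_avg_emplvl are assumptions.
    """
    for row in rows:
        if row.get("industry_code", "").strip() != naics_code:
            continue
        fips = row.get("area_fips", "").strip()
        # Heuristic (assumption): county rows have a five-digit numeric
        # FIPS code that does not end in "000" (statewide/national totals).
        if not (len(fips) == 5 and fips.isdigit() and not fips.endswith("000")):
            continue
        yield {
            "year": row.get("year", "").strip(),
            "fips": fips,
            "jobs": row.get("annual_avg_emplvl", "").strip(),
        }


def read_county_mining(path: str) -> Iterator[Dict[str, str]]:
    """Stream a CSV from disk through the filter, one row at a time."""
    with open(path, newline="") as handle:
        yield from county_mining_rows(csv.DictReader(handle))
```

Because both functions are generators, the files are processed row by row instead of being loaded whole. If the source has several rows per county and year, they would still need to be summed or picked apart in a later step.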

@shawnbot
Contributor Author

shawnbot commented Dec 1, 2015

This is done. We can handle any problems with the data in new issues.


2 participants