-
Notifications
You must be signed in to change notification settings - Fork 10
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #319 from biglocalnews/precommit
Added pre-commit with first round of fixes
- Loading branch information
Showing
53 changed files
with
192 additions
and
92 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,49 @@ | ||
# See https://pre-commit.com for more information | ||
# See https://pre-commit.com/hooks.html for more hooks | ||
repos: | ||
- repo: https://github.com/pre-commit/pre-commit-hooks | ||
rev: v3.2.0 | ||
hooks: | ||
- id: trailing-whitespace | ||
language_version: python3 | ||
- id: end-of-file-fixer | ||
language_version: python3 | ||
- id: check-yaml | ||
language_version: python3 | ||
- id: check-added-large-files | ||
language_version: python3 | ||
- id: check-byte-order-marker | ||
language_version: python3 | ||
- id: check-case-conflict | ||
language_version: python3 | ||
- id: check-json | ||
language_version: python3 | ||
- id: mixed-line-ending | ||
language_version: python3 | ||
- repo: https://github.com/psf/black | ||
rev: 21.12b0 | ||
hooks: | ||
- id: black | ||
language_version: python3 | ||
- repo: https://github.com/asottile/blacken-docs | ||
rev: v1.8.0 | ||
hooks: | ||
- id: blacken-docs | ||
additional_dependencies: [black] | ||
language_version: python3 | ||
- repo: https://github.com/timothycrosley/isort | ||
rev: 5.7.0 | ||
hooks: | ||
- id: isort | ||
language_version: python3 | ||
- repo: https://gitlab.com/pycqa/flake8 | ||
rev: 3.8.4 | ||
hooks: | ||
- id: flake8 | ||
language_version: python3 | ||
- repo: https://github.com/asottile/pyupgrade | ||
rev: v2.7.2 | ||
hooks: | ||
- id: pyupgrade | ||
language_version: python3 | ||
args: [--py37-plus] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
# | ||
# Colors | ||
# | ||
# | ||
|
||
# Define ANSI color codes | ||
RESET_COLOR = \033[m | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -8,7 +8,7 @@ | |
-Media Inquiries: 334.242.0400 | ||
-Email: [email protected] | ||
- WARN Data Contact found by Cody: Jessica D. (phone # unknown) | ||
|
||
### July 21, 2021 12PM | ||
I had an intuition that I needed to call earlier in order to get a response, so I did! I called back and I was transferred to Jessica D. (sounded like Jessica Dent or Dint?). Unfortunately I had to leave a voicemail again, but at least I got somewhere. | ||
|
||
|
@@ -21,7 +21,7 @@ Email follow-up | |
### July 1, 2021 | ||
Called the Media Inquiries number, left a voicemail with my callback number. Called the Toll Free number, left a voicemail with my callback number. Sent an email to the contact address with my questions. | ||
|
||
### Jun 30th, 2021 | ||
### Jun 30th, 2021 | ||
|
||
The following are some specific issues with this data that needs further processing and phone calls: | ||
- Q: I noticed in the last 8 lines of the data there are alternating rows of future dates and jan 1 1970, with some strange characters in the other fields. Do these lines mean anything, or can we discard them? | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -7,10 +7,10 @@ WARN site: https://cdle.colorado.gov/employers/layoff-separations/layoff-warn-li | |
Website feedback google form: https://docs.google.com/forms/d/e/1FAIpQLSdlZ6HyYpV4wzE-BMBS3S9YDclg5hfyaJlDblwXkETdMWFs1A/viewform (in case the other email isnt useful) | ||
|
||
### Email sent to [email protected] Aug 6, 2021 | ||
>Hi, I'm emailing on behalf of Stanford's Big Local News project. We're trying to increase journalists' access to WARN data, and we had a question about Colorado's WARN data. | ||
>Hi, I'm emailing on behalf of Stanford's Big Local News project. We're trying to increase journalists' access to WARN data, and we had a question about Colorado's WARN data. | ||
>In comparing your 2021 data with previous years, I'm wondering if 'layoff total' in the archived data (2015-2019) represents a sum of the 'permanent layoffs', 'temporary layoffs', and 'furloughs' columns in the 2021 data. Would that be a fair assessment? | ||
>Thanks for your help! | ||
>(P.S: if you happen to have any data for Colorado before 2015, that would be nice to have!) | ||
Response Received: | ||
> Thank you for your question. You are correct. The Total layoffs from 2015-2019 includes all permanent, temporary and furloughs reported under WARN for those years. We evolved the data as time progressed and we better understood the data needs of the public. | ||
> Thank you for your question. You are correct. The Total layoffs from 2015-2019 includes all permanent, temporary and furloughs reported under WARN for those years. We evolved the data as time progressed and we better understood the data needs of the public. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -13,7 +13,7 @@ Indianapolis, IN 46204 | |
[email protected] | ||
|
||
### July 21, 2021 | ||
Response: | ||
Response: | ||
|
||
>Cody: | ||
>This error should be fixed soon. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,18 +3,18 @@ | |
- [Homepage](https://dol.ny.gov/warn-notices) | ||
- To request for full documentation of any notices prior to 2021, email [email protected] | ||
- [Contacts](https://dol.ny.gov/worker-adjustment-and-retraining-notification-warn) | ||
- NYS WARN Coordinator: Janet Faraone (phone: 518-457-1518, email: [[email protected]]([email protected])) | ||
- NYS WARN Coordinator: Janet Faraone (phone: 518-457-1518, email: [[email protected]]([email protected])) | ||
|
||
### Jan 5, 2022 | ||
We've implemented historical document scraping for NY (issue #136) and removed the old website scraping system while we wait for their website to be upgraded. As mentioned in July, we should implement a system of periodically requesting a historical document from NY. We are hosting this historical document at https://storage.googleapis.com/bln-data-public/warn-layoffs/ny_historical.xlsx . | ||
|
||
### Jul 12th, 2021 | ||
reaching out to the NYS WARN coordinator was fruitful. the state agency prepared and sent over an excel containing records from 2016 to 2021 within a day upon email request. Here is a snippet of the data: | ||
reaching out to the NYS WARN coordinator was fruitful. the state agency prepared and sent over an excel containing records from 2016 to 2021 within a day upon email request. Here is a snippet of the data: | ||
![image](https://user-images.githubusercontent.com/56002814/125342121-736f4f80-e322-11eb-8c22-b1f22346ded7.png) | ||
The full dataset is currently attached to issue #136, it should be hosted on some platform in the future. | ||
When requesting the data, the state agency asked for a timeframe from which data is needed (i responded with 2016-2021), but it might be possible to request data prior to 2016 as well. | ||
When requesting the data, the state agency asked for a timeframe from which data is needed (i responded with 2016-2021), but it might be possible to request data prior to 2016 as well. | ||
Regarding updating data in the future, the NYS WARN website is undergoing updates, and it should allow scraping in the future. In the meantime, we should periodically request updated data from NYS. | ||
|
||
### Jul 6th, 2021 | ||
- 2021 data doesn't directly contain # employee affected in the html table on the website, needs to parse the corresponding pdf to obtain the # affected. | ||
- for data prior to 2021, each year is contained in a pdf and doesn't contain # affected, needs to reach out and ask if there is a better form of data/access company specific data. | ||
- 2021 data doesn't directly contain # employee affected in the html table on the website, needs to parse the corresponding pdf to obtain the # affected. | ||
- for data prior to 2021, each year is contained in a pdf and doesn't contain # affected, needs to reach out and ask if there is a better form of data/access company specific data. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,7 +4,7 @@ URL: https://www.twc.texas.gov/businesses/worker-adjustment-and-retraining-notif | |
"To access older WARN notices or if you have any questions regarding WARN notices, contact TWC at [email protected]." | ||
|
||
### 1/5/22 | ||
Just want to document here that TX is an edge-case scraper, meaning that it has a hybrid strategy of both scraping the website and downloading a historical document, merging the two data. We've hosted the document at https://storage.googleapis.com/bln-data-public/warn-layoffs/tx_historical.xlsx | ||
Just want to document here that TX is an edge-case scraper, meaning that it has a hybrid strategy of both scraping the website and downloading a historical document, merging the two data. We've hosted the document at https://storage.googleapis.com/bln-data-public/warn-layoffs/tx_historical.xlsx | ||
|
||
### 8/19/21 | ||
**Cody:** "Hi Francisco, | ||
|
@@ -23,14 +23,14 @@ So it seems like our guess was correct, that the xtra data included in the histo | |
|
||
### 8/18/21 | ||
|
||
Noticed historical data seemed more comprehensive than yearly data for 2019 (see WARN/#227). Sent an email bringing to their attention & asking which is the best source of the WARN data. | ||
Noticed historical data seemed more comprehensive than yearly data for 2019 (see WARN/#227). Sent an email bringing to their attention & asking which is the best source of the WARN data. | ||
|
||
"Thanks so much for this data. I noticed something interesting that I thought you might like to be aware of. After looking at the differences between the WARN historical file you provided and the 2019 data available on the public-facing website, I noticed some interesting discrepancies: https://www.diffchecker.com/LBUWndL9I was surprised to find that the historical data includes about 12 additional rows.I think this trend might extend to other years as well. I am wondering if you all are aware of this, and if so, whether the historical data or the data on the website would be a more accurate source for Texas' WARN information." | ||
|
||
|
||
### 8/16/2021 | ||
|
||
Received a reply from TX from [email protected], with the following .xlsx file. | ||
Received a reply from TX from [email protected], with the following .xlsx file. | ||
|
||
[Warns 01-01-89-09-30-19.xlsx](https://github.com/biglocalnews/WARN/files/6994307/Warns.01-01-89-09-30-19.xlsx) | ||
|
||
|
@@ -39,5 +39,5 @@ This file seems to overlap quite a bit with our website scraper, maybe the websi | |
|
||
### 8/13/21 | ||
|
||
Sent an email to the provided email address requesting WARN from prior years. Received an email back from [email protected] with the historical WARN data. He also CC'd "Hession,Margaret" <[email protected]> and | ||
Sent an email to the provided email address requesting WARN from prior years. Received an email back from [email protected] with the historical WARN data. He also CC'd "Hession,Margaret" <[email protected]> and | ||
"Bernsen,James" <[email protected]> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.