AO 2019-01 missing citations #3669

Closed
lbeaufort opened this issue Apr 3, 2019 · 3 comments

@lbeaufort
Member

https://www.fec.gov/data/legal/advisory-opinions/2019-01/

The “cited in this AO” list should appear immediately, and the “this AO is cited by” list should appear after the weekly reload (it should have appeared for this AO last weekend).

@lbeaufort lbeaufort added this to the Sprint 8.5 milestone Apr 3, 2019
@lbeaufort lbeaufort self-assigned this Apr 3, 2019
@lbeaufort lbeaufort added the Bug label Apr 3, 2019
@lbeaufort
Member Author

Looks like AO 2019-01 is missing OCR text, along with others. I emailed Salient to see if they can get the OCR text to appear.

Other AOs are also missing OCR text for their final opinions:

ao_no    ocrtext
2018-06  NULL
2018-15  NULL
2019-01  NULL
2019-02  NULL
2019-03  NULL

SELECT ao_no, ocrtext
FROM aouser.document
INNER JOIN aouser.ao USING (ao_id)
WHERE category = 'Final Opinion'
AND ocrtext IS NULL
ORDER BY ao_no

@pkfec
Contributor

pkfec commented Apr 3, 2019

Waiting to hear from Salient on why the OCR text is missing from their system.

@lbeaufort
Member Author

I identified some other documents that were missing OCR text but weren't NULL, and Salient fixed those as well. These were harder to find: they weren't ' ' either, so I'm not sure how they were saved. I queried the length of the OCR text to find them:

SELECT ao_no, max(char_length(ocrtext))
FROM aouser.document
INNER JOIN aouser.ao USING (ao_id)
WHERE category = 'Final Opinion'
GROUP BY ao_no
HAVING max(char_length(ocrtext)) < 100
ORDER BY ao_no DESC
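
The two queries above catch different failure modes: values that are NULL, and values that are present but too short to be real opinion text. A minimal Python sketch of the same check, applied to rows already fetched from the database, might look like this. The function name `classify_ocrtext`, the category labels, and the sample rows are hypothetical; only the 100-character threshold comes from the query above.

```python
from typing import Optional

# Threshold taken from the HAVING clause above: anything shorter is treated
# as too short to be a real final opinion.
MIN_OCR_LENGTH = 100


def classify_ocrtext(ocrtext: Optional[str], min_length: int = MIN_OCR_LENGTH) -> str:
    """Label a single OCR text value (hypothetical helper, not project code)."""
    if ocrtext is None:
        return "null"        # what `ocrtext IS NULL` finds
    if ocrtext == "":
        return "empty"       # not NULL, but zero characters
    if ocrtext.strip() == "":
        return "whitespace"  # only spaces, tabs, or newlines
    if len(ocrtext) < min_length:
        return "too_short"   # some text, but under the threshold
    return "ok"


# Made-up rows for illustration; these are not real AO records.
rows = [
    ("example-1", None),
    ("example-2", ""),
    ("example-3", "\n\t"),
    ("example-4", "ADVISORY OPINION"),
    ("example-5", "x" * 5000),
]

report = {ao_no: classify_ocrtext(text) for ao_no, text in rows}
suspect = sorted(ao_no for ao_no, label in report.items() if label != "ok")
print(suspect)
```

Separating the labels makes it easier to tell the vendor which kind of gap each document has, rather than reporting a single "missing" list.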

Now that all AOs have OCR text, I ran a global reload so the "cited by" citations would be populated as well.

cf run-task api "python manage.py refresh_current_legal_docs_zero_downtime" -m 4G --name refresh-legal-20190403
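
One way to see why "cited by" needs a reload of every document, while "cited in this AO" can be filled in from a single one: "cited by" is the inverse of "cited in", so it depends on the text of all the other AOs. The sketch below assumes citations are found by scanning OCR text for AO numbers with a regular expression. The pattern, the function names, and the sample text are hypothetical and are not taken from the project's code.

```python
import re
from collections import defaultdict
from typing import Dict, Optional, Set

# Hypothetical pattern for AO numbers such as "2018-06" or "2019-01".
AO_NUMBER = re.compile(r"\b(\d{4}-\d{2,3})\b")


def cited_in(ao_no: str, ocrtext: Optional[str]) -> Set[str]:
    """AO numbers mentioned in one document's text, excluding the AO itself."""
    return set(AO_NUMBER.findall(ocrtext or "")) - {ao_no}


def cited_by(texts: Dict[str, Optional[str]]) -> Dict[str, Set[str]]:
    """Invert the cited-in mapping; this needs the text of every document."""
    inverse: Dict[str, Set[str]] = defaultdict(set)
    for ao_no, text in texts.items():
        for target in cited_in(ao_no, text):
            inverse[target].add(ao_no)
    return dict(inverse)


# Made-up documents for illustration; the AO numbers are not real.
texts = {
    "1900-01": "This opinion relies on Advisory Opinion 1900-02.",
    "1900-02": None,  # missing OCR text, so no citations can be extracted
    "1900-03": "See Advisory Opinions 1900-01 and 1900-02.",
}

print(cited_in("1900-02", texts["1900-02"]))
print(cited_by(texts))
```

In this sketch, a document with missing OCR text contributes nothing in either direction: its own "cited in" set is empty, and it never shows up in another AO's "cited by" set until its text is restored and everything is recomputed.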
