Add risky word alerts to cases uploaded by recap.email #49
Comments
From an architecture perspective, I think this is very similar to freelawproject/courtlistener#612, but it triggers different email content.
I think there is a set of operations that should be performed as we ingest a PDF. We already convert to text, and we will shortly be able to run an X-ray pass looking for bad redactions. Besides that, we should be running eyecite to find and create citation links. A word/phrase search seems to be another option. I'd suggest that this be created as another microservice provided by doctor. The caller would send a document (as text) and a list of phrases, each with an associated token. The service would return a list of tokens and the positions in the document where each phrase was found. The caller can then use those tokens to work out who gets the alert.
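To make the proposed request/response shape concrete, here is a minimal sketch of the matching step in Python. It is only an illustration: the names `find_phrases` and `PhraseHit` are hypothetical, not part of doctor or CourtListener, and case-insensitive whole-word matching is an assumption rather than something settled in this thread.

```python
import re
from typing import NamedTuple


class PhraseHit(NamedTuple):
    token: str  # caller-supplied identifier for the phrase (hypothetical field)
    start: int  # character offset where the match begins
    end: int    # character offset just past the end of the match


def find_phrases(text: str, phrases: dict[str, str]) -> list[PhraseHit]:
    """Return every occurrence of each phrase, tagged with its token.

    `phrases` maps token -> phrase. Matching is case-insensitive and
    requires that the phrase not be embedded inside a longer word
    (an assumption made for this sketch).
    """
    hits = []
    for token, phrase in phrases.items():
        # re.escape keeps punctuation in the phrase from acting as regex syntax.
        pattern = re.compile(
            r"(?<!\w)" + re.escape(phrase) + r"(?!\w)", re.IGNORECASE
        )
        for match in pattern.finditer(text):
            hits.append(PhraseHit(token, match.start(), match.end()))
    # Report hits in document order.
    return sorted(hits, key=lambda hit: hit.start)


if __name__ == "__main__":
    sample = "The exhibit was filed under seal. Sealed documents follow."
    for hit in find_phrases(sample, {"alert-1": "under seal", "alert-2": "sealed"}):
        print(hit.token, hit.start, hit.end, repr(sample[hit.start:hit.end]))
```

The caller keeps its own mapping from token to alert subscriber, so the service itself never needs to know who is being notified.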
Doctor is really about converting documents, so I don't think this CL-specific feature belongs there. I think it would work better as a search query, and we'll have the architecture for that fairly soon. One thing that does merit cleanup is consolidating all these tasks into a better pipeline, though even that is tricky: some steps can run before text is extracted (text extraction itself, for example), while others need the text (X-ray, citation extraction, alerts). That only matters because text extraction is async, which makes it a bit of a pain to lace everything together nicely, but it's not a huge problem.
See also #45, which allows for customized include/exclude words for docket alerts.
I haven't thought through this completely yet, but it's an idea that I think people might find very useful.
Once we have alerts for RECAP content, as part of freelawproject/courtlistener#612 and freelawproject/courtlistener#1234, we should think about a new kind of per-case alert that we can suggest to people using @recap.email.
Imagine the following:
The idea here is to build on #62, so that when you start using @recap.email you get some immediate benefits and audits on the cases you are working on.
Of course, this could probably also have a webhook, but maybe that's unnecessary? Webhook all the things?