-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement prototype search for guidance policy #3554
Comments
With the list above and a related spreadsheet shared with @jason-upchurch, I'm passing this issue on to him to work on the rest! I'm available for any content-type questions that might come up. cc: @JonellaCulmer |
tentative xml sitemap. Sent to search.gov and awaiting next steps to begin search testing. <?xml version="1.0" encoding="UTF-8"?>
<urlset>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/fecfrm1m.pdf</loc>
<lastmod>2017-08-17T09:25:48+00:00</lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/fedreg_notice_2019-10_07292019.pdf</loc>
<lastmod>2019-09-17T08:00:33+00:00</lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/candgui.pdf</loc>
<lastmod>2018-10-03T11:07:29+00:00</lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/guideline-for-presentation-good-order.pdf</loc>
<lastmod>2020-02-05T13:12:21+00:00</lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/fecfrm1mi.pdf</loc>
<lastmod>2018-09-21T14:50:43+00:00</lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/fecfrm5.pdf</loc>
<lastmod>2018-03-16T10:58:46+00:00</lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/fedreg_notice2003-9.pdf</loc>
<lastmod></lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/enforcementprocedures_hearingtranscript-6-11-2003.pdf</loc>
<lastmod></lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/comment_democracy21_05222019.pdf</loc>
<lastmod></lastmod>
</url>
<url>
<loc>https://cg-47928592-406c-4536-8234-99b896e8d57d.s3-us-gov-west-1.amazonaws.com/cms-content/documents/comment_campaign_legal_center_05232019.pdf</loc>
<lastmod></lastmod>
</url>
</urlset> |
For posterity: search.gov attempted to index based on above but hit 403 error. We asked cloud.gov to whitelist search.gov's IP address. This was done and we are awaiting the indexing to test success/failure. coordinating with @dorothyyeager @AmyKort to plan for next steps per @PaulClark2 cc @pkfec |
Summary
This ticket is a prototype implementation ticket to continue work done under issue #3489, #3488, #3527. The main focus of this ticket is to test searchability of a handful of pdfs using a limited scope and functionality as a proof-of-concept/prototype. Follow-on work/tickets is/are identified in the high-level completion criteria below.
High-level completion criteria
content-s3
that should be made searchable (content work, cc @dorothyyeager @kathycarothers )i14y
search architecture to thesitemap.xml
architecture (larger scope and resource question: @PaulClark2 @AmyKort @patphongs @rfultz @johnnyporkchops @jason-upchurch )The text was updated successfully, but these errors were encountered: