Full text search capability for text, including OCR text—when searching within Hyku, search results should include the text of embedded pdfs #772

Closed
1 of 4 tasks
Tracked by #903
jillpe opened this issue Sep 8, 2023 · 1 comment

jillpe commented Sep 8, 2023

Summary

PALs would like full-text search, including OCR text, so that text inside a PDF can be found through the catalogue search

Acceptance Criteria

  • A PDF can be found by entering its text in the catalogue search
  • When the PDF appears in the search results, a text snippet displays with its search result

Testing Instructions

  • Create a work with a PDF and make sure the viewer is visible

In-Viewer Search

  • Click the search icon at the top of the viewer
  • Search for a term
    • the term is identified and highlighted on the PDF wherever it occurs

Catalogue Search

  • In the catalogue search, search for a term that appears in the PDF
    • the PDF shows as a search result and the term is highlighted in the text snippet that appears with it

jillpe commented Sep 13, 2023

SoftServ QA:

PDF tested

In-viewer search ✅

Catalogue Search ✅

jillpe moved this from Ready for Development to ReShare QA in palni-palci Sep 13, 2023
jillpe moved this from ReShare QA to PALs QA in palni-palci Sep 13, 2023
ndroark moved this from PALs QA to Deploy to Production in palni-palci Sep 14, 2023
kirkkwang added a commit that referenced this issue Sep 15, 2023
Sometimes when you search for a phrase the results will throw an
exception.  This commit will fix that by calling the IIIF Print version
of the `#render_ocr_snippets` method.

Ref:
  - #772
ShanaLMoore moved this from Deploy to Production to Client Verification in palni-palci Sep 21, 2023
ndroark moved this from Client Verification to Done in palni-palci Oct 2, 2023
jillpe closed this as completed Oct 3, 2023
jeremyf pushed a commit to samvera/hyku that referenced this issue Dec 15, 2023
Sometimes when you search for a phrase the results will throw an
exception.  This commit will fix that by calling the IIIF Print version
of the `#render_ocr_snippets` method.

Ref:
  - scientist-softserv/palni-palci#772