Fix PDF scanner + support image extraction #1

cameron-dunn-sublime · 2021-10-14T02:01:31Z

xref_length was erroring on the old version of PyMuPDF.

mupdf_display_errors errored with the new version, so it was removed.

Image extraction from PDFs is a new feature.

I don't know why GitHub says that I'm merging 6 commits. Local git log:

nothing added to commit but untracked files present (use "git add" to track)
➜  strelka git:(cd.images-from-pdf) ✗  glg
commit cda2495b24c2661f251288ea3ec1f191ade39bc9 (HEAD -> cd.images-from-pdf, sublime/cd.images-from-pdf)
Author: Cameron Dunn <[email protected]>
Date:   Wed Oct 13 18:57:47 2021 -0700

    Fix PDF scanner + support image extraction

    xref_length was erroring on the old version of PyMuPDF.

    mupdf_display_errors errored with the new version, so it was removed.

    Image extraction from PDFs is a new feature.

 build/python/backend/requirements.txt   |  2 +-
 src/python/strelka/scanners/scan_pdf.py | 19 ++++++++++++++++---
 2 files changed, 17 insertions(+), 4 deletions(-)

commit d9086f35d709592733ff690ed3a9ddeff5bbb433 (sublime/master, origin/master, origin/HEAD, master)
Author: Paul Hutelmyer <[email protected]>
Date:   Tue Oct 12 08:12:36 2021 -0400

It shouldn't matter though.

Update exiftool to latest

Backend reported errors parsing previously.

Fix K8S backend configmap yaml

xref_length was erroring on the old version of PyMuPDF. mupdf_display_errors errored with the new version, so it was removed. Image extraction from PDFs is a new feature.

cameron-dunn-sublime · 2021-10-18T19:13:37Z

@jkamdjou this is the change I made while we were pairing the other day. I'll probably try and cleanup further and give a PR to target/strelka but in the mean time we can commit this to our [public] fork.

cameron-dunn-sublime and others added 6 commits October 4, 2021 14:00

Update exiftool to latest

274f116

Merge pull request target#180 from sublime-security/upgrade-exiftool

34c3f2d

Update exiftool to latest

Fix K8S backend configmap yaml

1ac0037

Backend reported errors parsing previously.

Merge pull request target#181 from sublime-security/backend-k8s-yml

12c1bbc

Fix K8S backend configmap yaml

Update CHANGELOG.md

d9086f3

Fix PDF scanner + support image extraction

cda2495

xref_length was erroring on the old version of PyMuPDF. mupdf_display_errors errored with the new version, so it was removed. Image extraction from PDFs is a new feature.

cameron-dunn-sublime changed the title ~~Cd.images from pdf~~ Fix PDF scanner + support image extraction Oct 14, 2021

cameron-dunn-sublime requested a review from jkamdjou October 18, 2021 19:12

cameron-dunn-sublime marked this pull request as ready for review October 18, 2021 19:13

jkamdjou approved these changes Oct 20, 2021

View reviewed changes

cameron-dunn-sublime merged commit 3abbf1d into master Oct 20, 2021

cameron-dunn-sublime deleted the cd.images-from-pdf branch October 20, 2021 19:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix PDF scanner + support image extraction #1

Fix PDF scanner + support image extraction #1

cameron-dunn-sublime commented Oct 14, 2021 •

edited

Loading

cameron-dunn-sublime commented Oct 18, 2021

Fix PDF scanner + support image extraction #1

Fix PDF scanner + support image extraction #1

Conversation

cameron-dunn-sublime commented Oct 14, 2021 • edited Loading

cameron-dunn-sublime commented Oct 18, 2021

cameron-dunn-sublime commented Oct 14, 2021 •

edited

Loading