Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'Extract Files' operation #440

Merged
merged 25 commits into from
Mar 11, 2019
Merged

'Extract Files' operation #440

merged 25 commits into from
Mar 11, 2019

Conversation

n1474335
Copy link
Member

@n1474335 n1474335 commented Dec 14, 2018

Adds a new 'Extract Files' operation which detects files within a blob of data, parses them, and then extracts them.

Currently supports

  • JPG
  • EXE
  • ZIP
  • PDF
  • PNG
  • BMP
  • FLV
  • RTF
  • DOCX, XLSX, PPTX
  • EPUB
  • GZIP
  • ZLIB
  • ELF, BIN, AXF, O, PRX, SO

A Stream class has been added in src/core/lib/ to help parsing binary protocols.

TODO

  • Refactor file type detection to include more powerful search methods. I expect to address Support magic files for filetype detection #369 with this.
  • Refactor 'Scan for embedded files' to run much more efficiently. Pass offset to magic number check rather than reslicing the blob every time.
  • Add 'Move to input' button to displayFilesAsHTML function
  • Update 'Untar' operation to use the Stream lib.
  • ELF file format extractor

@n1474335 n1474335 self-assigned this Dec 14, 2018
@n1474335 n1474335 mentioned this pull request Dec 30, 2018
@n1474335 n1474335 merged commit cd2c807 into master Mar 11, 2019
@n1474335 n1474335 changed the title WIP: 'Extract Files' operation 'Extract Files' operation Mar 14, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant