Tool to help journalists analyze and sanitize metadata #543

runasand · 2014-09-08T18:44:52Z

As @garrettr said in #519 where we decided to remove MAT from SecureDrop: "I think we're going to start building a tool to help journalists analyze and sanitize metadata after 0.3 is released."

diracdeltas · 2014-11-08T19:37:58Z

MAT is in Tails. Should we just add instructions for journalists to use it? (I haven't tried using it)

psivesely · 2016-11-02T18:28:19Z

This is something that would still be good to add instructions for--maybe this (2016 Aaron Swartz Day) hackathon someone will do it.

I'm adding the Reading Room label to this one because I believe this is something that could be automatically done by the reading room client. A submission is downloaded, then in a DispVM it is authenticated, decrypted, decompressed, and wiped of metadata in that order.

psivesely · 2016-11-02T18:40:50Z

See #497. Metadata may be useful for a journalist trying to verify the authenticity of documents. Therefore, automatic stripping of metadata may not be appropriate. Further, it should be noted MAT is not a perfect solution that wipes all metadata. That said, the journalist should use MAT on any appropriate documents if they intend to take them off the airgap to be published.

Closing, because this was implemented at some point (I can't tell when because the migration to .rst from .md).

redshiftzero · 2016-11-02T18:44:38Z

My understanding was that issue was to create a tool (like MAT, but better!) for journalists to anonymize their documents? Something not to be done automatically but to have installed in Tails along with some written documentation on how to use effectively to keep sources safe.

psivesely · 2016-11-08T01:11:07Z

Okay, I misunderstood. Going to reopen. Also, have some more thoughts on the matter.

The best tool I can think of to do this, would be to take the Qubes PDF converter idea, and extend it to all photo and document types. Though it's design intention is to take a possibly malicious document, and produce a trusted one with the same contents, I believe it would also do an excellent job removing metadata. ImageMagick may add certain metadata when it re-constitutes the RGB bitmaps into the respective formats, however, this should be more predictable, less important, and easier to scrub. (E.g., ImageMagick might include the time of re-constitution, which is not nearly as bad as leaking the actual document creation time, but should still be removed.)

I've just started today diving into design of the reading room (RR). Here's the workflow I'm imagining for how a journalists removes documents for publication:

A USB is plugged into the RR machine. There is already a USB DispVM that has been assigned the USB controller via VT-D.
The journalist selects documents they would like to remove from the RR. By default, metadata and malware removal (to the best of our ability), will be performed automatically. However, we will expose some option with an appropriate warning allowing them to move a copy of the raw document off the machine.
The documents will be moved via a qrexec3 protocol from the storage VM to the USB VM, and onto the USB.

I think it's best we stop putting additional burdens on journalists, and adding to our now ~200 pages of documentation. We need to automate as much as possible, and stop relying on journalists as much possible to practice good opsec.

redshiftzero · 2016-11-08T19:26:35Z

Great workflow for exporting documents @fowlslegs. Also it looks like the developer is not currently maintaining MAT and is recommending not to use it:

redshiftzero · 2016-12-06T18:23:44Z

FYI it turns out:

qvm-convert-pdf does convert images (to PDFs) as well using DispVMs, though the "convert to trusted PDF" option does not appear unless you add the .PDF suffix to the file
there is actually already a variant of this for images qvm-convert-img (not installed by default, but I tried it out and it works great) that you can install in Qubes to go directly from e.g. PNG to trusted PNG using the same opening in a DispVM approach

redshiftzero · 2017-05-12T22:52:02Z

Just tried to redact a PDF using MAT on Tails 3 and PDFs are no longer supported files due to this bug found last year (and it looks like it's been disabled for a while). However if someone fixed this bug, they would likely become supported again...

redshiftzero · 2017-09-14T20:23:42Z

We're going to use a Qubes-based strategy to help journalists strip metadata from SecureDrop submissions. Followup: freedomofpress/securedrop-workstation#26

runasand mentioned this issue Sep 8, 2014

Remove MAT #519

Merged

diracdeltas added the hackathon label Nov 8, 2014

redshiftzero mentioned this issue Nov 2, 2016

Improve UX for sources #1437

Closed

psivesely added the Reading Room label Nov 2, 2016

psivesely closed this as completed Nov 2, 2016

psivesely reopened this Nov 8, 2016

psivesely mentioned this issue Nov 8, 2016

Add documentation for journalists on how to strip metadata and redact documents #1449

Closed

redshiftzero added this to the 1.0 milestone Dec 6, 2016

redshiftzero removed the hackathon label Dec 7, 2016

redshiftzero removed this from the 1.0 milestone May 11, 2017

redshiftzero mentioned this issue Sep 14, 2017

Add support for file conversion and metadata removal freedomofpress/securedrop-workstation#26

Open

redshiftzero closed this as completed Sep 14, 2017

DonnchaC mentioned this issue Nov 30, 2017

Install pdf-redact-tools on the SVS to allow document sanitization #2643

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tool to help journalists analyze and sanitize metadata #543

Tool to help journalists analyze and sanitize metadata #543

runasand commented Sep 8, 2014

diracdeltas commented Nov 8, 2014

psivesely commented Nov 2, 2016

psivesely commented Nov 2, 2016

redshiftzero commented Nov 2, 2016

psivesely commented Nov 8, 2016

redshiftzero commented Nov 8, 2016

redshiftzero commented Dec 6, 2016

redshiftzero commented May 12, 2017

redshiftzero commented Sep 14, 2017

Tool to help journalists analyze and sanitize metadata #543

Tool to help journalists analyze and sanitize metadata #543

Comments

runasand commented Sep 8, 2014

diracdeltas commented Nov 8, 2014

psivesely commented Nov 2, 2016

psivesely commented Nov 2, 2016

redshiftzero commented Nov 2, 2016

psivesely commented Nov 8, 2016

redshiftzero commented Nov 8, 2016

redshiftzero commented Dec 6, 2016

redshiftzero commented May 12, 2017

redshiftzero commented Sep 14, 2017