Releases: UAlbanyArchives/mailbagit
v0.7.3
Fixes PST error handling bug.
What's Changed
- Bug fix by @gwiedeman in #253
Full Changelog: v0.7.2...v0.7.3
v0.7.2
- Improved handling of RTF bodies for PST files and now correctly logs errors for malformed RTF.
- Improved headless Chrome use for macOS users
- Fixed controller-level error handling bug per #205
What's Changed
- Better handles RTF bodies for PST files by @gwiedeman in #251
- Update pdf-chrome derivative by @jtruitt1 in #248
- Fix PST RTF bodies, controller error handling by @gwiedeman in #252
Full Changelog: V0.7.1...v0.7.2
v0.7.1
What's Changed
- updated docs and build by @gwiedeman in #243
- MSG encoding and WARC page requisites fixes by @gwiedeman in #244
- Handles encoding better for MSG source files by just relying on chardet
- Improved injection of attached images in message HTML for HTML, PDF, and WARC derivatives
- Fixes issue with --external-links not including required CSS and JS in WARC derivatives
Full Changelog: v0.7.0...V0.7.1
v0.7.0
What's Changed
- improved docker setup and docs by @gwiedeman in #239
- wonder if this gem will fix slow-loading pages? by @gwiedeman in #241
- Address MSG body encoding errors by @gwiedeman in #242
Full Changelog: v0.6.3...v0.7.0
v0.6.3
Full Changelog: v0.6.2...v0.6.3
V0.6.2
What's Changed
- pinned to version of extract_msg by @gwiedeman in #238
Full Changelog: v0.6.1...v0.6.2
v0.6.1
Bugfix release that correctly normalizes attachments with \ in filename and adds an additional check, so mailbagit doesn't fail when this normalization isn't sufficient.
What's Changed
- Normalization bugfix by @gwiedeman in #236
Full Changelog: v0.6.0...v0.6.1
v0.6.0
PST encoding bugfix
Includes a fix for a bug where PST parsing didn't use the first-priority encoding, which could cause encoding errors in PDF, HTML, and WARC derivatives.
Improve PST HTML body extraction
PST files often contain messages that have no HTML body yet still render with formatting in Outlook, because Outlook and other clients use the RTF body instead. Mailbagit, which previously ignored RTF bodies in PST files, now extracts HTML from them when an HTML body is not present. The extracted HTML is also used for PDF and WARC derivatives. Previously this was only done for MSG sources.
WARC URI improvement
Previously, WARC derivatives made a custom URI for the important WARC-Target-URI header, using http://mailbag, such as:
http://mailbag/39/body.html
http://mailbag/39/headers.json
http://mailbag/39/attachmentFilename.pdf
This wasn't great: these URIs were likely to create conflicts outside of a mailbag, and they didn't denote a real location, as the WARC-Target-URI is supposed to.
A better approach would be to use the Message-ID header, as specified by RFC2392. We didn't do this originally because the header was thought to be unreliable, as we had cases where the Message-ID headers were stripped. Still, just ignoring the field wasn't a great approach, so this change uses Message-ID for WARC-Target-URI when it is present, and only falls back to http://mailbag if it doesn't get a Message-ID that seems valid.
This approach uses the Message-ID header, but strips the leading and trailing brackets (<>) that typically wrap it. To make it a valid URI according to RFC3986, it prepends the mailto: URI scheme.
Thus, the Message-ID header <MN2PR04MB579157FAB038D851277E3908F9129@MN2PR04MB5791.namprd04.prod.outlook.com> becomes the WARC-Target-URI mailto:MN2PR04MB579157FAB038D851277E3908F9129@MN2PR04MB5791.namprd04.prod.outlook.com
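The Message-ID to WARC-Target-URI mapping described above can be sketched roughly as follows. This is an illustration only, not mailbagit's actual code: the function name `warc_target_uri`, its parameters, and the regular expression used to decide whether a Message-ID "seems valid" are all assumptions made for this example.

```python
import re

# Fallback base taken from the release notes above.
FALLBACK_BASE = "http://mailbag"

# Assumption: a Message-ID "seems valid" if it has no whitespace or angle
# brackets and exactly one "@" with text on both sides. The real check in
# mailbagit may differ.
_MESSAGE_ID_RE = re.compile(r"^[^\s<>@]+@[^\s<>@]+$")


def warc_target_uri(message_id, message_number, filename="body.html"):
    """Return a WARC-Target-URI for one message file (hypothetical helper)."""
    if message_id:
        candidate = message_id.strip()
        # Strip the leading and trailing brackets that typically wrap the ID.
        if candidate.startswith("<") and candidate.endswith(">"):
            candidate = candidate[1:-1]
        if _MESSAGE_ID_RE.match(candidate):
            # Prepend the mailto: scheme; no percent-encoding in this sketch.
            return "mailto:" + candidate
    # No usable Message-ID: fall back to the older http://mailbag style.
    return f"{FALLBACK_BASE}/{message_number}/{filename}"


if __name__ == "__main__":
    header = "<MN2PR04MB579157FAB038D851277E3908F9129@MN2PR04MB5791.namprd04.prod.outlook.com>"
    print(warc_target_uri(header, 39))
    print(warc_target_uri(None, 39, "headers.json"))
```

With a missing or malformed Message-ID, the sketch returns the older style of URI, such as http://mailbag/39/headers.json.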
What's Changed
- Improve PST parsing and WARC URIs by @gwiedeman in #235
Full Changelog: v0.5.1...v0.6.0
v0.5.1
What's Changed
- now detects mbox files without extensions from Mac Mail by @gwiedeman in #233
Full Changelog: v0.5.0...v0.5.1
v0.5.0
- Implements -k/--keep to preserve source files by copying them into a mailbag instead of moving them.
- Allows -m/--mailbag to create a mailbag in another directory by using a full path. -m my_mailbag or -m path/to/my_mailbag will still create a mailbag in the source directory, but -m /path/to/my_mailbag or C:\Users\Me\my_mailbag will create the mailbag in those directories.
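As a rough illustration of the -m/--mailbag rule above, the sketch below treats a value as a "full path" when it is absolute under either POSIX or Windows conventions, and otherwise places it under the source directory. The function names `is_full_path` and `resolve_mailbag_dir` are hypothetical, and joining with POSIX separators is a simplification; this is not how mailbagit itself is necessarily implemented.

```python
from pathlib import PurePosixPath, PureWindowsPath


def is_full_path(mailbag_name: str) -> bool:
    """True if the value is absolute in POSIX or Windows terms (assumed rule)."""
    return (
        PurePosixPath(mailbag_name).is_absolute()
        or PureWindowsPath(mailbag_name).is_absolute()
    )


def resolve_mailbag_dir(source_dir: str, mailbag_name: str) -> str:
    """Return where the mailbag would be created (hypothetical helper)."""
    if is_full_path(mailbag_name):
        # Full path: the mailbag is created exactly there.
        return mailbag_name
    # Bare name or relative path: the mailbag stays under the source directory.
    return str(PurePosixPath(source_dir) / mailbag_name)


if __name__ == "__main__":
    for name in ("my_mailbag", "path/to/my_mailbag",
                 "/path/to/my_mailbag", r"C:\Users\Me\my_mailbag"):
        print(name, "->", resolve_mailbag_dir("/data/export", name))
```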
What's Changed
- Moving options 226 227 by @gwiedeman in #228
- 0.5.0 release by @gwiedeman in #229
- forgot to bump version numbers by @gwiedeman in #230
Full Changelog: v0.4.2...v0.5.0