Getting started

Install the package with npm.

npm i twitter-archive-reader

This package uses JSZip internally to read ZIP archives, so you can load archives in this module the same way you load them in JSZip.

// ESModules
import TwitterArchive from 'twitter-archive-reader';

// CommonJS
const { TwitterArchive } = require('twitter-archive-reader');

Usage

You can create an instance from several types of objects, all of which must reference an archive. Once you've created the instance, you must wait for it to become ready by awaiting the promise returned by .ready().

Supported loading methods:

File from browser
Buffer or ArrayBuffer
string (filename) for loading files in Node.js
number[] or Uint8Array (byte arrays)
JSZip instances
Archive instances (see StreamZip.ts/Archive class)

// You can create a TwitterArchive from any format supported
// by JSZip's loadAsync() method
// (see https://stuk.github.io/jszip/documentation/api_jszip/load_async.html).
// You can also use a filename or a Node.js Buffer.

// By a filename (Node.js)
const archive = new TwitterArchive('filename.zip');

// Or, in a browser, by a file input (File object):
// const archive = new TwitterArchive(document.querySelector('input[type="file"]').files[0]);

// Initialization can take a while (unzipping, reading tweets and DMs...),
// so the archive emits events you can listen to for each initialization step
archive.addEventListener('zipready', () => {
  // ZIP is unzipped
});
archive.addEventListener('tweetsread', () => {
  // Tweet files have been read
});
// See all available events in the Events section.

console.log("Reading archive...");
// You must wait for the ZIP to be read and the archive object to be built
await archive.ready();

// Archive is ready!

Options for TwitterArchive

You can set options when you create the TwitterArchive instance.

Available options are:

new TwitterArchive(
  /** 
   * Archive to load.
   * Can be a string (filename), number[], Uint8Array,
   * JSZip, Archive, ArrayBuffer and File objects.
   */
  file: AcceptedZipSources,
  /** 
   * If the file is a modern/GDPR archive, TwitterArchive
   * will build the "extended" data. 
   */
  build_extended: boolean = true,
  /**
   * After the archive is loaded, defines whether the unzipped archive
   * should stay in memory.
   * Setting this parameter to false could
   * reduce memory consumption.
   *
   * In specific circumstances, the archive will
   * stay loaded even if this parameter is false.
   */
  keep_loaded: boolean = false,
  /**
   * In Twitter GDPR archives v2, tweet and DM images are in ZIP archives inside the ZIP.
   * If `true`, TwitterArchive will extract their content in RAM to allow the usage of images.
   * If `false`, DM images will be unavailable.
   * If `undefined`, TwitterArchive will extract in RAM in browser mode, and leave the ZIP untouched in Node.js.
   *
   * To load the DM image ZIP present in the archive on demand, use `.loadCurrentDmImageZip()`.
   * **Please note that `keep_loaded` should be set to `true` to use this method!**
   */
  load_images_in_zip: boolean? = undefined
)
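
As a rough illustration of how these parameters combine, the sketch below opens an archive while keeping it in memory and skipping the upfront image extraction, so that .loadCurrentDmImageZip() stays usable later. It assumes the positional parameter order shown in the listing above. openWithLazyDmImages is a hypothetical helper, and the TwitterArchive class is passed in as an argument so the snippet does not depend on a particular import style.

```javascript
/**
 * Hypothetical helper, not part of twitter-archive-reader.
 * Assumes the constructor takes (file, build_extended, keep_loaded,
 * load_images_in_zip) in that order, as in the listing above.
 */
async function openWithLazyDmImages(TwitterArchive, file) {
  const buildExtended = true;    // build the "extended" data for GDPR archives
  const keepLoaded = true;       // keep the unzipped archive in memory
  const loadImagesInZip = false; // do not extract image ZIPs during loading

  const archive = new TwitterArchive(file, buildExtended, keepLoaded, loadImagesInZip);
  // Wait for the ZIP to be read and the archive object to be built.
  await archive.ready();
  return archive;
}
```

It could be called as openWithLazyDmImages(TwitterArchive, 'filename.zip'), after which the DM image ZIP can be loaded on demand through the returned archive.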

Events

Reading an archive takes a while: the package has to unzip it, then read tweets, user information, direct messages, and other data. You may therefore want to show the current loading step to the end user.

TwitterArchive provides an event system compatible with both Node.js and browser JavaScript.

You can listen to events with the .addEventListener() method or with the .on<event_name> attributes, as with DOM elements.

Events are listed in the order in which they fire.

None of the events described below, except error, carries data in its detail attribute.
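
To show loading progress, one option is to register a listener for each step in a loop. The following is a minimal sketch: trackLoadingSteps and LOADING_STEPS are hypothetical names, the event names are the ones documented in this section, and the only assumption about the archive is that it exposes addEventListener().

```javascript
// Loading events documented in this section, in firing order.
// The error event is left out on purpose: it signals a failure, not a step.
const LOADING_STEPS = [
  'zipready',
  'userinfosready',
  'indexready',
  'tweetsread',
  'willreaddm',
  'willreadextended',
  'ready',
];

/**
 * Hypothetical helper, not part of twitter-archive-reader.
 * Calls onStep(name) each time one of the loading events fires.
 */
function trackLoadingSteps(archive, onStep) {
  for (const name of LOADING_STEPS) {
    archive.addEventListener(name, () => onStep(name));
  }
}
```

A typical use would be to call trackLoadingSteps(archive, step => console.log(step)) right after creating the archive and before awaiting .ready(), so that no early event is missed.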

zipready

Fires when the archive is unzipped (its content has not been read yet).

userinfosready

Fires when basic user information (archive creation date, user details) has been read.

indexready

Fires when the tweet index (months, number of tweets) has been read.

tweetsread

Fires when tweet files have been read.

willreaddm

Fires when direct message files are about to be read. This event does not fire when a classic archive is given.

willreadextended

Fires when miscellaneous information (favorites, moments...) is about to be read. This event does not fire when a classic archive is given.

ready

Fires when the reading process is over.

Linked to .ready() promise (fulfilled).

error

Fires when reading fails. The detail attribute contains the thrown error.

Linked to .ready() promise (rejected).
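
Because a failed read rejects the .ready() promise, the failure can be handled with an ordinary try/catch. The sketch below is one possible way to do it: waitUntilReady is a hypothetical helper that relies only on .ready() returning a promise, and it reports the outcome as a plain object instead of throwing.

```javascript
/**
 * Hypothetical helper, not part of twitter-archive-reader.
 * Resolves to { ok: true, error: null } once the archive is ready,
 * or to { ok: false, error } if reading failed.
 */
async function waitUntilReady(archive) {
  try {
    await archive.ready();
    return { ok: true, error: null };
  } catch (error) {
    // The .ready() promise was rejected: reading the archive failed.
    return { ok: false, error };
  }
}
```

The caller can then branch on the ok field, for example to show an error message to the end user instead of an unhandled rejection.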

Continue

The next part is Archive Properties.
