Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention

PyTorch implementation of the model presented in "Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention".

Paper abstract:

Satellite image time series, bolstered by their growing availability, are at the forefront of an extensive effort towards automated Earth monitoring by international institutions. In particular, large-scale control of agricultural parcels is an issue of major political and economic importance. In this regard, hybrid convolutional-recurrent neural architectures have shown promising results for the automated classification of satellite image time series. We propose an alternative approach in which the convolutional layers are advantageously replaced with encoders operating on unordered sets of pixels to exploit the typically coarse resolution of publicly available satellite images. We also propose to extract temporal features using a bespoke neural architecture based on self-attention instead of recurrent networks. We demonstrate experimentally that our method not only outperforms previous state-of-the-art approaches in terms of precision, but also significantly decreases processing time and memory requirements. Lastly, we release a large open-access annotated dataset as a benchmark for future work on satellite image time series.

Requirements

PyTorch + torchnet
numpy + pandas + sklearn

The code has been tested in the following environment:

Ubuntu 18.04.1 LTS, Python 3.6.6, PyTorch 1.1.0, CUDA 10.0

Downloads

Datasets

A toy version of the Pixel-set dataset can be directly downloaded here, to get an idea of the dataset structure.
The complete Pixel-set and Pixel-patch datasets are accessible on this ftp server: ftp://ftp3.ign.fr/.

Please send an e-mail to lastig (dot) data (at) gmail (dot) com to request credentials for the ftp.

Pre-trained weights

We also provide the pre-trained weights for inference.

Code

Code structure

The PyTorch implementations of the PSE, TAE and PSE+TAE architectures are located in the models folder.
The folder learning contains some additional utilities that are used for training.
The repository also contains two high-level scripts train.py and inference.py that should make it easier to get started.

Code Usage

Reproduce

Run the train.py script to reproduce the results of the PSE+TAE architecture presented in the paper. You only need to specify the path to the Pixel-set dataset (link above) with the --dataset_folder argument.

Experiment

The default settings of the train.py script are those used to produce the results in the paper. Several options are also available for changing the model's hyperparameters and other training settings. They are exposed as argparse arguments (see directly inside the script).

Re-use

You can use the pre-trained weights in the inference.py script to produce predictions on our dataset or your own, provided that it is formatted as per the indications below. You will need to pass the path to the unzipped folder containing the weights with the --weight_dir argument. Do not uncompress the model.pth.tar files, as the script takes care of this.
The two components of our model (the PSE and the TAE) are implemented as stand-alone PyTorch nn.Modules (in pse.py and tae.py) and can be used for other applications. While the PSE needs to be used in combination with the PixelSetData class, the TAE can be applied to any sequential data (with input tensors of shape batch_size x sequence_length x embedding_size).

Data format

In order to use the PixelSetData dataset class with data other than those provided in the link above, the data folder should be structured as follows:

Data structure

Samples

Each dataset sample consists of the different observations for a single parcel. The observations are aggregated in a single array of shape TxCxS, with T the number of temporal observations, C the number of channels, and S the number of pixels in the parcel (different for each data sample). Each of these arrays should be stored separately in a numpy file: unique_id_of_the_sample.npy

All the individual .npy files are stored in the same sub-directory DATA.
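
As a minimal sketch of the layout described above, the snippet below writes one synthetic sample into a DATA sub-directory and reads it back. The sizes, the random values, the temporary folder and the sample id sample0 are illustrative assumptions, not values taken from the dataset.

```python
import tempfile
from pathlib import Path

import numpy as np

# Hypothetical sizes: T dates, C channels, S pixels in this parcel.
T, C, S = 24, 10, 37

root = Path(tempfile.mkdtemp())  # stands in for the dataset folder
data_dir = root / "DATA"
data_dir.mkdir()

# One array per parcel, of shape TxCxS; S may differ from one parcel to the next.
sample = np.random.rand(T, C, S).astype(np.float32)
np.save(str(data_dir / "sample0.npy"), sample)

# Reading the file back gives the same TxCxS array.
loaded = np.load(str(data_dir / "sample0.npy"))
print(loaded.shape)  # (24, 10, 37)
```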

Normalisation values

The normalisation values should be computed beforehand and stored in the form of a tuple of arrays (means, stds) in a pickle file in the main folder. The PixelSetData dataset class can adapt to different normalisation strategies depending on the shape of the arrays:

Channel-wise normalisation for each date → the arrays have shape (TxC)
Channel-wise normalisation → the arrays have shape (C,)
Global normalisation → each of the two arrays consists of a single value.
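
The three strategies can be told apart by the number of dimensions of the stored arrays. The sketch below illustrates this with numpy broadcasting on a TxCxS sample. The normalise function is a hypothetical helper written for this example, not the code of the PixelSetData class, and the array sizes are arbitrary.

```python
import pickle
import tempfile
from pathlib import Path

import numpy as np


def normalise(x, means, stds):
    """Normalise a TxCxS array (hypothetical helper, not the repository's code)."""
    means, stds = np.asarray(means), np.asarray(stds)
    if means.ndim == 0:  # global: a single value
        return (x - means) / stds
    if means.ndim == 1:  # channel-wise: one value per channel
        return (x - means[None, :, None]) / stds[None, :, None]
    if means.ndim == 2:  # channel-wise for each date: one value per (date, channel)
        return (x - means[:, :, None]) / stds[:, :, None]
    raise ValueError("unexpected shape for the normalisation values")


T, C, S = 4, 3, 5
x = np.random.rand(T, C, S)

# Per-date, per-channel statistics, computed here over the pixel axis.
means, stds = x.mean(axis=2), x.std(axis=2)

# Store the (means, stds) tuple in a pickle file, as described above.
path = Path(tempfile.mkdtemp()) / "normalisation_values.pkl"
with open(path, "wb") as f:
    pickle.dump((means, stds), f)

with open(path, "rb") as f:
    m, s = pickle.load(f)

out = normalise(x, m, s)
print(out.shape)  # (4, 3, 5)
```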

Labels

The labels should be stored in the META/labels.json file. This file has a nested, dictionary-like structure and can contain multiple nomenclatures:

labels.json = {
  "Name_of_nomenclature1": {
    "unique_id_0": label_0,
    ...,
    "unique_id_N": label_N,
    }, 
  "Name_of_nomenclature2": {
    "unique_id_0": label_0,
    ...,
    "unique_id_N": label_N,
    }
}
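
For illustration, a file with this structure can be written and queried with the standard json module. The nomenclature names (crop_type, crop_group), the sample ids and the label values below are hypothetical.

```python
import json
import tempfile
from pathlib import Path

meta_dir = Path(tempfile.mkdtemp()) / "META"
meta_dir.mkdir()

# Hypothetical nomenclatures; each one maps a sample id to its label.
labels = {
    "crop_type": {"0": 3, "1": 7, "2": 3},
    "crop_group": {"0": 0, "1": 1, "2": 0},
}
with open(meta_dir / "labels.json", "w") as f:
    json.dump(labels, f, indent=2)

# Read the file back, pick one nomenclature, and look up a sample's label.
with open(meta_dir / "labels.json") as f:
    loaded = json.load(f)

nomenclature = loaded["crop_type"]
print(nomenclature["1"])  # 7
```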

Dates and pre-computed features

The dates of the observations, if they are going to be used for the positional encoding, should be stored in YYYYMMDD format in the META/dates.json file:

dates.json = {
    1: date_0,
    ...,
    T: date_T,
}
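
One plausible way to turn such dates into positions is to count the days elapsed since the first observation. The sketch below does this with the standard library. The dates are made up, and whether the repository derives positions in exactly this way is an assumption.

```python
import json
from datetime import datetime

# Hypothetical content of META/dates.json (JSON keys are always strings).
dates_json = '{"1": 20170103, "2": 20170113, "3": 20170202}'
dates = json.loads(dates_json)

# Order the observations by their index, then parse each YYYYMMDD value.
ordered = [dates[k] for k in sorted(dates, key=int)]
parsed = [datetime.strptime(str(d), "%Y%m%d") for d in ordered]

# Number of days elapsed since the first observation.
days = [(d - parsed[0]).days for d in parsed]
print(days)  # [0, 10, 30]
```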

If some pre-computed static parcel features are to be used between the two MLPs of the PSE, they should be stored in another json file META/name_of_features.json:

name_of_features.json = {
    "unique_id_0": features_0,
    ...,
    "unique_id_N": features_N,
}

Folder structure

The dataset folder should thus have the following structure:

Dataset_folder
│ normalisation_values.pkl
└─DATA
│    │ sample0.npy
│    │ . . .
│    │ sampleN.npy
└─META
     │ labels.json
     │ dates.json
     │ geomfeat.json
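
Before training on custom data, it can help to check that a folder follows this layout. The function below is a hypothetical helper written for this example (it is not part of the repository). It only checks for the normalisation pickle, at least one sample, and the labels file, since dates and extra features are optional.

```python
import tempfile
from pathlib import Path


def check_dataset_folder(root):
    """Return a list of problems found (hypothetical helper for illustration)."""
    root = Path(root)
    problems = []
    if not any(root.glob("*.pkl")):
        problems.append("no pickle file with normalisation values in the main folder")
    if not any((root / "DATA").glob("*.npy")):
        problems.append("no .npy sample in DATA")
    if not (root / "META" / "labels.json").is_file():
        problems.append("missing META/labels.json")
    return problems


# Build an empty skeleton: all three checks should fail.
root = Path(tempfile.mkdtemp())
(root / "DATA").mkdir()
(root / "META").mkdir()
before = check_dataset_folder(root)

# Add placeholder files: the checks should now pass.
(root / "normalisation_values.pkl").touch()
(root / "DATA" / "sample0.npy").touch()
(root / "META" / "labels.json").touch()
after = check_dataset_folder(root)

print(len(before), after)  # 3 []
```

The placeholder files are empty, so this only validates names and locations, not file contents.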

Credits

The Temporal Attention Encoder is heavily inspired by the work of Vaswani et al. on the Transformer, and this PyTorch implementation served as the code base for the tae.py script.
Credits to github.com/clcarwin/ for the PyTorch implementation of the focal loss.

Reference

In case you use part of the present code, please include a citation to the following paper:

Sainte Fare Garnot, Vivien, Loic Landrieu, Sebastien Giordano, and Nesrine Chehata. "Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention." arXiv preprint arXiv:1911.07757 (2019).

https://arxiv.org/abs/1911.07757
(The link and reference will be updated upon publication)
