Skip to content

Latest commit

 

History

History
51 lines (31 loc) · 2.43 KB

README.md

File metadata and controls

51 lines (31 loc) · 2.43 KB

Short-Term Meaning Shift: A Distributional Exploration

Overview

This directory contains the code and data used for the experiments prensented in the paper Short-Term Meaning Shift: A Distributional Exploration, included in the Proceedings of NAACL-HLT 2019. The final paper is in ./Paper/, while the slides used for short presentation at NAACL-HLT 2019 are in ./Slides/.

We release the following material:

  • Our implementation of the model introduced by Kim et. al (2014) for creating time dependent word representations, see ./Model/.
  • The dataset annotated by users of the subbredit r/LiverpoolFC that we used for our experiments, see ./Dataset/.

Requirements:

To run the model, Python 3 is required. You can install all the required packages using the following command:

$ pip3 install -r requirements.txt

Running the code:

To create word embeddings without any kind of pre-initialization use the following command:

python3 create_vectors.py -data <data_file> -output <output_directory>

In order to initialize the word embeddings for time bin t with those in t-1 use this command:

python3 create_vectors_preinitialization.py -data_t <data_file_for_time_t> -data_t_minus_1 <data_file_for_time_t-1> -vectors <vectors computed for t-1> -output <output_directory> 

For both files, it is possible to pass as arguments the hyper-parameters of the model.

Dataset:

The dataset is provided in the in two files:

  • ./Dataset/LiverpoolFC_[13|17].txt.zip: the linguistic content produced by the members of r/LiverpoolFC in the two time bins 2011/13 and 2017.
  • ./Dataset/contexts.txt: the examples showed to the redditors. For each word, the contexts of use in the two time bins.
  • ./Dataset/annotated_words.csv: the annotation by the redditors. For each line: id, target word, shift index value, annotation provided by redditors: [change|no_change|not_available(NA)]

References

If you use this code or dataset, please cite the following paper:

@inproceedings{del2019short-term,
	Author = {Del Tredici, Marco and Fern\'andez, Raquel and Boleda, Gemma},
	Booktitle = {Proceedings of NAACL-HLT 2019 (Annual Conference of the North American Chapter of the Association for Computational Linguistics)},
	Title = {{Short-Term Meaning Shift: A Distributional Exploration}},
	Year = {2019}
  }