This repository is a fork of the knnlm repository; the exact commit this code is based on can be found here. Please refer to that commit to determine the software requirements for running this code. This README will be updated once the code has been merged into Fairseq. This repository is heavily based on fairseq.
Before starting, install Fairseq (from the project directory, after pulling the code) and FAISS:
pip install --editable .
pip install faiss
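If the plain pip install faiss command does not work in your environment, the sketch below shows one possible setup sequence; note that recent FAISS builds are commonly published on PyPI as faiss-cpu / faiss-gpu or distributed via conda, and the repository URL below is a placeholder.

```bash
# Sketch of a full setup, assuming Python 3 and (optionally) a CUDA GPU.
# <repo-url> is a placeholder for this repository's URL.
git clone <repo-url> knnlm-fork && cd knnlm-fork

# Install this Fairseq fork in editable mode.
pip install --editable .

# Install FAISS. 'pip install faiss' matches the instructions above, but
# depending on your environment FAISS may instead be packaged as:
pip install faiss-gpu                   # GPU wheels from PyPI
# pip install faiss-cpu                 # CPU-only wheels
# conda install -c pytorch faiss-gpu    # conda alternative
```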
Dataset downloads: Java (https://zenodo.org/record/3628665) and Wikitext (https://blog.einstein.ai/the-wikitext-long-term-dependency-language-modeling-dataset/).
To extract locality features, follow locality_features.
For Wikitext-103 experiments, follow wikitext_knn_lm.sh.
For Java experiments, follow bigcode_knn_lm_dynamic.sh.
If your hardware constraints make this too slow, you can run it without full-precision keys by adding two flags: --no-load-keys and --knn-sim-func "do_not_recomp_l2". This uses the quantized versions of the keys stored within the FAISS index. You can make things faster by reducing the value of the probe flag (the number of clusters FAISS checks for neighbors) at the cost of performance. You can also try reducing the number of neighbors k.
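For concreteness, here is a sketch of how these speed-oriented flags combine with a Fairseq eval_lm invocation, modeled on the upstream knnlm README. The dataset path, checkpoint, datastore location, and hyperparameter values are placeholders; the scripts above remain the authoritative reference and may add fork-specific flags (e.g. for locality features).

```bash
# Sketch only: flag names follow the upstream knnlm fork of Fairseq;
# paths and values are placeholders, see wikitext_knn_lm.sh for the real ones.
python eval_lm.py data-bin/wikitext-103 \
    --path checkpoints/wt103/checkpoint_best.pt \
    --sample-break-mode complete --max-tokens 3072 \
    --context-window 2560 --softmax-batch 1024 \
    --gen-subset valid \
    --knnlm --dstore-filename checkpoints/wt103/dstore \
    --indexfile checkpoints/wt103/knn.index \
    --dstore-size 103225485 --k 1024 --lmbda 0.25 --probe 32 \
    --no-load-keys --knn-sim-func "do_not_recomp_l2"   # use quantized keys from the FAISS index
```

Lowering --probe and --k trades retrieval quality for speed, so it is worth re-measuring perplexity after changing them.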