Language Style Transfer

A Tensorflow implementation of NLP Style Transfer from Non-parallel Text with Adversarial Alignment. Repo applies the work unto Shakespearean works. It provides prototypes for seq2seq models with teacher-enforcement as well as plots of the encoding layer.

Getting the Dataset

Shakespearean texts are taken from this repository. All works are aligned by line.

Getting Started

You should have two folders of data, one of style A and one of style B. Each directory should contain a or several files with each sentence separated by a newline.

To parse the data, use dataset.py:

  python dataset.py [-c] $SOURCE_PATH_1 $SOURCE_PATH_2 $PICKLE_PATH

This will create a pickled file containing a dictionary of the two datasets as well as a sklearn one-hot encoder that contains the mappings of index to word/character.

Use this object to train the aligned autoencoder.

  python autoencoder.py $PICKLE_PATH $NUM_EPOCHS

This will create a model in models/ directory. The model will contain trained weights, loss of the two autoencoders, and the adversarial loss. You can examine the plots with tensorboard.

  tensorboard $MODEL_PATH

Citation

For most cases, please cite the original work

@misc{1705.09655,
Author = {Tianxiao Shen and Tao Lei and Regina Barzilay and Tommi Jaakkola},
Title = {Style Transfer from Non-Parallel Text by Cross-Alignment},
Year = {2017},
Eprint = {arXiv:1705.09655},
}

If you use this implementation of the work, use the following citation:

@misc{2017language-style-transfer,
  author = {Jin Park},
  title = {Language Style Transfer},
  year = {2017},
  howpublished = {\url{https://github.com/jpark96/language-style-transfer}},
  note = {commit xxxxxxx}
}

Authors

Jin Park

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
cloud		cloud
img		img
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language Style Transfer

Getting the Dataset

Getting Started

Citation

Authors

About

Releases

Packages

Languages

jpark621/language-style-transfer

Folders and files

Latest commit

History

Repository files navigation

Language Style Transfer

Getting the Dataset

Getting Started

Citation

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages