language_modelling

Generating text in the style of a single author with an LSTM, a Transformer, and a pre-trained GPT-2.

This repository contains four models that can be used to generate text. Results are not great because of insufficient computational resources (even with a GPU on a Colab notebook).

1. LSTM
An LSTM that takes as input a sequence of words encoded as pre-trained GloVe word vectors and outputs a vector in the same embedding space. Trained on The Lord of the Rings. Bad results.
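A minimal PyTorch sketch of this setup (the class name and hyperparameters are illustrative, not taken from this repository):

```python
import torch
import torch.nn as nn

class GloVeLSTM(nn.Module):
    """LSTM over GloVe vectors that predicts the next word's vector."""
    def __init__(self, embed_dim=300, hidden_dim=512, num_layers=2):
        super().__init__()
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers, batch_first=True)
        # Project the hidden state back into the embedding space.
        self.head = nn.Linear(hidden_dim, embed_dim)

    def forward(self, x):
        # x: (batch, seq_len, embed_dim) sequences of GloVe vectors
        out, _ = self.lstm(x)
        # Predict the next word's embedding from the last time step.
        return self.head(out[:, -1, :])
```

Training regresses the prediction onto the target word's GloVe vector (e.g. MSE or cosine loss); generation then picks the nearest word in the embedding space.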

2. LSTM with embeddings
An LSTM that takes as input a sequence of word indices; before the LSTM layers, the input passes through an embedding layer whose weights are taken from the GloVe model. The output can be read as a probability distribution over the next index (word). Trained on The Lord of the Rings. Bad/mediocre results.
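A minimal sketch of this variant, assuming glove_weights is a pre-built (vocab_size, embed_dim) tensor of GloVe vectors; all names are illustrative:

```python
import torch
import torch.nn as nn

class EmbeddingLSTM(nn.Module):
    def __init__(self, glove_weights, hidden_dim=512, num_layers=2):
        super().__init__()
        # Embedding layer initialized with the pre-trained GloVe matrix.
        self.embed = nn.Embedding.from_pretrained(glove_weights, freeze=False)
        vocab_size, embed_dim = glove_weights.shape
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers, batch_first=True)
        self.head = nn.Linear(hidden_dim, vocab_size)

    def forward(self, x):
        # x: (batch, seq_len) word indices
        out, _ = self.lstm(self.embed(x))
        # Logits over the vocabulary at each position; train with
        # nn.CrossEntropyLoss against the indices shifted by one.
        return self.head(out)
```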

3. Transformer
A Transformer model with positional encodings, trained on LOTR. Decent results.
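A sketch of one way to build such a model: sinusoidal positional encodings added to learned token embeddings, feeding a causally masked nn.TransformerEncoder (the exact architecture is an assumption; sizes are illustrative):

```python
import math
import torch
import torch.nn as nn

class PositionalEncoding(nn.Module):
    def __init__(self, d_model, max_len=5000):
        super().__init__()
        pos = torch.arange(max_len, dtype=torch.float).unsqueeze(1)
        div = torch.exp(torch.arange(0, d_model, 2, dtype=torch.float)
                        * (-math.log(10000.0) / d_model))
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(pos * div)
        pe[:, 1::2] = torch.cos(pos * div)
        self.register_buffer("pe", pe)

    def forward(self, x):  # x: (batch, seq_len, d_model)
        return x + self.pe[: x.size(1)]

class TransformerLM(nn.Module):
    def __init__(self, vocab_size, d_model=256, nhead=4, num_layers=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.pos = PositionalEncoding(d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, x):  # x: (batch, seq_len) word indices
        # Causal mask so each position attends only to earlier tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1)).to(x.device)
        h = self.encoder(self.pos(self.embed(x)), mask=mask)
        return self.head(h)  # logits over the vocabulary
```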

4. GPT-2
An attempt at transfer learning on the pre-trained GPT-2 model from Hugging Face. The transfer-learning procedure can be run on LOTR, Trump tweets, or Salvini tweets. Good results.
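A sketch of what the fine-tuning step could look like with the Hugging Face transformers library (the file name, window size, and hyperparameters are placeholders):

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

text = open("corpus.txt").read()  # LOTR, Trump tweets, or Salvini tweets
ids = tokenizer(text, return_tensors="pt").input_ids  # assumes a long corpus

model.train()
for step in range(100):
    # Take a random 512-token window of the corpus as one example.
    start = torch.randint(0, ids.size(1) - 512, (1,)).item()
    batch = ids[:, start : start + 512]
    # With labels equal to the inputs, the model shifts them internally
    # and returns the causal language-modelling loss.
    loss = model(batch, labels=batch).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

After fine-tuning, model.generate with sampling (e.g. top_k / top_p) produces text in the target style.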
