This repository contains modified code for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks. For an intuitive overview of the original paper, read the accompanying blog post. For well-documented instructions on running the original model, refer to the code for the paper. The Python 3 version of the code that inspired this repo can be found here.
The goal of this repo is to serve as a tutorial for people just starting out with deep learning. It is certainly not exhaustive, but it reflects how I learned some TensorFlow. The tutorial uses one particular encoder-decoder network whose primary use is summarizing text documents. The idea, however, is for the writing to be general and flexible enough that the key points apply to any paper or codebase.
You'll certainly get the most out of this notebook if you have some prior coding experience and know a bit of deep learning theory. It walks step by step through the papers I read and how I came to understand this model's ins and outs, so that you can not only follow along but also tweak the model to explore your own ideas. If you have feedback, or want to alter the model in some way but don't know how, please open an Issue and I'll address it at some point!
Some Words
Understand the model's parameters and the types of outputs you'll get.
How to see exactly what your model is outputting.
Helps you understand the input and output formats, as well as the limitations the model has given your own data.
Changing the embedding layer is a simple way to adapt your model without breaking it easily. Usually boosts performance! (See the sketch after this list.)
Will help you understand what exactly is being processed by the model.
A simple technique that lets the model deviate from the desired output just a little bit.
Implementing a method from another paper; involves changing the beam decoder.
Delves into the attention module.
A simple tweak that delves into the nuances of differentiable components.
Implementing another paper's methods in this model.
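To give a taste of the kind of change the embedding item above refers to, here is a minimal sketch of initializing the embedding layer from pretrained vectors instead of training it from scratch. It assumes the TensorFlow 1.x API used by the original code; the names (vsize, emb_dim, pretrained_matrix, enc_batch) are illustrative, not the repo's actual identifiers.

```python
# Minimal sketch (assumption: TF 1.x-style graph code, illustrative names).
import numpy as np
import tensorflow as tf

vsize, emb_dim = 50000, 128  # hypothetical vocabulary and embedding sizes
# Stand-in for real pretrained vectors (e.g. loaded GloVe/word2vec rows).
pretrained_matrix = np.random.rand(vsize, emb_dim).astype(np.float32)

with tf.variable_scope('embedding'):
    # trainable=False freezes the pretrained embeddings; set True to fine-tune.
    embedding = tf.get_variable(
        'embedding', shape=[vsize, emb_dim], dtype=tf.float32,
        initializer=tf.constant_initializer(pretrained_matrix),
        trainable=False)

# Look up token ids for an encoder batch: [batch_size, max_enc_steps] -> [batch, steps, emb_dim]
enc_batch = tf.placeholder(tf.int32, [None, None], name='enc_batch')
emb_enc_inputs = tf.nn.embedding_lookup(embedding, enc_batch)
```

The only moving parts are the initializer and the trainable flag, which is why this tweak rarely breaks the rest of the model.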
To understand what you're getting into, I recommend reading a few papers first. The first, and most obvious, is the paper from the code's author, which comes with a beautifully written blog post that you can find here. It also helps to know a little TensorFlow, but truthfully I knew very little myself when I first dove into this paper, so it's not necessary. Hopefully working through these modifications helps on that front.