This is a CNTK implementation of a self-attentive model. This repo applies it to text classification.
- python 3.6
- cntk 2.4 for GPU
- numpy
The toy dataset used is ATIS, taken from CNTK Tutorial 202: Language Understanding with Recurrent Networks.
Download the ATIS training and test datasets.
A larger dataset is also supported: the AG's News Topic Classification Dataset.
This implementation is based on the paper "A Structured Self-Attentive Sentence Embedding".
Inspired by the TensorFlow implementation in this repo.
- Baseline: embedding + stabilizer + bi-GRU (150 units per direction) + fc + fc
- Self-Attentive: embedding + stabilizer + bi-GRU (150 units per direction) + self-attention + fc + fc
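The self-attention step between the bi-GRU and the fully connected layers can be sketched as follows. This is a minimal NumPy illustration of the structured self-attention from the paper (not the repo's CNTK code); the variable names `W_s1`, `W_s2`, `d_a`, and `r` follow the paper's notation, and the shapes are illustrative.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def structured_self_attention(H, W_s1, W_s2):
    """Structured self-attention over bi-GRU outputs.

    H:    (n, 2u)   hidden states for a sentence of n tokens
    W_s1: (d_a, 2u) first projection
    W_s2: (r, d_a)  r attention hops
    Returns M (r, 2u), the sentence embedding, and A (r, n),
    the attention matrix: A = softmax(W_s2 tanh(W_s1 H^T)).
    """
    A = softmax(W_s2 @ np.tanh(W_s1 @ H.T), axis=-1)  # (r, n)
    M = A @ H                                         # (r, 2u)
    return M, A
```

Each of the `r` rows of `A` is a distribution over the tokens, so the sentence is summarized from `r` different "views" before being flattened into the fc layers.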
Toy dataset training:
unzip ag_data.zip
unzip toy_data.zip
python selfAtt.py --lr 0.03 --dataset toy --max_epoch 5 --batch_size 60 --self_attention
- Add the attention penalization term from the paper
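The penalization term referred to above is, in the paper, the Frobenius-norm penalty P = ||AA^T − I||²_F on the attention matrix, which pushes the r attention hops to focus on different parts of the sentence. A minimal NumPy sketch (illustrative, not the repo's CNTK code):

```python
import numpy as np

def attention_penalty(A):
    """Frobenius-norm penalty ||A A^T - I||_F^2 on an (r, n)
    attention matrix A, as proposed in the paper. It is zero when
    the r attention rows are mutually orthogonal one-hot vectors."""
    r = A.shape[0]
    return float(np.sum((A @ A.T - np.eye(r)) ** 2))
```

In training this would be scaled by a coefficient and added to the classification loss.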