This is a little extension of the work done in *Distilling Task-Specific Knowledge from BERT into Simple Neural Networks* by Tang et al. (2019). Hopefully this notebook will serve as an easy-to-follow guide to distillation, which is actually really simple. This is based on work I did for Polecat.
Tang et al. use BERT to train a BiLSTM. One of their suggestions for future work is to explore to what extent even simpler models can benefit from the technique. This notebook does just that: we'll try to use BERT to train a simple linear model and a CNN, both implemented in PyTorch. The core idea is sketched in the snippet below.
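As a rough preview, here is a minimal sketch of the kind of distillation objective Tang et al. describe: the student is trained on a blend of ordinary cross-entropy against the hard labels and mean-squared error against the teacher's (BERT's) logits. The weighting `alpha` and the variable names are placeholder assumptions for illustration, not the exact settings used later in the notebook.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, alpha=0.5):
    """Blend hard-label cross-entropy with MSE against the teacher's logits.

    alpha is a hypothetical weighting; in practice it is tuned per task.
    """
    ce = F.cross_entropy(student_logits, labels)       # hard targets
    mse = F.mse_loss(student_logits, teacher_logits)   # soft targets from BERT
    return alpha * ce + (1.0 - alpha) * mse

# Usage sketch: teacher logits are precomputed with BERT and treated as constants.
# student_logits = student_model(batch)                # linear model or CNN
# loss = distillation_loss(student_logits, teacher_logits.detach(), labels)
```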
The notebook looks a bit better in NBviewer.