Skip to content

Latest commit

 

History

History
18 lines (16 loc) · 2.64 KB

README.md

File metadata and controls

18 lines (16 loc) · 2.64 KB

Building Transformer Models with Attention

Implementation from Scratch in TensorFlow Keras

Following this book to teach myself about the transformer architecture in depth.

Some excellent resources I've come across along the way: