Implementation of Automatic Speech Recognition inspired by "Listen, Attend and Spell" paper in PyTorch

vshmyhlo/listen-attend-and-speell-pytorch

Implementation of Automatic Speech Recognition in PyTorch, inspired by the "Listen, Attend and Spell" and "Attention Is All You Need" papers

  • Trained on LibriSpeech

  • Encoder-Decoder architecture with attention

  • Encoders:

    • 2D Conv network over log-mel spectrogram
    • Followed by several GRU layers
    • Or followed by several self-attention layers
  • Decoders:

    • GRU layers with dot-product attention over the encoder outputs
    • Self-attention layers with dot-product attention over the encoder outputs
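
The architecture listed above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the class name `ConvGRUEncoder`, the function `dot_product_attention`, all layer sizes, the two stride-2 convolutions, the bidirectional GRU, and the 1/sqrt(d) scaling of the attention scores are assumptions made for the example. It shows one encoder variant (2D convolution over a log-mel spectrogram followed by GRU layers) and the dot-product attention a decoder would apply over the encoder outputs.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ConvGRUEncoder(nn.Module):
    """2D conv over a log-mel spectrogram, then GRU layers (all sizes are assumptions)."""

    def __init__(self, n_mels=80, channels=32, hidden=128, gru_layers=2):
        super().__init__()
        # Two stride-2 convolutions downsample frequency and time by about 4x.
        self.conv = nn.Sequential(
            nn.Conv2d(1, channels, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(channels, channels, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
        )
        # Frequency bins left after the two convolutions (kernel 3, stride 2, padding 1).
        freq = n_mels
        for _ in range(2):
            freq = (freq + 2 * 1 - 3) // 2 + 1
        self.gru = nn.GRU(channels * freq, hidden, num_layers=gru_layers,
                          batch_first=True, bidirectional=True)
        # Project the concatenated forward/backward states back to `hidden`.
        self.proj = nn.Linear(2 * hidden, hidden)

    def forward(self, spec):
        # spec: (batch, n_mels, time)
        x = self.conv(spec.unsqueeze(1))                 # (batch, C, freq', time')
        b, c, f, t = x.shape
        x = x.permute(0, 3, 1, 2).reshape(b, t, c * f)   # (batch, time', C * freq')
        x, _ = self.gru(x)                               # (batch, time', 2 * hidden)
        return self.proj(x)                              # (batch, time', hidden)


def dot_product_attention(query, keys, values):
    """Scaled dot-product attention of decoder states over encoder outputs."""
    # query: (batch, steps, dim); keys and values: (batch, time', dim)
    scores = torch.bmm(query, keys.transpose(1, 2)) / keys.size(-1) ** 0.5
    weights = F.softmax(scores, dim=-1)                  # (batch, steps, time')
    context = torch.bmm(weights, values)                 # (batch, steps, dim)
    return context, weights


torch.manual_seed(0)
encoder = ConvGRUEncoder()
spec = torch.randn(2, 80, 100)       # random stand-in for log-mel spectrograms
memory = encoder(spec)               # encoder outputs
query = torch.randn(2, 5, 128)       # stand-in for 5 decoder GRU states
context, weights = dot_product_attention(query, memory, memory)
```

Each row of `weights` is a distribution over encoder time steps for one decoder step, which is the kind of matrix an attention plot visualizes. In a full decoder, `context` would typically be combined with the decoder state before predicting the next output token.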

Example spectrograms

[Figure: example spectrograms]

Corresponding attention matrices

[Figure: example attention matrices]
