This repository contains a from-scratch implementation of several fundamental components of the Transformer architecture. The goal is to understand and recreate the core mechanisms described in the paper "Attention Is All You Need" by Vaswani et al.
The main components implemented in this project are:
- Self-Attention: Computing attention weights between words in a sequence.
- Positional Encoding: Adding positional information to preserve word order in sequences.
- Encoder: Encoding block comprising attention, feed-forward layers, and residual connections.
- Decoder: Decoding block similar to the encoder with an additional attention mechanism over the encoder's output.
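As a rough illustration of the first two components, here is a minimal NumPy sketch of scaled dot-product self-attention and sinusoidal positional encoding. Function names and shapes are illustrative, not the repository's actual API:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, w_q, w_k, w_v):
    # Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    d_k = q.shape[-1]
    weights = softmax(q @ k.T / np.sqrt(d_k))  # (seq_len, seq_len) attention weights
    return weights @ v

def positional_encoding(seq_len, d_model):
    # Sinusoidal encodings: sine on even dimensions, cosine on odd dimensions
    pos = np.arange(seq_len)[:, None]
    i = np.arange(0, d_model, 2)[None, :]
    angles = pos / np.power(10000.0, i / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe

# Example: 4 tokens with model dimension 8
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8)) + positional_encoding(4, 8)
w_q, w_k, w_v = (rng.standard_normal((8, 8)) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)  # shape (4, 8)
```

Each row of the attention-weight matrix sums to 1, so every output token is a convex combination of the value vectors.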
The image above illustrates the full Transformer architecture as described in the paper "Attention is All You Need".
- Vaswani, A., et al. (2017). Attention Is All You Need. arXiv:1706.03762.
