Popular repositories Loading
-
-
TG-Shakespeare-GPT
TG-Shakespeare-GPT PublicThis is an implementation of a transformer-based language model, trained on the Tiny Shakespeare Dataset
-
assignment1-basics
assignment1-basics PublicForked from stanford-cs336/assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
Python
-
-
-
Transformer_Memory_Optimization
Transformer_Memory_Optimization PublicThis is a repo that aims to teach-by-doing the following memory optimization techniques: flash attention, MQA, GQA, Activation Checkpointing
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
