Skip to content

Conversation

@egretholier
Copy link

@egretholier egretholier commented Apr 20, 2025

Cons:

Pros:

@lucidrains
Copy link
Owner

@egretholier hey, thanks for adding this! could you throw in a test as well as make sure it asserts out if autoregressive or has any masking (masked batch norm is doable, but probably too complex for what you wish to do)

@lucidrains
Copy link
Owner

@egretholier for the record, i did try https://arxiv.org/abs/2003.07845 back in the day (collaborating with Aran Komat during a time when transformers were relatively unknown) and did not see much of anything.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants