exclusive self attention

This repository compares a baseline causal self-attention model (vanilla) against an exclusive self attention variant.

Paper

The paper is included here:

2603.09078v1.pdf

Implementation Summary

The implementation is in train.py.

What changed for exclusive self attention

Inside the attention head output computation, we remove the component of the output vector along the value vector direction:

Compute dot product: dot_product = sum(out * v)
Compute squared norm: v_norm_sq = sum(v * v)
Compute projected component: component = (dot_product / (v_norm_sq + 1e-8)) * v
Subtract projection: out = out - component

This behavior is toggled with use_exclusive_self_attention=True.

Training Comparison

The script runs both configurations:

vanilla
exclussive self attention

It saves per-run CSV logs and comparison plots to outputs_compare.

Train/Val Loss Logs (Images)

Train loss comparison

Validation loss comparison

How to run

python train.py

After training completes, plots and CSV logs are available in outputs_compare/.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
outputs_compare		outputs_compare
paper		paper
README.md		README.md
input.txt		input.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

exclusive self attention

Paper

Implementation Summary

What changed for exclusive self attention

Training Comparison

Train/Val Loss Logs (Images)

Train loss comparison

Validation loss comparison

How to run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

exclusive self attention

Paper

Implementation Summary

What changed for exclusive self attention

Training Comparison

Train/Val Loss Logs (Images)

Train loss comparison

Validation loss comparison

How to run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages