TemporalSAE: don't apply decoding bias if weights are tied and bias wasn't applied at encoding by danra · Pull Request #703 · decoderesearch/SAELens

danra · 2026-06-12T00:07:25Z

Description

TemporalSAE previously added the decoder bias unconditionally when decoding. The bias should be added only when the encoder and decoder weights are untied or, if they are tied, when the bias was also subtracted from the input at encoding time.
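The rule described above can be written as a small predicate. This is an illustrative sketch only: `should_add_b_dec` is a hypothetical helper and not a function in SAELens. The flag names `tied_weights` and `apply_b_dec_to_input` are the config fields that appear in this PR's diff.

```python
from itertools import product


def should_add_b_dec(tied_weights: bool, apply_b_dec_to_input: bool) -> bool:
    """Hypothetical helper mirroring the condition used in this PR's diff."""
    # Untied weights: the decoder bias is always added.
    # Tied weights: the bias is added only if it was subtracted at encoding.
    return (not tied_weights) or apply_b_dec_to_input


# Enumerate all four flag combinations.
truth_table = {
    (tied, applied): should_add_b_dec(tied, applied)
    for tied, applied in product([False, True], repeat=2)
}

for (tied, applied), add in truth_table.items():
    print(f"tied_weights={tied!s:5} apply_b_dec_to_input={applied!s:5} -> add b_dec: {add}")
```

The only combination that skips the bias is tied weights with `apply_b_dec_to_input` disabled, which is the case this PR changes.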

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist:

My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing tests pass locally with my changes
I have not rewritten tests relating to key interfaces which would affect backward compatibility

You have tested formatting, typing and tests

I have run make check-ci to check format and linting. (you can run make format to format code if needed.)

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Expands TemporalSAE’s bias-handling logic to support a new apply_b_dec_to_input mode (especially relevant when weights are tied) and updates tests to cover the new configuration combinations.

Changes:

Conditioned adding b_dec in TemporalSAE.decode() and TemporalSAE.forward() based on tied_weights + apply_b_dec_to_input.
Broadened TemporalSAE unit tests to parametrize over apply_b_dec_to_input (and tied_weights for decode).
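A check over the flag combinations might look roughly like the following. This is a sketch with a toy stand-in decoder written in plain Python: `ToyCfg`, `toy_decode`, and `check_all_combinations` are hypothetical names for illustration and are not the code in `tests/saes/test_temporal_sae.py` or `sae_lens/saes/temporal_sae.py`.

```python
from dataclasses import dataclass
from itertools import product


@dataclass
class ToyCfg:
    # Hypothetical config holding only the two flags discussed in this PR.
    tied_weights: bool
    apply_b_dec_to_input: bool


def toy_decode(feature_acts, W_dec, b_dec, cfg):
    """Toy decode: feature_acts @ W_dec, plus b_dec only when the flags allow it."""
    d_in = len(b_dec)
    out = [
        sum(act * row[j] for act, row in zip(feature_acts, W_dec))
        for j in range(d_in)
    ]
    if not cfg.tied_weights or cfg.apply_b_dec_to_input:
        out = [o + b for o, b in zip(out, b_dec)]
    return out


def check_all_combinations():
    feature_acts = [1.0, 2.0]
    W_dec = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]  # shape: 2 features x 3 inputs
    b_dec = [0.5, 0.5, 0.5]
    results = {}
    # Stand-in for parametrizing a test over both flags.
    for tied, applied in product([False, True], repeat=2):
        cfg = ToyCfg(tied_weights=tied, apply_b_dec_to_input=applied)
        results[(tied, applied)] = toy_decode(feature_acts, W_dec, b_dec, cfg)
    return results


results = check_all_combinations()
```

In an actual test suite the loop would more naturally be expressed with a parametrized test, one case per flag combination.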

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
tests/saes/test_temporal_sae.py	Expands test parametrization to cover `apply_b_dec_to_input` (and `tied_weights` for decode).
sae_lens/saes/temporal_sae.py	Makes decoder bias application conditional based on config flags.

+        if not self.cfg.tied_weights or self.cfg.apply_b_dec_to_input:
+            sae_out = sae_out + self.b_dec


+        x_recons = torch.matmul(z_novel + z_pred, self.W_dec)
+        if not self.cfg.tied_weights or self.cfg.apply_b_dec_to_input:
+            x_recons = x_recons + self.b_dec


        # Decode novel codes
        sae_out = torch.matmul(feature_acts, self.W_dec)
-        sae_out = sae_out + self.b_dec
+        if not self.cfg.tied_weights or self.cfg.apply_b_dec_to_input:


danra added 2 commits June 11, 2026 16:39

test: expand parameterization of a few TemporalSAE tests

a7affb6

fix: TemporalSAE: don't apply decoding bias if weights are tied and bias wasn't applied at encoding

8bd2855

Copilot AI review requested due to automatic review settings June 12, 2026 00:07

Copilot AI reviewed Jun 12, 2026

danra wants to merge 2 commits into decoderesearch:main from danra:temporal_fix_b_dec

