Skip to content

Curious about Glow implementation: some weights look frozen? #20

@christabella

Description

@christabella

Hi Krzysztof,

When visualizing the distribution of weights and gradients of each tensor over training, I noticed that some the weights don't seem to be updating. E.g. InvertibleConv1x1Layer's U_mat, L_mat, and log_S.
screenshot_2019-12-19_11-40-01

My first thought was that maybe the gradients are too small, but it doesn't look like that's the case:
IMAGE 2019-12-20 14:39:48
image
Weights remain mostly constant:
image
image

But gradients are... pretty explosive 😔
image

I didn't change the core code and used the high-level API, but trained it on a different task and it is plugged into a larger model.

I will try running the original example you provided and report back with that, but in the meantime I was wondering if you (or anyone else) had any early ideas about this. Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions