Curious about Glow implementation: some weights look frozen? #20

Hi Krzysztof,

When visualizing the distribution of weights and gradients of each tensor over training, I noticed that some of the weights don't seem to be updating, e.g. `InvertibleConv1x1Layer`'s `U_mat`, `L_mat`, and `log_S`.
![screenshot_2019-12-19_11-40-01](https://user-images.githubusercontent.com/2038751/71255264-196cfa80-2336-11ea-9d2e-0f01235bf95d.png)
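
For reference, my understanding is that these three tensors correspond to the LU parameterization of the invertible 1x1 convolution described in the Glow paper. The mapping of `L_mat`, `U_mat`, and `log_S` onto $L$, $U$, and $\log|s|$ below is my assumption from the names; I have not verified it against the code:

$$
W = P \, L \, \bigl(U + \operatorname{diag}(s)\bigr), \qquad \log\lvert\det W\rvert = \sum_i \log\lvert s_i\rvert
$$

Here $P$ is a fixed permutation matrix, $L$ is lower triangular with ones on the diagonal, $U$ is strictly upper triangular, and $s$ is a vector. If the layer follows this scheme, these three are the only trainable parts of the 1x1 convolution, which is why them looking frozen worries me.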

My first thought was that maybe the gradients are too small, but it doesn't look like that's the case:
![IMAGE 2019-12-20 14:39:48](https://user-images.githubusercontent.com/2038751/71255447-9304e880-2336-11ea-9cd2-feecb1874b27.jpg)
![image](https://user-images.githubusercontent.com/2038751/71259945-50e1a400-2342-11ea-8e5c-a6bfdfbb6cf1.png)
Weights remain mostly constant:
![image](https://user-images.githubusercontent.com/2038751/71260332-4bd12480-2343-11ea-8ba3-8f9878e14c6c.png)
![image](https://user-images.githubusercontent.com/2038751/71264280-6bb91600-234c-11ea-9071-718b59fa620e.png)
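
To put a number on "mostly constant" instead of eyeballing histograms, something like the following could compare two snapshots of each tensor. This is only a sketch: `relative_change`, `report_frozen`, the `threshold` value, and the snapshot dictionaries are hypothetical names and made-up numbers, not part of the library or taken from my run. It assumes each tensor has been flattened to a plain list of floats.

```python
import math


def relative_change(before, after, eps=1e-12):
    """Return ||after - before|| / (||before|| + eps) for two flat sequences."""
    if len(before) != len(after):
        raise ValueError("snapshots must have the same number of elements")
    diff_norm = math.sqrt(sum((a - b) ** 2 for a, b in zip(after, before)))
    base_norm = math.sqrt(sum(b ** 2 for b in before))
    return diff_norm / (base_norm + eps)


def report_frozen(snap_a, snap_b, threshold=1e-6):
    """Map each tensor name to (relative change, looks_frozen flag)."""
    report = {}
    for name in snap_a:
        change = relative_change(snap_a[name], snap_b[name])
        # A tensor "looks frozen" if it moved less than `threshold`
        # relative to its own norm between the two snapshots.
        report[name] = (change, change < threshold)
    return report


# Made-up values for illustration only; not taken from the run above.
step_0 = {"U_mat": [0.5, -0.2, 0.1], "log_S": [0.0, 0.3, -0.1], "kernel": [1.0, 2.0]}
step_1000 = {"U_mat": [0.5, -0.2, 0.1], "log_S": [0.0, 0.3, -0.1], "kernel": [1.1, 1.9]}

for name, (change, frozen) in report_frozen(step_0, step_1000).items():
    print(f"{name}: relative change = {change:.3e}, frozen = {frozen}")
```

A relative change of exactly zero would point to the variable not being updated at all (for example, not being in the optimizer's variable list), whereas a small but nonzero value would suggest the updates are just tiny compared to the weight scale.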

But gradients are... pretty explosive 😔
![image](https://user-images.githubusercontent.com/2038751/71260361-5be90400-2343-11ea-99e8-f241bb47815f.png)


I didn't change the core code and only used the high-level API, but I trained it on a different task, with the flow plugged into a larger model.

I will try running the original example you provided and report back with that, but in the meantime I was wondering if you (or anyone else) had any early ideas about this. Thanks!
