hidden state index at t=1 during backprop

During packpropagation, at t=2, when he calculates the gradient of the h, he writes this:

`h_weight_grad += hiddens[1,:][:,np.newaxis] @ h2_grad`

Taking into account the hidden state at t=1. But later, when he backprops at t=1, he writes:

`h_weight_grad += hiddens[1,:][:,np.newaxis] @ h1_grad`

It's not clear to me why he uses the t=1 hidden state again: shouldn't it be hiddens[0,:] since that gradient is sensitive to the previous hidden state?

Thanks in advance to whomever may help me.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hidden state index at t=1 during backprop #9

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

hidden state index at t=1 during backprop #9

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions