Fix: ignore tokens should be set to -100, not -1 by maxlchen · Pull Request #19 · alexa/dialoglue

maxlchen · 2022-05-25T22:17:04Z

Issue #, if available:

Description of changes:
Correcting the value of unmasked indices in the target tensor in mask_tokens().
Currently, line 348 is: labels[~masked_indices] = -1
But, the value of -1 breaks the loss function. From BertForMaskedLM documentation: "Indices should be in [-100, 0, ..., config.vocab_size] ... Tokens with indices set to -100 are ignored (masked), the loss is only computed for the tokens with labels in [0, ..., config.vocab_size]"
This value constraint appears to hold in every version of the HuggingFace API which has documentation for BertForMaskedLM.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Loss cannot be computed if labels[~masked_indices] = -1. From BertForMaskedLM documentation: "Indices should be in [-100, 0, ..., config.vocab_size] ... Tokens with indices set to -100 are ignored (masked), the loss is only computed for the tokens with labels in [0, ..., config.vocab_size]"

Fix: ignore tokens in target tensor should be set to -100, not -1

maxlchen added 2 commits May 25, 2022 15:12

Merge pull request #1 from maxlchen/unmasked-index-values

7c06fac

Fix: ignore tokens in target tensor should be set to -100, not -1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: ignore tokens should be set to -100, not -1#19

Fix: ignore tokens should be set to -100, not -1#19
maxlchen wants to merge 2 commits into
alexa:masterfrom
maxlchen:master

maxlchen commented May 25, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

maxlchen commented May 25, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant