Skip to content
This repository was archived by the owner on Nov 22, 2022. It is now read-only.

Always create non-empty logits / targets for MLM #979

Closed
wants to merge 1 commit into from

Conversation

mwu1993
Copy link
Contributor

@mwu1993 mwu1993 commented Sep 13, 2019

Summary:
D17241503 selects only masked tokens for the final logits / targets during MLM. This fails when there are no masked tokens (e.g. at the end of a file there can be a very short batch). In this case, select just the first token in the first batch.

BTW, the new masking strategy is faster (20%) than the old one - f138005051 vs f138005044.

Differential Revision: D17370855

@facebook-github-bot facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Sep 13, 2019
mwu1993 pushed a commit to mwu1993/pytext-1 that referenced this pull request Sep 16, 2019
Summary:
Pull Request resolved: facebookresearch#979

D17241503 selects only masked tokens for the final logits / targets during MLM. This fails when there are no masked tokens (e.g. at the end of a file there can be a very short batch). In this case, select just the first token in the first batch.

BTW, the new masking strategy is faster (20%) than the old one - f138005051 vs f138005044.

Differential Revision: D17370855

fbshipit-source-id: 0304da89c0ccd99083e761cb8f648e01456ebdd8
Summary:
Pull Request resolved: facebookresearch#979

D17241503 selects only masked tokens for the final logits / targets during MLM. This fails when there are no masked tokens (e.g. at the end of a file there can be a very short batch). In this case, select just the first token in the first batch.

BTW, the new masking strategy is faster (20%) than the old one - f138005051 vs f138005044.

Reviewed By: borguz

Differential Revision: D17370855

fbshipit-source-id: 62f0540fb94819c6a269ea067b1c0b9e08c82119
@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 513abc9.

rutyrinott pushed a commit to rutyrinott/pytext-1 that referenced this pull request Sep 25, 2019
Summary:
Pull Request resolved: facebookresearch#979

D17241503 selects only masked tokens for the final logits / targets during MLM. This fails when there are no masked tokens (e.g. at the end of a file there can be a very short batch). In this case, select just the first token in the first batch.

BTW, the new masking strategy is faster (20%) than the old one - f138005051 vs f138005044.

Differential Revision: D17370855

fbshipit-source-id: 453ccc4797c0d976281aa14212eba43b6f9b5d6f
rutyrinott pushed a commit to rutyrinott/pytext-1 that referenced this pull request Sep 25, 2019
Summary:
Pull Request resolved: facebookresearch#979

D17241503 selects only masked tokens for the final logits / targets during MLM. This fails when there are no masked tokens (e.g. at the end of a file there can be a very short batch). In this case, select just the first token in the first batch.

BTW, the new masking strategy is faster (20%) than the old one - f138005051 vs f138005044.

Differential Revision: D17370855

fbshipit-source-id: 6c8dbd6346e8f8db029047171cb81f147dc70a43
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
CLA Signed Do not delete this pull request or issue due to inactivity. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants