graykode (Owner) commented on Oct 3, 2019

In ALBERT (Lan et al.), there is no detail about the 80% masking rule:
[screenshot: excerpt of the masked-LM description from the ALBERT paper]

However, the n-gram masking paper (Joshi et al., 2019) does describe the 80/10/10 split:

> As in BERT, we also mask 15% of the tokens in total: replacing 80% of the masked tokens with [MASK], 10% with random tokens and 10% with the original tokens. However, we perform this replacement at the span level and not for each token individually; i.e. all the tokens in a span are replaced with [MASK] or sampled tokens.
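
For reference, here is a minimal token-level sketch of the 80/10/10 rule (not the span-level variant from SpanBERT, and not taken from this repo's code; `mask_tokens`, `vocab`, and `mask_token` are illustrative names):

```python
import random

def mask_tokens(tokens, vocab, mask_token="[MASK]", mask_prob=0.15):
    """Apply BERT-style masking: select 15% of positions, then 80/10/10."""
    tokens = list(tokens)
    n_mask = max(1, int(round(len(tokens) * mask_prob)))
    positions = random.sample(range(len(tokens)), n_mask)

    labels = [None] * len(tokens)        # None = not a prediction target
    for pos in positions:
        labels[pos] = tokens[pos]        # original token is the target
        r = random.random()
        if r < 0.8:                      # 80%: replace with [MASK]
            tokens[pos] = mask_token
        elif r < 0.9:                    # 10%: replace with a random token
            tokens[pos] = random.choice(vocab)
        # remaining 10%: keep the original token unchanged
    return tokens, labels
```

In the span-level variant described by Joshi et al., the 80/10/10 decision would be made once per sampled span, so every token in that span gets the same treatment.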
