tokenizer 'padding' param is not correct. #669

xgwang · 2025-04-10T15:58:09Z

Background:
I was evaluating models (such as Qwen2.5-7B-Instruct) against the AIME 2024 dataset, and the output looked poor. After digging, I found that the tokenizer pads the input to max_length, so the output is always 1 token.
After changing the param to 'longest', generation works well.

Otherwise, the response length is always 1, which is unexpected.
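
To illustrate the difference between the two padding strategies, here is a minimal pure-Python sketch. It is not the lighteval or transformers code: `PAD_ID`, `pad_batch`, and `remaining_budget` are hypothetical names, and the budget rule (context window minus padded prompt width, with a floor of 1) is an assumption made to mirror the symptom described above.

```python
from typing import List

PAD_ID = 0  # hypothetical pad token id, for illustration only


def pad_batch(batch: List[List[int]], padding: str, max_length: int) -> List[List[int]]:
    """Left-pad every sequence in the batch to a common width.

    padding="longest"    -> width of the longest sequence in the batch
    padding="max_length" -> always max_length, regardless of the batch
    Assumes every sequence already fits within max_length.
    """
    if padding == "longest":
        target = max(len(seq) for seq in batch)
    elif padding == "max_length":
        target = max_length
    else:
        raise ValueError(f"unsupported padding strategy: {padding!r}")
    return [[PAD_ID] * (target - len(seq)) + seq for seq in batch]


def remaining_budget(padded: List[List[int]], max_length: int) -> int:
    """Assumed rule: tokens left for generation once the padded prompt is
    counted against the context window, never less than 1."""
    prompt_width = len(padded[0])
    return max(1, max_length - prompt_width)


batch = [[5, 6, 7], [8, 9]]  # two short tokenized prompts
max_length = 8               # toy context window

longest = pad_batch(batch, "longest", max_length)
full = pad_batch(batch, "max_length", max_length)

print(len(longest[0]), remaining_budget(longest, max_length))
print(len(full[0]), remaining_budget(full, max_length))
```

Under these assumptions, padding to 'longest' keeps the prompt at the width of the longest input and leaves room for new tokens, while padding to 'max_length' fills the whole window and leaves a budget of a single token.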

HuggingFaceDocBuilderDev · 2025-04-17T10:39:40Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

#669)
otherwise, the response length is always 1 which is unexpected
Co-authored-by: xgw <[email protected]>
Co-authored-by: Nathan Habib <[email protected]>

xgw and others added 3 commits April 7, 2025 10:35

change tokenizer to pad to 'longest' sequence, instead of 'max_length'

1104426

otherwise, the response length is always 1 which is unexpected

Merge remote-tracking branch 'origin/main' into xg-fix-greeduntil

362c249

Merge branch 'main' into xg-fix-greeduntil

ca80c46

NathanHB mentioned this pull request Apr 17, 2025

[BUG] Transformers model padding should be to "longest" #663

Closed

NathanHB merged commit 88e3a3b into huggingface:main Apr 22, 2025
4 checks passed

xgwang deleted the xg-fix-greeduntil branch April 23, 2025 03:22

NathanHB added the bug label May 5, 2025
