@xgwang
Contributor

@xgwang xgwang commented Apr 10, 2025

Background:
I was evaluating models (such as Qwen2.5-7B-Instruct) against the AIME 2024 dataset, and the output looked poor. After digging in, I found that the tokenizer pads the input to max_length, so the output is always 1 token.
After changing the padding parameter to 'longest', generation works correctly.
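
To illustrate why the padding mode matters, here is a minimal pure-Python sketch of the effect. It is not the library's code: `pad_batch`, `generation_budget`, `PAD_ID`, and the rule "budget = max_length minus padded width, clamped to at least 1" are all assumptions made for illustration.

```python
from typing import List

PAD_ID = 0  # hypothetical pad token id, for illustration only


def pad_batch(batch: List[List[int]], padding: str, max_length: int) -> List[List[int]]:
    """Left-pad every sequence to a common width chosen by the padding mode."""
    if padding == "max_length":
        width = max_length  # always pad up to the full context size
    elif padding == "longest":
        width = max(len(ids) for ids in batch)  # pad only to the longest input
    else:
        raise ValueError(f"unknown padding mode: {padding}")
    return [[PAD_ID] * (width - len(ids)) + ids[:width] for ids in batch]


def generation_budget(padded_width: int, max_length: int, requested: int) -> int:
    """Assumed rule: new tokens are limited to the room left after the padded input."""
    return max(1, min(requested, max_length - padded_width))


batch = [[5, 6, 7], [8, 9]]  # two short tokenized prompts
max_length = 16              # assumed model context size
requested = 8                # assumed requested number of new tokens

padded_full = pad_batch(batch, "max_length", max_length)
padded_longest = pad_batch(batch, "longest", max_length)

# Padding to max_length fills the whole context, leaving room for one token.
budget_full = generation_budget(len(padded_full[0]), max_length, requested)
# Padding to the longest input leaves the rest of the context for generation.
budget_longest = generation_budget(len(padded_longest[0]), max_length, requested)

print(budget_full, budget_longest)
```

Under these assumptions, padding to max_length uses up the whole context before generation starts, while padding to the longest input in the batch leaves the requested budget intact.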

@HuggingFaceDocBuilderDev
Collaborator

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@NathanHB NathanHB merged commit 88e3a3b into huggingface:main Apr 22, 2025
4 checks passed
@xgwang xgwang deleted the xg-fix-greeduntil branch April 23, 2025 03:22
@NathanHB NathanHB added the bug label May 5, 2025
hynky1999 pushed a commit that referenced this pull request May 22, 2025
#669)

otherwise, the response length is always 1 which is unexpected

Co-authored-by: xgw <[email protected]>
Co-authored-by: Nathan Habib <[email protected]>
NathanHB added a commit that referenced this pull request Sep 19, 2025
#669)

otherwise, the response length is always 1 which is unexpected

Co-authored-by: xgw <[email protected]>
Co-authored-by: Nathan Habib <[email protected]>