BBH-ja (日本語版BIG-Bench Hard)

BBH-jaは、BIG-Bench Hard (Paper, GitHub) を翻訳したベンチマークデータセットである。大規模言語モデルの日本語によるChain of Thought能力の評価に用いることを想定している。

評価に用いる問題 (bbh-ja/) とfew-shot exampleに用いるためのprompt (cot-prompts/) からなる。 bbh-ja/はsubjectごとにjsonlで保存されており、inputに問題文、targetに答えが保存されている。cot-prompts/はsubjectごとにyamlで保存されており、inputに問題文、targetにCoTを含む答えが保存されている。

翻訳にはPLaMo Translation Modelを利用した。

BBH-ja is a Japanese translation of the BIG-Bench Hard benchmark dataset (Paper, GitHub). It is designed specifically for evaluating Japanese-language Chain of Thought capabilities in large language models.

The dataset consists of two components: evaluation problems (stored in bbh-ja/) and prompts for few-shot examples (stored in cot-prompts/). bbh-ja/ is organized by corpus subject and stored in JSON Lines format, with problem statements contained in input fields and correct answers stored in target fields. cot-prompts/ is also organized by corpus subject but uses YAML format, with problem statements in input fields and answers containing Chain of Thought reasoning in target fields.

We used the PLaMo Translation Model for translation.

Built with PLaMo

Citation

If you use this repository, please cite this repository:

@misc{mikami2025-bbhja
    title={{BBH-ja}},
    year={2025},
    url = {https://github.com/pfnet-research/bbh-ja}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
bbh-ja		bbh-ja
cot-prompts		cot-prompts
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BBH-ja (日本語版BIG-Bench Hard)

Citation

About

Uh oh!

Releases

Packages

License

pfnet-research/bbh-ja

Folders and files

Latest commit

History

Repository files navigation

BBH-ja (日本語版BIG-Bench Hard)

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages