Skip to content

[BUG] DatasetNotFoundError: Dataset 'asas-ai/AraTrust-categorized' doesn't exist on the Hub or cannot be accessed. #441

@BobTsang1995

Description

@BobTsang1995

It seems that the asas-ai/AraTrust-categorized dataset does not exist on hugging face. can you guys fix it?@alielfilali01

Describe the bug

When trying to run lighteval with custom Arabic evaluation tasks, it fails with a DatasetNotFoundError, indicating that the dataset 'asas-ai/AraTrust-categorized' cannot be found on the Hugging Face Hub.

To Reproduce

  1. Set up the conda environment with Python 3.10
  2. Install required packages
  3. Run the following command:
accelerate launch --multi_gpu --num_processes=8 -m lighteval \
accelerate "pretrained=/mnt/sg_nas/liheng/Marco_checkpoint/Qwen2-7B-mmmlu-latest/checkpoint-1150,dtype=bfloat16,max_length=16384" \
"examples/tasks/OALL_v2_tasks.txt" \
--custom-tasks "community_tasks/arabic_evals.py" \
--output-dir="./evals/"

Full Error Message

[rank3]: DatasetNotFoundError: Dataset 'asas-ai/AraTrust-categorized' doesn't exist on the Hub or cannot be accessed.

Version info

  • Operating System: (Linux)
  • Python Version: Python 3.10

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions