Skip to content

Setup CI to check on 2 GPUs #2466

@vfdev-5

Description

@vfdev-5

As CircleCI removed resource class with 2 GPUs we can not run tests on 2 GPUs with Circle CI:

Resource class "gpu.medium" is not a valid resource class. The default resource class will be used.

Available resource classes are only 1 GPU: https://circleci.com/docs/2.0/configuration-reference/#gpu-executor-linux

There is a feature request on below link (but IMO it is a regression):

I submitted a request:

We can think of using AWS as PyTorch does or setup a custom runner on our own multi-gpu servers.

cc @sdesrozis @trsvchn @fco-dv


  • Set up GHA self-hosted runner and ported unit tests on 2 GPUs and check the training on CIFAR10
  • Port HVD tests on 2 GPUs from config.yml

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions