Skip to content

Conversation

@Thyre
Copy link
Collaborator

@Thyre Thyre commented Jul 2, 2025

(created using eb --new-pr)


One of the first EasyConfigs for the ROCm 6.4.1 stack. Doesn't require ROCm itself, therefore straight-forward.

Tests are not built as they require both a GPU and access to change hardware features, which is very unlikely to be available when built via EasyBuild. Many of these tests will be skipped with upcoming amd-smi versions due to requiring root access.


Sanity check cannot run amd-smi, as it fails on systems without the correct kernel modules loaded.
Unfortunately, this is true regardless of the command being passed to amd-smi...

…tches: amdsmi-25.4.2_handle-non-standard-rocm-paths.patch
@github-actions github-actions bot added the new label Jul 2, 2025
@Thyre
Copy link
Collaborator Author

Thyre commented Jul 2, 2025

Arch Linux w/ sytem ROCm

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
Linux - Linux Arch Linux UNKNOWN, x86_64, AMD Ryzen 7 7800X3D 8-Core Processor, 1 x AMD Navi 48 [RX 9070/9070 XT] (device id: 0x7550, gfx: gfx1201, driver: 6.15.4-arch2-1), 1 x AMD Raphael (device id: 0x164e, gfx: gfx1036, driver: 6.15.4-arch2-1), Python 3.13.5
See https://gist.github.com/Thyre/90d032a7b8fdcbdf959fa983b88d2710 for a full test report.

@Thyre
Copy link
Collaborator Author

Thyre commented Jul 2, 2025

Rocky 9.5 container without ROCm

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
jrc0850.jureca - Linux Rocky Linux 9.5 (Blue Onyx), x86_64, AMD EPYC 7443 24-Core Processor (zen3), Python 3.9.21
See https://gist.github.com/Thyre/26307eaaf4c3e1e3cd2262d146b11be6 for a full test report.

@Thyre
Copy link
Collaborator Author

Thyre commented Jul 2, 2025

Rocky 9.5 w/ system ROCm

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
jrc0850.jureca - Linux Rocky Linux 9.5 (Blue Onyx), x86_64, AMD EPYC 7443 24-Core Processor, 8 x AMD AMD Instinct MI250X / MI250 (model: 0x740c, driver: "6.12.12"), Python 3.9.21
See https://gist.github.com/Thyre/534963db657c5e5b668a8396181ef272 for a full test report.

@Thyre
Copy link
Collaborator Author

Thyre commented Jul 2, 2025

@boegelbot please test @ jsc-zen3
EB_ARGS="--installpath /tmp/$USER/pr23279"

amdsmi is more or less just a Python wrapper in libexec.

Signed-off-by: Jan André Reuter <[email protected]>
@boegelbot
Copy link
Collaborator

@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=23279 EB_ARGS="--installpath /tmp/$USER/pr23279" EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_23279 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 7102

Test results coming soon (I hope)...

- notification for comment with ID 3029338724 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 1 out of 2 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/d54350628caf2ab2872a7b0cc03c19c0 for a full test report.

@Thyre
Copy link
Collaborator Author

Thyre commented Jul 2, 2025

Test report by boegelbot FAILED Build succeeded for 1 out of 2 (1 easyconfigs in total) jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21 See https://gist.github.com/boegelbot/d54350628caf2ab2872a7b0cc03c19c0 for a full test report.

Need to check if amd-smi --help also expects to have an AMD GPU present...

Given that the output is the same, probably yes.

@Thyre Thyre marked this pull request as draft July 2, 2025 21:27
amd-smi always requires having specific kernel drivers loaded,
which won't be the case on all systems building amd-smi.
Therefore, just disable calling the tool.

Keep the comment though, in case someone wants to test amd-smi.

Signed-off-by: Jan André Reuter <[email protected]>
@Thyre
Copy link
Collaborator Author

Thyre commented Jul 3, 2025

@boegelbot please test @ jsc-zen3
EB_ARGS="--installpath /tmp/$USER/pr23279"

@boegelbot
Copy link
Collaborator

@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=23279 EB_ARGS="--installpath /tmp/$USER/pr23279" EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_23279 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 7105

Test results coming soon (I hope)...

- notification for comment with ID 3030847736 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 2 out of 2 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/e134ff77e833f3b7223b5491ce3d2bd3 for a full test report.

@Thyre Thyre marked this pull request as ready for review July 3, 2025 07:24
@Thyre
Copy link
Collaborator Author

Thyre commented Jul 3, 2025

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
ZAM054 - Linux Zorin OS 17, x86_64, 12th Gen Intel(R) Core(TM) i7-1260P, 1 x NVIDIA NVIDIA GeForce MX550, 570.133.07, Python 3.10.12
See https://gist.github.com/Thyre/f6a0d8097f5aa666ec86267438230f58 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
il-c04 - Linux Rocky Linux 9.5 (Blue Onyx), x86_64, AMD EPYC 7542 32-Core Processor, 1 x AMD Arcturus GL-XL [Instinct MI100] (model: 0x738c, driver: "6.8.5"), 1 x AMD Instinct MI210 (model: 0x740f, driver: "6.8.5"), Python 3.9.21
See https://gist.github.com/bedroge/bed00aa9d9ef83094f5a3225d925c2f8 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
bob-Latitude-5300 - Linux Ubuntu 24.04.2 LTS (Noble Numbat), x86_64, Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz, Python 3.12.3
See https://gist.github.com/bedroge/7c450078f5d88b0ba81a068ceae6a3fd for a full test report.

Calling amd-smi itself may fail without a driver, but we can
still grep for the version number.

Signed-off-by: Jan André Reuter <[email protected]>
@Thyre
Copy link
Collaborator Author

Thyre commented Jul 4, 2025

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
jrlogin01.jureca - Linux Rocky Linux 9.5, x86_64, AMD EPYC 7742 64-Core Processor, 2 x NVIDIA Quadro RTX 8000, 570.133.20, Python 3.9.21
See https://gist.github.com/Thyre/17d0b4787b3df9a2b3fa6c920b539c5f for a full test report.

@Thyre
Copy link
Collaborator Author

Thyre commented Jul 4, 2025

@boegelbot please test @ jsc-zen3
EB_ARGS="--installpath /tmp/$USER/pr23279"

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
il-c04 - Linux Rocky Linux 9.5 (Blue Onyx), x86_64, AMD EPYC 7542 32-Core Processor, 1 x AMD Arcturus GL-XL [Instinct MI100] (model: 0x738c, driver: "6.8.5"), 1 x AMD Instinct MI210 (model: 0x740f, driver: "6.8.5"), Python 3.9.21
See https://gist.github.com/bedroge/291783f45ccbe765ec1a0f599b164248 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
bob-Latitude-5300 - Linux Ubuntu 24.04.2 LTS (Noble Numbat), x86_64, Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz, Python 3.12.3
See https://gist.github.com/bedroge/93ac4438830f4de2629bed660c9688ab for a full test report.

@boegelbot
Copy link
Collaborator

@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=23279 EB_ARGS="--installpath /tmp/$USER/pr23279" EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_23279 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 7133

Test results coming soon (I hope)...

- notification for comment with ID 3034898945 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 2 out of 2 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/ef0af6850b1c111b479970ab067c0691 for a full test report.

@Thyre
Copy link
Collaborator Author

Thyre commented Jul 4, 2025

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
Linux - Linux Arch Linux UNKNOWN, x86_64, AMD Ryzen 7 7800X3D 8-Core Processor, 1 x AMD Navi 48 [RX 9070/9070 XT] (device id: 0x7550, gfx: gfx1201, driver: 6.15.4-arch2-1), 1 x AMD Raphael (device id: 0x164e, gfx: gfx1036, driver: 6.15.4-arch2-1), Python 3.13.5
See https://gist.github.com/Thyre/86003a55d0320c511b50263dcc4e52b5 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
il-c04 - Linux Rocky Linux 9.5 (Blue Onyx), x86_64, AMD EPYC 7542 32-Core Processor, 1 x AMD Arcturus GL-XL [Instinct MI100] (model: 0x738c, driver: "6.8.5"), 1 x AMD Instinct MI210 (model: 0x740f, driver: "6.8.5"), Python 3.9.21
See https://gist.github.com/bedroge/87a9382556e8ae92a6431c456635bd04 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
bob-Latitude-5300 - Linux Ubuntu 24.04.2 LTS (Noble Numbat), x86_64, Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz, Python 3.12.3
See https://gist.github.com/bedroge/a862d144acfd3f1452d8348937f0c38b for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

@boegelbot please test @ jsc-zen3

Copy link
Contributor

@bedroge bedroge left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me

@boegelbot
Copy link
Collaborator

@bedroge: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=23279 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_23279 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 7138

Test results coming soon (I hope)...

- notification for comment with ID 3035141665 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 2 out of 2 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/36759fb6ed5e074d016a6067944a8fee for a full test report.

@bedroge bedroge added this to the 5.1.1 milestone Jul 4, 2025
@bedroge
Copy link
Contributor

bedroge commented Jul 4, 2025

Going in, thanks @Thyre!

@bedroge bedroge merged commit 4aba96d into easybuilders:develop Jul 4, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants