Skip to content

Conversation

@jfgrimm
Copy link
Member

@jfgrimm jfgrimm commented May 6, 2022

(created using eb --new-pr)

….0-GCCcore-11.3.0.eb, PMIx-4.1.2-GCCcore-11.3.0.eb
@jfgrimm jfgrimm added the update label May 6, 2022
@jfgrimm jfgrimm added this to the next release (4.5.5?) milestone May 6, 2022
@jfgrimm jfgrimm mentioned this pull request May 6, 2022
2 tasks
@SebastianAchilles
Copy link
Member

@boegelbot please test @ generoso

@boegelbot
Copy link
Collaborator

@SebastianAchilles: Request for testing this PR well received on login1

PR test command 'EB_PR=15456 EB_ARGS= /opt/software/slurm/bin/sbatch --job-name test_PR_15456 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 8507

Test results coming soon (I hope)...

- notification for comment with ID 1119403366 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@SebastianAchilles
Copy link
Member

@boegelbot please test @ jsc-zen2

@boegelbot
Copy link
Collaborator

@SebastianAchilles: Request for testing this PR well received on jsczen2l1.int.jsc-zen2.easybuild-test.cluster

PR test command 'EB_PR=15456 EB_ARGS= /opt/software/slurm/bin/sbatch --job-name test_PR_15456 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen2.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 1179

Test results coming soon (I hope)...

- notification for comment with ID 1119403995 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
jsczen2c1.int.jsc-zen2.easybuild-test.cluster - Linux Rocky Linux 8.5, x86_64, AMD EPYC 7742 64-Core Processor (zen2), Python 3.6.8
See https://gist.github.com/ee7b33c1f6cb37afeb65419a7779d6be for a full test report.

@SebastianAchilles
Copy link
Member

Test report by @SebastianAchilles
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
zen2-ubuntu-eb - Linux Ubuntu 22.04, x86_64, AMD EPYC 7452 32-Core Processor (zen2), Python 3.10.4
See https://gist.github.com/1ea6b0407d08c54a2d2cc0d32df0f413 for a full test report.

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
cns1 - Linux Rocky Linux 8.5, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/9f5e64941e818a055042f420ed85b5e8 for a full test report.

@branfosj
Copy link
Member

branfosj commented May 6, 2022

Test report by @branfosj
FAILED
Build succeeded for 1 out of 3 (3 easyconfigs in total)
bear-pg0306u07a.bear.cluster - Linux RHEL 8.5, POWER, 8335-GTX (power9le), 4 x NVIDIA Tesla V100-SXM2-16GB, 470.57.02, Python 3.6.8
See https://gist.github.com/a1fdab96247b6029f20bd7de32e7d6ed for a full test report.

@branfosj
Copy link
Member

branfosj commented May 6, 2022

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0105u36b.bear.cluster - Linux RHEL 8.5, x86_64, Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz (icelake), Python 3.6.8
See https://gist.github.com/4bef5210f23e21076e2a1d322f5cd88c for a full test report.

@branfosj
Copy link
Member

branfosj commented May 6, 2022

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0211u03a.bear.cluster - Linux RHEL 8.5, x86_64, Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (cascadelake), Python 3.6.8
See https://gist.github.com/f112e550551de360070d545cb6c7781c for a full test report.

@branfosj
Copy link
Member

branfosj commented May 6, 2022

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0211u08b.bear.cluster - Linux Ubuntu 20.04, x86_64, Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (cascadelake), Python 3.8.5
See https://gist.github.com/f3db186ee7caaf89afb971e4d95069ef for a full test report.

@branfosj
Copy link
Member

branfosj commented May 6, 2022

POWER9 failure for libfabric:

In file included from ./prov/opx/include/rdma/opx/fi_opx_reliability.h:44,
                 from ./prov/opx/include/rdma/opx/fi_opx_atomic.h:40,
                 from prov/opx/src/fi_opx_atomic.c:34:
./prov/opx/include/rdma/opx/fi_opx_timer.h:66:2: error: #error "Cycle timer not defined for this platform"
   66 | #error "Cycle timer not defined for this platform"
      |  ^~~~~

So we know about this failure. However, because I expect we'll have issues creating a foss toolchain (see #12968), I am not going to investigate this failure.

@jfgrimm
Copy link
Member Author

jfgrimm commented May 6, 2022

POWER9 failure for libfabric:

In file included from ./prov/opx/include/rdma/opx/fi_opx_reliability.h:44,
                 from ./prov/opx/include/rdma/opx/fi_opx_atomic.h:40,
                 from prov/opx/src/fi_opx_atomic.c:34:
./prov/opx/include/rdma/opx/fi_opx_timer.h:66:2: error: #error "Cycle timer not defined for this platform"
   66 | #error "Cycle timer not defined for this platform"
      |  ^~~~~

So we know about this failure. However, because I expect we'll have issues creating a foss toolchain (see #12968), I am not going to investigate this failure.

there is a patch linked upstream (ofiwg/libfabric#7573)

Copy link
Contributor

@bartoldeman bartoldeman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We may want to wait for 1.15.1

@branfosj
Copy link
Member

branfosj commented May 6, 2022

Test report by @branfosj
SUCCESS
Build succeeded for 0 out of 0 (3 easyconfigs in total)
bear-pg0211u08b.bear.cluster - Linux Ubuntu 20.04, x86_64, Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (cascadelake), Python 3.8.5
See https://gist.github.com/670cf084427cacff67141e9b653a5202 for a full test report.

@jfgrimm jfgrimm requested a review from bartoldeman May 16, 2022 11:16
Copy link
Contributor

@bartoldeman bartoldeman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just one minor thing, lgtm otherwise.

@branfosj
Copy link
Member

POWER9 failure for libfabric:

In file included from ./prov/opx/include/rdma/opx/fi_opx_reliability.h:44,
                 from ./prov/opx/include/rdma/opx/fi_opx_atomic.h:40,
                 from prov/opx/src/fi_opx_atomic.c:34:
./prov/opx/include/rdma/opx/fi_opx_timer.h:66:2: error: #error "Cycle timer not defined for this platform"
   66 | #error "Cycle timer not defined for this platform"
      |  ^~~~~

So we know about this failure. However, because I expect we'll have issues creating a foss toolchain (see #12968), I am not going to investigate this failure.

there is a patch linked upstream (ofiwg/libfabric#7573)

That is not in 1.15.1. See https://github.com/ofiwg/libfabric/blob/v1.15.1/prov/opx/configure.m4

@jfgrimm
Copy link
Member Author

jfgrimm commented May 16, 2022

@branfosj I've added the OPX patch

@SebastianAchilles
Copy link
Member

Test report by @SebastianAchilles
SUCCESS
Build succeeded for 4 out of 4 (4 easyconfigs in total)
zen2-ubuntu-eb - Linux Ubuntu 22.04, x86_64, AMD EPYC 7452 32-Core Processor (zen2), Python 3.10.4
See https://gist.github.com/f811bae3fb712c4f2df0550f5ef8a79c for a full test report.

@branfosj
Copy link
Member

Test report by @branfosj
SUCCESS
Build succeeded for 4 out of 4 (4 easyconfigs in total)
bear-pg0306u23a.bear.cluster - Linux RHEL 8.5, POWER, 8335-GTX (power9le), 4 x NVIDIA Tesla V100-SXM2-16GB, 470.57.02, Python 3.6.8
See https://gist.github.com/16964120191b274b69388371ac3ad23a for a full test report.

@SebastianAchilles SebastianAchilles changed the title {lib}[GCCcore/11.3.0] libevent v2.1.12, libfabric v1.15.0, PMIx v4.1.2 {lib}[GCCcore/11.3.0] libevent v2.1.12, libfabric v1.15.1, PMIx v4.1.2 May 16, 2022
@boegel
Copy link
Member

boegel commented May 27, 2022

@boegelbot please test @ generoso

@boegelbot
Copy link
Collaborator

@boegel: Request for testing this PR well received on login1

PR test command 'EB_PR=15456 EB_ARGS= /opt/software/slurm/bin/sbatch --job-name test_PR_15456 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 8578

Test results coming soon (I hope)...

- notification for comment with ID 1139353525 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
cns1 - Linux Rocky Linux 8.5, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/617c244ded483a57329271ada0bc78a9 for a full test report.

@boegel
Copy link
Member

boegel commented May 27, 2022

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
fair-mastodon-c6g-2xlarge-0001 - Linux rocky linux 8.5, AArch64, ARM UNKNOWN (graviton2), Python 3.6.8
See https://gist.github.com/03c24b9050c992ce941766c2edebd53e for a full test report.

@boegel
Copy link
Member

boegel commented May 27, 2022

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
node3141.skitty.os - Linux RHEL 8.4, x86_64, Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz, Python 3.6.8
See https://gist.github.com/cc2d5f68315c8a6d65d63ae9df3ebd44 for a full test report.

@branfosj
Copy link
Member

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0105u36b.bear.cluster - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz (icelake), Python 3.6.8
See https://gist.github.com/29aed9eabd6a68464fd6fc010c61ad43 for a full test report.

@branfosj
Copy link
Member

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0306u03a.bear.cluster - Linux RHEL 8.5, POWER, 8335-GTX (power9le), 4 x NVIDIA Tesla V100-SXM2-16GB, 470.57.02, Python 3.6.8
See https://gist.github.com/4f9458e8e42601f28588131a33f5dcac for a full test report.

@branfosj
Copy link
Member

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0211u08b.bear.cluster - Linux Ubuntu 20.04, x86_64, Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (cascadelake), Python 3.8.5
See https://gist.github.com/004e886689a14e326bdee7b61d8df031 for a full test report.

@branfosj
Copy link
Member

Test report by @branfosj
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
bear-pg0211u03a.bear.cluster - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz (cascadelake), Python 3.6.8
See https://gist.github.com/6df28bdf088cadf58ebc7f755a59fcb3 for a full test report.

@boegel boegel dismissed bartoldeman’s stale review May 27, 2022 11:15

requested changes made

Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Copy link
Contributor

@bartoldeman bartoldeman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@bartoldeman bartoldeman merged commit 436fd61 into easybuilders:develop May 27, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants