@deniskristak
Contributor

@deniskristak deniskristak commented Aug 5, 2021

@deniskristak deniskristak changed the title adding easyconfigs: jax-0.2.18-foss-2020b.eb WIP: adding easyconfigs: jax-0.2.18-foss-2020b.eb Aug 5, 2021
@boegel boegel added the new label Aug 5, 2021
@boegel boegel added this to the 4.x milestone Aug 5, 2021
@Flamefire
Contributor

Need another 2 changes in build.py:

Member

@boegel boegel left a comment

@deniskristak The names of the patch files as they are included in the PR are incorrect.

I fixed that in deniskristak#3, along with a couple of other things, like including checksums.

boegel added 5 commits August 13, 2021 17:17
…lding jaxlib in parallel, use pip to install all Python packages, enable sanity_pip_check, fix and rename patch file for TensorFlow sources
…ak/easybuild-easyconfigs into 20210805110301_new_pr_jax0218
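
The commit message above mentions installing with pip and enabling `sanity_pip_check`. As a rough sketch only (this is not copied from the PR, and the surrounding easyconfig is omitted), these are ordinary easyconfig parameters for Python packages and bundles, written in Python syntax:

```python
# Hypothetical easyconfig excerpt, assuming a Python package/bundle easyblock;
# the actual values in this PR's easyconfigs are not shown in this thread.
use_pip = True           # install the Python packages with pip
sanity_pip_check = True  # run 'pip check' as part of the sanity check step
```

The real files in the PR are the place to look for the exact settings.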
@boegel boegel changed the title WIP: adding easyconfigs: jax-0.2.18-foss-2020b.eb {tools}[foss/2020b] jax v0.2.18 Aug 13, 2021
@boegel boegel changed the title {tools}[foss/2020b] jax v0.2.18 {tools}[foss/2020b] jax v0.2.19 Aug 13, 2021
@boegel
Member

boegel commented Aug 13, 2021

Test report by @boegel
FAILED
Build succeeded for 2 out of 3 (3 easyconfigs in total)
node3111.skitty.os - Linux centos linux 7.9.2009, x86_64, Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz, Python 3.6.8
See https://gist.github.com/8ece804dca9e7ea990fb2dbfa0714095 for a full test report.

edit: hmm, one of the tests failed...

_____________________________________________ XlaBridgeTest.test_timer_tpu_warning _____________________________________________
[gw2] linux -- Python 3.8.6 /software/Python/3.8.6-GCCcore-10.2.0/bin/python3.8

self = <xla_bridge_test.XlaBridgeTest testMethod=test_timer_tpu_warning>, _ = <MagicMock name='make_tpu_client' id='47426097730176'>

    @mock.patch('jax.lib.xla_client.make_tpu_client', side_effect=mock_tpu_client)
    def test_timer_tpu_warning(self, _):
      with warnings.catch_warnings(record=True) as w:
        warnings.simplefilter('always')
        xb.tpu_client_timer_callback(0.01)
>       self.assertLen(w, 1)
E       AssertionError: [] has length of 0, expected 1.

tests/xla_bridge_test.py:72: AssertionError
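
For readers unfamiliar with the pattern in the failing test, here is a minimal sketch of how `warnings.catch_warnings(record=True)` collects warnings into a list, and how that list stays empty when the code under test never warns. `warn_if_slow` and `captured_warnings` are hypothetical stand-ins, not JAX functions, and the sketch does not reproduce or diagnose the failure above.

```python
import warnings


def warn_if_slow(elapsed_secs, limit_secs):
    """Hypothetical stand-in for a timer callback: warn only past the limit."""
    if elapsed_secs > limit_secs:
        warnings.warn(f"initialisation took longer than {limit_secs}s")


def captured_warnings(elapsed_secs, limit_secs):
    """Return the warnings recorded while calling warn_if_slow."""
    with warnings.catch_warnings(record=True) as caught:
        # 'always' turns off the once-per-location de-duplication,
        # so every emitted warning is recorded.
        warnings.simplefilter("always")
        warn_if_slow(elapsed_secs, limit_secs)
    return caught


slow = captured_warnings(elapsed_secs=0.05, limit_secs=0.01)
fast = captured_warnings(elapsed_secs=0.0, limit_secs=0.01)
print(len(slow), len(fast))
```

An assertion such as `assertLen(w, 1)` passes only in the first case; in the second, the recorded list is `[]`, which matches the shape of the error message in the report.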

@boegel
Member

boegel commented Aug 13, 2021

@boegelbot please test @ generoso
CORE_CNT=16

@boegelbot
Collaborator

@boegel: Request for testing this PR well received on generoso

PR test command 'EB_PR=13622 EB_ARGS= /apps/slurm/default/bin/sbatch --job-name test_PR_13622 --ntasks="16" ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 18132

Test results coming soon (I hope)...

- notification for comment with ID 898704698 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegel
Member

boegel commented Aug 13, 2021

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
node3111.skitty.os - Linux centos linux 7.9.2009, x86_64, Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz, Python 3.6.8
See https://gist.github.com/5891da8c2282beb78f56f429cf1e49f6 for a full test report.

@boegel
Member

boegel commented Aug 13, 2021

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
node3525.doduo.os - Linux RHEL 8.2, x86_64, AMD EPYC 7552 48-Core Processor (zen2), Python 3.6.8
See https://gist.github.com/dbe92c04b4452fa1055631a989bc9a0a for a full test report.

@boegel
Member

boegel commented Aug 13, 2021

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
node2607.swalot.os - Linux centos linux 7.9.2009, x86_64, Intel(R) Xeon(R) CPU E5-2660 v3 @ 2.60GHz (haswell), Python 3.6.8
See https://gist.github.com/ef3d0914628552932371ffaca619f0a2 for a full test report.

@boegelbot
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
generoso-x-1 - Linux centos linux 8.2.2004, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/655945a4cfadc9314f2d179b8f9ed876 for a full test report.

@boegel boegel modified the milestones: 4.x, next release (4.4.2?) Aug 14, 2021
@boegel
Member

boegel commented Aug 14, 2021

This can/should probably be cleaned up by implementing a custom easyblock for jaxlib.

I'll merge this as is though, since it's working and it clears a major hurdle on the way to AlphaFold. I'll keep the custom easyblock as an exercise for an upcoming PR...

Member

@boegel boegel left a comment

lgtm

@boegel
Member

boegel commented Aug 14, 2021

Going in, thanks @deniskristak!

@boegel boegel merged commit d8c7e81 into easybuilders:develop Aug 14, 2021
@Flamefire
Contributor

Test report by @Flamefire
FAILED
Build succeeded for 2 out of 3 (3 easyconfigs in total)
taurusi8002 - Linux centos linux 7.9.2009, x86_64, AMD EPYC 7352 24-Core Processor (zen2), Python 2.7.5
See https://gist.github.com/de777494f99609e7c0c32a739adaf447 for a full test report.

@Flamefire
Contributor

Test report by @Flamefire
FAILED
Build succeeded for 2 out of 3 (3 easyconfigs in total)
taurusi6606.taurus.hrsk.tu-dresden.de - Linux RHEL 7.9, x86_64, Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz (haswell), Python 2.7.5
See https://gist.github.com/13f062a38935f5541d68de71b5113a15 for a full test report.
