Add all2all initial impl #51

danielhua23 · 2025-08-20T02:29:18Z

This PR adds reference.py, submission.py, and the task description.

I added an initial set of shapes, all taken from popular models such as the DeepSeek series, gpt-oss, and Mixtral. Please check that they work with reference.py and submission.py on your end.

Thanks

msaroufim · 2025-08-26T04:37:55Z

problems/amd/all2all/reference.py

+    cfg, all_rank_data = data
+    world_size = 8
+
+    mp.set_start_method("spawn", force=True)


We were doing this on our end. I'm still unsure whether we should set the start method ourselves or have users do it. It seems fine as long as we provide a helper or copy-pastable code in the template for folks. cc @ngc92
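For illustration, a copy-pastable helper of the kind mentioned above might look like the following. This is a minimal sketch, not code from the PR: the name `ensure_spawn_start_method` is hypothetical, and it uses the standard-library `multiprocessing` module on the assumption that the `mp` alias in reference.py exposes the same `get_start_method` / `set_start_method` calls.

```python
import multiprocessing as mp


def ensure_spawn_start_method() -> str:
    """Select the "spawn" start method once and return the active method.

    Hypothetical helper: safe to call repeatedly, and it only forces a
    change when some other start method has already been selected.
    """
    if mp.get_start_method(allow_none=True) != "spawn":
        # force=True mirrors the call in the diff above and overrides any
        # start method chosen earlier in the process.
        mp.set_start_method("spawn", force=True)
    return mp.get_start_method()


if __name__ == "__main__":
    print(ensure_spawn_start_method())
```

Calling the helper from either the harness or a submission has the same effect, which is one way to sidestep the question of who owns the call.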

msaroufim · 2025-08-26T04:43:57Z

problems/amd/all2all/reference.py

+        # num experts per rank
+        self.num_local_experts = cfg.num_experts // world_size
+        # max recv tokens per rank
+        self.max_recv = cfg.max_num_tokens * cfg.experts_per_token


Noob question: is reasonable load balancing guaranteed?
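To make the question concrete, here is a small sketch that counts how many token-expert pairs each rank would receive for a given routing. It is not taken from reference.py: the function name, the toy routing table, and the contiguous expert-to-rank mapping (`expert_id // num_local_experts`) are assumptions for illustration. Whether `max_recv` covers the worst case depends on how the reference generates its routing, which this excerpt does not show.

```python
from collections import Counter


def recv_tokens_per_rank(topk_ids, num_experts, world_size):
    """Count token-expert pairs delivered to each rank.

    Assumes experts are laid out contiguously, so expert e lives on
    rank e // num_local_experts (an assumption, not confirmed by the PR).
    """
    num_local_experts = num_experts // world_size
    counts = Counter()
    for expert_ids in topk_ids:
        for expert_id in expert_ids:
            counts[expert_id // num_local_experts] += 1
    # Counter returns 0 for ranks that received nothing.
    return [counts[rank] for rank in range(world_size)]


# Hypothetical routing: 3 tokens, 2 experts per token, 4 experts on 2 ranks.
topk_ids = [[0, 1], [0, 1], [2, 3]]
per_rank = recv_tokens_per_rank(topk_ids, num_experts=4, world_size=2)
print(per_rank)  # [4, 2]
```

In this toy case rank 0 receives twice as many pairs as rank 1, so top-k routing alone does not force the per-rank counts to be equal.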

danielhua23 added 3 commits August 20, 2025 02:26

add all2all initial impl

e526f13

add moe compute at combine

a24cdde

revert

d66f03c

msaroufim pushed a commit that referenced this pull request Aug 20, 2025

fixes for a couple of things in leaderboard cog (#51)

8e59226

danielhua23 added 2 commits August 25, 2025 06:00

define all2all problem shapes and add roofline

424396e

fix typos

a62277e

msaroufim reviewed Aug 26, 2025


S1ro1 added 4 commits August 26, 2025 13:10

Feat: conform to kernelbot infrastructure

b14e0f7

Fix: typo

7a281f5

Feat: reenable tests, add basic check_implementation

81e2f45

Feat: move to separate leaderboard

e5db664

S1ro1 merged commit a38f4e8 into gpu-mode:main Aug 31, 2025
