
Preliminary support for sageattention3 #9047


Open
Panchovix wants to merge 12 commits into master

Conversation

@Panchovix commented Jul 25, 2025

Add preliminary support for SageAttention 3 (https://github.com/thu-ml/SageAttention and https://huggingface.co/jt-zhang/SageAttention3) as an attention backend.

Help with the implementation is welcome: when testing txt2img and img2img with SDXL on an RTX 5090 on Linux, I'm getting ~10% slower speeds than with SageAttention 2, so I'm probably missing something here. EDIT: Thanks to Kijai, I have updated per_block_mean to False, which should make a difference.

It would be great if someone could test with video.

Also, maybe it would be a good idea to have different flags in comfy.model_management and comfy.cli_args for the two versions? (A rough sketch of what that could look like is below.)

I'm not sure what the implications of this are, but it seems to make sage 3 actually run.
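
A minimal sketch of how the two flags could be declared, assuming an argparse setup like the one in comfy/cli_args.py (the flag names match the ones added later in this PR; the parser and group objects here are illustrative, not the actual ComfyUI code):

```python
# Illustrative only -- the real comfy/cli_args.py wiring may differ.
import argparse

parser = argparse.ArgumentParser()

# Keep the two backends mutually exclusive so only one can be selected at a time.
attn_group = parser.add_mutually_exclusive_group()
attn_group.add_argument("--use-sage-attention", action="store_true",
                        help="Use SageAttention 1.x/2.x as the attention backend.")
attn_group.add_argument("--use-sage-attention3", action="store_true",
                        help="Use SageAttention 3.x as the attention backend.")

args = parser.parse_args()
```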
@comfyanonymous (Owner)

I don't feel like filling out approval forms, so I'll wait until it's public before testing this.

For the line-ending check: if you sync with master, it should be fixed.

@pamparamm (Contributor)

@Panchovix I agree with your last point: it's probably better to separate sageattention and sageattention3 into different flags, as they have different signatures and expect different tensor shapes as well.
Also, I'm getting worse performance with sageattention3 compared to sageattention2 on SDXL on Windows, and switching per_block_mean doesn't help (I'm using the MSVC-compatible version from https://huggingface.co/jt-zhang/SageAttention3/discussions/5).
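
To make the signature mismatch concrete, here is a hedged sketch of a single wrapper over both backends. The SageAttention 2 call (sageattn with tensor_layout) follows its published API; the sage3_attention callable and its argument layout are placeholders, since the SageAttention 3 entry point isn't public yet, and only per_block_mean is taken from this thread:

```python
# Sketch of a unified wrapper over the two backends.
# Assumptions: q/k/v arrive as (batch, heads, seq_len, head_dim) ("HND").
# `sage3_attention` is a placeholder for whatever entry point SageAttention 3
# ends up exposing; only `per_block_mean` comes from this discussion.
import torch
from sageattention import sageattn  # SageAttention 1.x/2.x


def unified_sage_attention(q, k, v, use_sage3=False, sage3_attention=None):
    if not use_sage3:
        # SageAttention 2 takes the tensor layout explicitly.
        return sageattn(q, k, v, tensor_layout="HND", is_causal=False)

    # Hypothetical sage3 path: if the kernel expects (batch, seq_len, heads, head_dim),
    # transpose in and back out around the call.
    q3, k3, v3 = (t.transpose(1, 2).contiguous() for t in (q, k, v))
    out = sage3_attention(q3, k3, v3, per_block_mean=False)
    return out.transpose(1, 2).contiguous()
```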

@Panchovix (Author)

Okay, I have separated the SageAttention versions with different flags:

  • --use-sage-attention for sage 1.x/2.x
  • --use-sage-attention3 for sage 3.x

Now they work separately (so you can have both installed in the venv and use the one you want).
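
A rough sketch of how the selection could look on top of the two flags (the helper names are illustrative and assume comfy.cli_args exposes the parsed args object; ComfyUI's actual attention dispatch may differ):

```python
# Illustrative sketch of flag-based backend selection; not the actual ComfyUI code.
from comfy.cli_args import args


def sage_attention_enabled():
    return args.use_sage_attention


def sage_attention3_enabled():
    return args.use_sage_attention3


def pick_attention_backend(default_backend):
    # Prefer sage3 only when its flag is set; otherwise fall back to sage 1.x/2.x,
    # or to the default backend (e.g. PyTorch SDPA) when neither flag is given.
    if sage_attention3_enabled():
        return "sageattention3"
    if sage_attention_enabled():
        return "sageattention"
    return default_backend
```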

I also still get lower performance on SDXL for some reason, but I'm not sure whether my implementation is incorrect and that's what causes the perf regression.

Kijai seems to get better performance on video, as he mentioned in https://huggingface.co/jt-zhang/SageAttention3/discussions/3#6883b71543d2651d281c9cc0, but that appears to be a custom implementation from kijai/ComfyUI-WanVideoWrapper@a35eb7d.

Any ideas are welcome.
