feat: add fastercache and pab #92
Conversation
LGTM
Looks good! Mostly small comments before it can be merged.
I re-added my comments that were dropped in the last review ;)
| """ | ||
| imported_modules = self.import_algorithm_packages() | ||
| # set default values | ||
| temporal_attention_block_skip_range: Optional[int] = None | 
I would still recommend putting them in the smash config as constants and mentioning that they can be overridden for different architectures, with a link to the code file or the diffusers PR, so that the documentation is complete.
I agree. The best solution would be to use the pipeline-specific defaults unless the user explicitly specifies a parameter. Unfortunately, our SmashConfig interface doesn't support that logic right now. To implement it, we'd have to apply the same defaults across every pipeline. Given the large number of parameters, and the fact that most aren't straightforward to tune, I'll leave things as they are for now. This approach gives users strong out-of-the-box results and a straightforward interface, albeit with fewer tuning options.
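For context, here is a minimal sketch of the "pipeline-specific defaults unless the user overrides" logic described above. The `PIPELINE_DEFAULTS` table and `resolve_cache_defaults` helper are purely illustrative; nothing like this exists in the SmashConfig interface today, and the values shown are placeholders:

```python
# Hypothetical sketch of per-pipeline default resolution (not in this PR).
PIPELINE_DEFAULTS = {
    "FluxPipeline": {"spatial_attention_block_skip_range": 2},
    "CogVideoXPipeline": {
        "spatial_attention_block_skip_range": 2,
        "temporal_attention_block_skip_range": 4,
    },
}

def resolve_cache_defaults(pipeline_cls: str, user_config: dict) -> dict:
    # Start from the pipeline-specific defaults, then let any value the
    # user explicitly set take precedence.
    resolved = dict(PIPELINE_DEFAULTS.get(pipeline_cls, {}))
    resolved.update(user_config)
    return resolved
```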
Makes sense. As discussed async, let's make it clear in the PR description that this PR introduces pipeline-specific defaults that will need some iteration :)
```python
temporal_attention_block_skip_range: Optional[int] = None
spatial_attention_timestep_skip_range: Tuple[int, int] = (-1, 681)
temporal_attention_timestep_skip_range: Optional[Tuple[int, int]] = None
low_frequency_weight_update_timestep_range: Tuple[int, int] = (99, 901)
high_frequency_weight_update_timestep_range: Tuple[int, int] = (-1, 301)
unconditional_batch_skip_range: int = 5
unconditional_batch_timestep_skip_range: Tuple[int, int] = (-1, 641)
spatial_attention_block_identifiers: Tuple[str, ...] = (
    "blocks.*attn1",
    "transformer_blocks.*attn1",
    "single_transformer_blocks.*attn1"
)
temporal_attention_block_identifiers: Tuple[str, ...] = ("temporal_transformer_blocks.*attn1",)
attention_weight_callback = lambda _: 0.5  # noqa: E731
tensor_format: str = "BFCHW"
is_guidance_distilled: bool = False
```
I would still recommend putting them in the smash config as constants and mentioning that they can be overridden for different architectures, with a link to the code file or the diffusers PR, so that the documentation is complete :)
Let's go!
* fix: correct docstring in deepcache
* feat: add model checks
* feat: add pyramid attention broadcast (pab) cacher
* feat: add fastercache cacher
* tests: add flux tiny random fixture
* tests: add algorithms tests for pab and fastercache
* tests: add combination tests for pab and fastercache
* fix: add 1 as value for interval parameter
Description
This PR adds the PAB and FasterCache algorithms from diffusers (https://huggingface.co/docs/diffusers/main/api/cache).
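For reference, this is roughly how the two cachers are enabled through the linked diffusers API (adapted from the diffusers cache docs; exact argument names and defaults may differ between versions):

```python
import torch
from diffusers import CogVideoXPipeline, PyramidAttentionBroadcastConfig

pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
pipe.to("cuda")

# Skip recomputing spatial attention on some timesteps and broadcast the
# cached outputs instead.
config = PyramidAttentionBroadcastConfig(
    spatial_attention_block_skip_range=2,
    spatial_attention_timestep_skip_range=(100, 800),
    current_timestep_callback=lambda: pipe.current_timestep,
)
pipe.transformer.enable_cache(config)
```

FasterCache is wired up the same way, with a `FasterCacheConfig` passed to `enable_cache`.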
Type of Change
How Has This Been Tested?
I manually tested both caching mechanisms across all supported Diffusers pipelines, visually inspected the resulting images and videos, and measured the inference time (the relative speedups match this benchmark). For FLUX, I evaluated every supported combination of algorithms. I also implemented new tests for each algorithm and for all of its combinations with the other algorithms.
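For reference, a rough sketch of the kind of wall-clock timing used for the speedup comparison (the `time_pipeline` helper and its arguments are illustrative, not part of this PR):

```python
import time
import torch

def time_pipeline(pipe, prompt: str, n_warmup: int = 1, n_runs: int = 3) -> float:
    """Average wall-clock seconds per call; run before and after enabling a cacher."""
    for _ in range(n_warmup):
        pipe(prompt, num_inference_steps=28)
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(n_runs):
        pipe(prompt, num_inference_steps=28)
    torch.cuda.synchronize()
    return (time.perf_counter() - start) / n_runs
```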
Checklist
Additional Notes
I tried to make these cachers work with compilation but ran into errors. The main problem is that these methods introduce a data-dependent condition around the attention layers, which hinders compilation.
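A minimal, hypothetical repro of that kind of failure (the function, shapes, and skip condition are illustrative, not the actual cacher code):

```python
import torch

def attention_or_cache(x, step, cached):
    # The cachers wrap attention in a branch like this: on "skip" steps the
    # cached output is reused instead of recomputing attention.
    if step % 2 == 0:  # data-dependent condition on a tensor
        return cached
    return torch.softmax(x, dim=-1)  # stand-in for the real attention

compiled = torch.compile(attention_or_cache, fullgraph=True)
# Branching on a tensor value cannot be captured in a single graph, so this
# raises a torch._dynamo error (or graph-breaks without fullgraph=True).
compiled(torch.randn(2, 8), torch.tensor(3), torch.randn(2, 8))
```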