
Conversation

@andyleiserson
Contributor

@andyleiserson andyleiserson commented Sep 12, 2025

Resolves #7679. Fixes #7854.

This moves command encoding so that it happens within CommandEncoder.finish, rather than within the individual methods for each command. The motivation is to allow generating better barriers in the command stream in the future (this PR does not change the barriers), and it also fixes a validation error that could occur in certain cases.
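To illustrate the general pattern (this is a hypothetical sketch with made-up types, not wgpu's actual API): commands are only recorded when their methods are called, and all backend encoding is deferred until finish, where the full command stream is visible.

```rust
// Hypothetical sketch of encode-on-finish. `Command`, `CommandEncoder`,
// and the string "backend" output are illustrative, not wgpu's real types.
#[derive(Debug)]
enum Command {
    CopyBuffer { src: u32, dst: u32, size: u64 },
    Clear { buffer: u32 },
}

struct CommandEncoder {
    // Commands are only recorded here; no backend work happens yet.
    recorded: Vec<Command>,
}

impl CommandEncoder {
    fn new() -> Self {
        Self { recorded: Vec::new() }
    }

    // Each command method just pushes onto the list.
    fn copy_buffer(&mut self, src: u32, dst: u32, size: u64) {
        self.recorded.push(Command::CopyBuffer { src, dst, size });
    }

    fn clear(&mut self, buffer: u32) {
        self.recorded.push(Command::Clear { buffer });
    }

    // All backend encoding happens here, with the whole command stream
    // visible at once, which is what makes better barrier placement possible.
    fn finish(self) -> Vec<String> {
        self.recorded
            .iter()
            .map(|cmd| format!("encode {:?}", cmd))
            .collect()
    }
}

fn main() {
    let mut enc = CommandEncoder::new();
    enc.clear(1);
    enc.copy_buffer(1, 2, 64);
    let encoded = enc.finish();
    assert_eq!(encoded.len(), 2);
    println!("{}", encoded.join("\n"));
}
```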

The change is divided into commits, but I'm not sure that makes reviewing easier other than by providing a way to bookmark review progress. (Which I think could equally well be done by noting which files you have reviewed.)

A lot of code has changed, but many of the changes are rote -- e.g. referencing things via state: EncodingState rather than via cmd_buf_data. Some particular things to focus on might be the comment on struct EncodingState, everything in command/mod.rs, and handling of compute / render passes.

With this PR, two copies of every command are saved when tracing -- one with Ids for the trace, and one with Arcs for encoding. I am still working on resolving that.

Testing
Mostly via existing tests, but this enables the tests from #7854 that are sensitive to when command encoding happens, which provides some directed coverage.

I did some performance testing, which had somewhat inconclusive results. When I ran the entire wgpu benchmark (which takes hours), it looked like things might be significantly (10-40%) slower. But since it took hours, I was doing other things on my system at the same time. When I isolated a specific test case and adjusted it to a more reasonable runtime, things were within a few percent before and after the changes.

Squash or Rebase? Probably squash. Things mostly work at the intermediate stages, but there are also weird failures when commands are encoded out of order, which will not necessarily be obvious without context.

Checklist

  • Need to add a changelog entry.

@JMS55
Collaborator

JMS55 commented Sep 13, 2025

Would be nice to run Bevy and see what the performance difference is.

@teoxoy teoxoy self-assigned this Sep 18, 2025
Member

@teoxoy teoxoy left a comment


Looks great!

@andyleiserson
Contributor Author

I ran the bevy_render benchmark before and after this change; it seems like it may be very slightly slower, but the difference is below criterion's threshold for reporting a change on any single benchmark:

layers_intersect        time:   [2.0196 ns 2.0215 ns 2.0242 ns]
                        change: [−1.0243% −0.8111% −0.5820%] (p = 0.00 < 0.05)
                        Change within noise threshold.

smooth_normals          time:   [3.3578 ms 3.3621 ms 3.3667 ms]
                        change: [−1.4260% −1.0745% −0.7489%] (p = 0.00 < 0.05)
                        Change within noise threshold.

angle_weighted_normals  time:   [3.3651 ms 3.3690 ms 3.3731 ms]
                        change: [−0.4215% −0.0445% +0.2781%] (p = 0.80 > 0.05)
                        No change in performance detected.

face_weighted_normals   time:   [1.1781 ms 1.1843 ms 1.1909 ms]
                        change: [−1.0657% −0.2984% +0.5387%] (p = 0.47 > 0.05)
                        No change in performance detected.

flat_normals            time:   [559.91 µs 561.46 µs 563.11 µs]
                        change: [−1.1946% −0.6108% −0.0669%] (p = 0.04 < 0.05)
                        Change within noise threshold.

build_torus             time:   [2.2850 ns 2.3552 ns 2.4392 ns]
                        change: [−6.4441% −3.2043% +0.2574%] (p = 0.07 > 0.05)
                        No change in performance detected.

@Wumpf
Member

Wumpf commented Sep 21, 2025

With this PR, two copies of every command are saved when tracing -- one with Ids for the trace, and one with Arcs for encoding. I am still working on resolving that.

Interesting, we've been in that state before transiently :)

@jimblandy
Member

For what it's worth: this PR is important to Firefox.

Member

@cwfitzgerald cwfitzgerald left a comment


I think this is good progress and we should land it.

Just so that I understand correctly, this doesn't yet resolve the issue of us needing multiple underlying command buffers in renderpasses - that would be a future pass?

@andyleiserson
Contributor Author

Just so that I understand correctly, this doesn't yet resolve the issue of us needing multiple underlying command buffers in renderpasses - that would be a future pass?

Correct. This PR does change the behavior for non-pass operations directly on a command encoder -- previously each was given its own command buffer; now they will reuse a single command buffer. But each pass will still have two command buffers, one for the "pre-pass" and one for the actual pass.
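The buffer layout described above can be sketched roughly as follows (hypothetical types and names, not wgpu's actual internals): consecutive direct commands are coalesced into one buffer, while each pass still contributes a "pre-pass" buffer plus a buffer for the pass itself.

```rust
// Hypothetical sketch of how an encoder's command stream maps onto
// backend command buffers. `Recorded` and `split_into_hal_buffers`
// are illustrative names, not wgpu's real API.
#[derive(Debug)]
enum Recorded {
    Direct(&'static str), // e.g. a copy issued directly on the encoder
    Pass(&'static str),   // a whole render/compute pass
}

/// Split the recorded stream into per-backend command buffers.
fn split_into_hal_buffers(commands: &[Recorded]) -> Vec<String> {
    let mut buffers = Vec::new();
    let mut direct_run: Vec<&'static str> = Vec::new();
    for cmd in commands {
        match cmd {
            // Direct commands accumulate into a single shared buffer.
            Recorded::Direct(name) => direct_run.push(*name),
            // A pass flushes the shared buffer, then adds two of its own.
            Recorded::Pass(name) => {
                if !direct_run.is_empty() {
                    buffers.push(format!("direct: {}", direct_run.join(", ")));
                    direct_run.clear();
                }
                buffers.push(format!("pre-pass for {}", name));
                buffers.push(format!("pass {}", name));
            }
        }
    }
    if !direct_run.is_empty() {
        buffers.push(format!("direct: {}", direct_run.join(", ")));
    }
    buffers
}

fn main() {
    let cmds = [
        Recorded::Direct("copy A"),
        Recorded::Direct("copy B"),
        Recorded::Pass("render"),
    ];
    let buffers = split_into_hal_buffers(&cmds);
    // Two direct copies share one buffer; the pass adds two more.
    assert_eq!(buffers.len(), 3);
    for b in &buffers {
        println!("{}", b);
    }
}
```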

@cwfitzgerald
Member

I'm not too concerned about performance here as we can always optimize later, plus this is far from its final form.

@cwfitzgerald
Member

Merging.

@cwfitzgerald cwfitzgerald merged commit 1967900 into gfx-rs:trunk Sep 25, 2025
41 checks passed
@andyleiserson andyleiserson deleted the encode-on-finish branch September 25, 2025 18:56
sharmajai pushed a commit to sharmajai/wgpu that referenced this pull request Oct 12, 2025
