Small improvements to early nuts behaviour #5824
Codecov Report
@@           Coverage Diff            @@
##            main    #5824     +/-   ##
=========================================
- Coverage   89.40%   89.38%    -0.03%
=========================================
  Files          74       74
  Lines       13772    13771        -1
=========================================
- Hits        12313    12309        -4
- Misses       1459     1462        +3
looks good!
two questions:
- is the change from log_weighted_accept_sum to log_accept_sum what is making the change in step size adaptation? I am having trouble seeing that.
- these changes are careful enough that I might ask for, say, 10k samples from a random 100d gaussian to show that it still provides unbiased samples.
is this for the future?
That got here by accident...
I can take it out, but it is useful for the new code in covadapt (which I think is pretty nice, actually, and I'll try to get it into pymc at some point).
If you want to leave it in, I'm OK with it (so long as it has a comment!). I wonder what you think of algorithm 2 in https://proceedings.mlr.press/v151/hoffman22a/hoffman22a.pdf as a means of estimating scales?
Thomas sent me a link to that paper just this morning. I also wonder what I'll think about algorithm 2; it looks fascinating, though. ;-)
If you'd like to compare ideas with what I did in covadapt, I just wrote a sketch of an intro in the readme: https://github.com/aseyboldt/covadapt
If you don't understand what I'm talking about over there, that's my fault. I'll try to improve it soon. :-)
Yes. Maybe it is easier to see if you only look at the first commit.
That should also be covered in the tests already (https://github.com/pymc-devs/pymc/blob/main/pymc/tests/test_posteriors.py), but you are right to ask; I'll also run that manually to make sure. (Also, we have to run the long-running test manually.)
Notebook with the promised verification of the posterior: https://gist.github.com/aseyboldt/43354eded52981340304d3aec94ced3c
Can we include that as a slow test (which will not run by default)?
@ColCarroll can we get a binary review from you :D?
apologies for the wait, and thanks for the verification, @aseyboldt!
Thanks @aseyboldt and @ColCarroll!
I guess this is an enhancement of the NUTS algorithm, so it will change results for NUTS users (i.e., most users), we assume for the better. My impression (defer to Adrian!) is that these changes should be fairly subtle, or only matter for the most careful or difficult models, so it does not require (say) a minor version release (which I'd suggest doing if the old algorithm was arguably wrong) and can go out whenever the next release was planned anyway.
Step size adaptation:
Currently, we compute the acceptance rate of a draw using only the part of the trajectory that is accepted at the end. This way we discard additional information about energy errors in the remaining, rejected part. After this PR, we compute the mean acceptance rate over all leapfrog steps we take.
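The difference can be sketched in a few lines of numpy. The array names and values below are purely illustrative, not the actual PyMC internals:

```python
import numpy as np

# Hypothetical per-leapfrog-step acceptance probabilities from one NUTS
# trajectory: min(1, exp(H0 - H)) at each step.
accept_probs = np.array([0.95, 0.90, 0.40, 0.85, 0.99, 0.20])
# Which steps ended up in the subtree that is accepted at the end
# (illustrative mask).
in_accepted_subtree = np.array([True, True, False, False, True, True])

# Old behaviour (sketch): average only over the subtree kept at the end,
# discarding energy-error information from the rejected part.
old_stat = accept_probs[in_accepted_subtree].mean()

# New behaviour (sketch): average over every leapfrog step taken, so the
# rejected part of the trajectory still informs step size adaptation.
new_stat = accept_probs.mean()
```

With these made-up numbers, the two statistics differ (0.76 vs. 0.715), so the dual-averaging step size adaptation would see a different signal.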
Divergences:
Currently, we count large deviations from the initial energy as a divergence regardless of the direction of the energy error.
This PR changes this so that only large energy errors that correspond to a low acceptance probability are considered a divergence. I think this helps with some models in the initial phase of sampling, where with the old behavior we would stop promising trajectories that are "too good", leading to worse performance.
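A minimal sketch of the two divergence criteria, in terms of the initial energy H0 and the current energy H (the threshold value and function names are illustrative, not PyMC's actual code):

```python
MAX_ENERGY_ERROR = 1000.0  # illustrative divergence threshold


def is_divergence_old(H0, H):
    # Old behaviour (sketch): any large deviation from the initial
    # energy counts as a divergence, regardless of sign.
    return abs(H - H0) > MAX_ENERGY_ERROR


def is_divergence_new(H0, H):
    # New behaviour (sketch): only energy errors that imply a very low
    # acceptance probability count. Since log p_accept ~ H0 - H, only
    # a large *increase* in energy (H >> H0) triggers a divergence;
    # a large energy drop ("too good" a trajectory) does not.
    return (H0 - H) < -MAX_ENERGY_ERROR
```

Under the old rule a trajectory whose energy drops far below H0 is cut short; under the new rule it keeps going.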
Also a little bit of housekeeping:
This introduces an additional sampler statistic: index_in_trajectory. I don't see much of a use case for this other than teaching and debugging, but it proved useful while comparing the sampler to other implementations.
I also introduced some array copies for the integrated momentum array. I thought I saw bugs while comparing sampler behavior to nuts-rs, but somehow I can't reproduce those now. Still, I think it's better to be safe.