Optimize LZMA range decoder #910

jdpurcell · 2025-03-22T22:22:18Z

I noticed this duplicated bit of logic inside the range decoder. Turns out, there's a helper function called Normalize2 that does the exact same thing and was never even called. This quirk came all the way from the original LZMA C# SDK it seems. I can see why, as with .NET Framework 4.8, attempting to use the helper function results in performance degradation, so someone manually inlined it. But that's not even necessary, as [MethodImpl(MethodImplOptions.AggressiveInlining)] can be used to achieve the same performance.

But the more interesting part, hence this PR, is that newer JITs (tested with .NET 8.0) don't like the manually inlined version of the code as much; when calling Normalize2 instead, it seems to get inlined either way (even without the attribute that we put as a hint for .NET 4.8), and the performance is better.

Core i7-6700k (3.2% reduction):

Method	Mean	Error	StdDev
Before	1.128 s	0.0018 s	0.0015 s
After	1.092 s	0.0011 s	0.0009 s

Apple M3 (11.6% reduction):

Method	Mean	Error	StdDev
Before	888.9 ms	16.78 ms	14.01 ms
After	785.8 ms	1.98 ms	1.85 ms

That's reduction in overall time to extract the whole archive. Tested with a smaller archive in BenchmarkDotNet and I was honestly in disbelief at first on the M3, but it's real. I also manually benchmarked using the same setup/archive as my first PR, it was a 412MB Qt 7z taking 26+ seconds, and it indeed shaved an entire 3 seconds off the extraction time.

adamhathcock

Thanks!

Optimize LZMA range decoder

14affd8

adamhathcock approved these changes Mar 24, 2025

View reviewed changes

adamhathcock merged commit 35ac2b9 into adamhathcock:master Mar 24, 2025
2 checks passed

jdpurcell deleted the pr-rangedecoderoptim branch March 24, 2025 20:39

This was referenced Aug 1, 2025

Bump CliWrap and 9 others futrime/lip#286

Open

update: Bump SharpCompress and 3 others ethan-hann/CyberRadio-Assistant#95

Merged

dependabot bot mentioned this pull request Aug 18, 2025

update: Bump SharpCompress from 0.39.0 to 0.40.0 ethan-hann/CyberRadio-Assistant#96

Merged

dependabot bot mentioned this pull request Sep 5, 2025

Bump SharpCompress from 0.38.0 to 0.40.0 Kiryuumaru/AbsolutePathHelpers#107

Merged

dependabot bot mentioned this pull request Oct 17, 2025

Bump SharpCompress from 0.32.2 to 0.41.0 mattjohnsonpint/TimeZoneConverter#171

Merged

dependabot bot mentioned this pull request Oct 27, 2025

chore: Bump SharpCompress from 0.39.0 to 0.41.0 aweXpect/aweXpect.Testably#106

Open

dependabot bot mentioned this pull request Nov 10, 2025

chore: Bump SharpCompress from 0.39.0 to 0.41.0 aweXpect/aweXpect#845

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize LZMA range decoder #910

Optimize LZMA range decoder #910

Uh oh!

jdpurcell commented Mar 22, 2025

Uh oh!

adamhathcock left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Optimize LZMA range decoder #910

Optimize LZMA range decoder #910

Uh oh!

Conversation

jdpurcell commented Mar 22, 2025

Uh oh!

adamhathcock left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants