[4.2] Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178) #16228

stephentyrone · 2018-04-27T23:41:24Z

Cherry picking @tbkka 's change from master for 4.2:

This collects a number of changes I've been testing over the
last month.

Bug fix: The single-precision float formatter did not always
round the last digit even in cases where there were two
possible outputs that were otherwise equally good.
Algorithm simplification: The condition for determining
whether to widen or narrow the interval was more complex than
necessary. I now simply widen the interval for all even
significands.
Code simplification: The single-precision float formatter now uses fewer
64-bit features. This eliminated some 32-bit vs. 64-bit conditionals in
exchange for a minor loss of performance (~2%).
Minor performance tweaks: Steve Canon pointed out a few places
where I could avoid some extraneous arithmetic.

I've also rewritten a lot of comments to try to make the exposition
clearer.

The earlier testing regime focused on testing from first
principles. For example, I verified accuracy by feeding the
result back into the C library strtof, strtod, etc. and
checking round-trip exactness. Unfortunately, this approach
requires many checks for each value, limiting test performance.
It's also difficult to validate last-digit rounding.

For this round of updates, I've instead compared the digit
decompositions to other popular algorithms:

David M. Gay's gdtoa library is a robust and well-tested
implementation based on Dragon4. It supports all formats, but
is slow. (netlib.org/fp)
Grisu3 supports Float and Double. It is fast but incomplete,
failing on about 1% of all inputs.
(github.com/google/double-conversion)
Errol4 is fast and complete but only supports Double. The
repository includes an implementation of the enumeration
algorithm described in the Errol paper.
(github.com/marcandrysco/errol)

The exact tests varied by format:

Float: SwiftDtoa now generates the exact same digits as gdtoa
for every single-precision Float.
Double: Testing against Grisu3 (with fallback to Errol4 when
Grisu3 failed) greatly improved test performance. This
allowed me to test 100 trillion (10^14) randomly-selected
doubles in a reasonable amount of time. I also checked all
values generated by the Errol enumeration algorithm.
Float80: I compared the Float80 output to the gdtoa library
because neither Grisu3 nor Errol4 yet supports 80-bit extended
precision. All values generated by the Errol enumeration
algorithm have been checked, as well as several billion
randomly-selected values.

This collects a number of changes I've been testing over the last month. * Bug fix: The single-precision float formatter did not always round the last digit even in cases where there were two possible outputs that were otherwise equally good. * Algorithm simplification: The condition for determining whether to widen or narrow the interval was more complex than necessary. I now simply widen the interval for all even significands. * Code simplification: The single-precision float formatter now uses fewer 64-bit features. This eliminated some 32-bit vs. 64-bit conditionals in exchange for a minor loss of performance (~2%). * Minor performance tweaks: Steve Canon pointed out a few places where I could avoid some extraneous arithmetic. I've also rewritten a lot of comments to try to make the exposition clearer. The earlier testing regime focused on testing from first principles. For example, I verified accuracy by feeding the result back into the C library `strtof`, `strtod`, etc. and checking round-trip exactness. Unfortunately, this approach requires many checks for each value, limiting test performance. It's also difficult to validate last-digit rounding. For this round of updates, I've instead compared the digit decompositions to other popular algorithms: * David M. Gay's gdtoa library is a robust and well-tested implementation based on Dragon4. It supports all formats, but is slow. (netlib.org/fp) * Grisu3 supports Float and Double. It is fast but incomplete, failing on about 1% of all inputs. (github.com/google/double-conversion) * Errol4 is fast and complete but only supports Double. The repository includes an implementation of the enumeration algorithm described in the Errol paper. (github.com/marcandrysco/errol) The exact tests varied by format: * Float: SwiftDtoa now generates the exact same digits as gdtoa for every single-precision Float. * Double: Testing against Grisu3 (with fallback to Errol4 when Grisu3 failed) greatly improved test performance. This allowed me to test 100 trillion (10^14) randomly-selected doubles in a reasonable amount of time. I also checked all values generated by the Errol enumeration algorithm. * Float80: I compared the Float80 output to the gdtoa library because neither Grisu3 nor Errol4 yet supports 80-bit extended precision. All values generated by the Errol enumeration algorithm have been checked, as well as several billion randomly-selected values.

stephentyrone · 2018-04-27T23:41:53Z

@swift-ci Please smoke test

airspeedswift · 2018-04-28T00:07:04Z

I think you may need full test for 4.2 PRs

airspeedswift · 2018-04-28T00:38:15Z

@swift-ci please test

airspeedswift · 2018-04-30T17:55:30Z

@swift-ci please smoke test Linux platform

stephentyrone · 2018-05-04T01:37:38Z

@swift-ci please smoke test Linux

stephentyrone · 2018-05-07T17:44:52Z

@airspeedswift OK to merge this?

airspeedswift · 2018-05-07T20:48:30Z

Yep

stephentyrone requested a review from airspeedswift April 27, 2018 23:41

stephentyrone changed the title ~~Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178)~~ [4.2] Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178) Apr 27, 2018

airspeedswift approved these changes Apr 28, 2018

View reviewed changes

airspeedswift merged commit 0e6d867 into swift-4.2-branch May 7, 2018

stephentyrone deleted the float-format-update-4.2 branch June 21, 2018 01:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[4.2] Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178) #16228

[4.2] Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178) #16228

Uh oh!

stephentyrone commented Apr 27, 2018

Uh oh!

stephentyrone commented Apr 27, 2018

Uh oh!

airspeedswift commented Apr 28, 2018

Uh oh!

airspeedswift commented Apr 28, 2018

Uh oh!

airspeedswift commented Apr 30, 2018

Uh oh!

stephentyrone commented May 4, 2018

Uh oh!

stephentyrone commented May 7, 2018

Uh oh!

airspeedswift commented May 7, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[4.2] Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178) #16228

[4.2] Updates to Floating-point printing code (SwiftDtoa.cpp) (#16178) #16228

Uh oh!

Conversation

stephentyrone commented Apr 27, 2018

Uh oh!

stephentyrone commented Apr 27, 2018

Uh oh!

airspeedswift commented Apr 28, 2018

Uh oh!

airspeedswift commented Apr 28, 2018

Uh oh!

airspeedswift commented Apr 30, 2018

Uh oh!

stephentyrone commented May 4, 2018

Uh oh!

stephentyrone commented May 7, 2018

Uh oh!

airspeedswift commented May 7, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants