-
Notifications
You must be signed in to change notification settings - Fork 7
cmake: use -flto=auto compiler flag when supported, rework fast-math disablement #80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Changes from 1 commit
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -95,6 +95,9 @@ if (MSVC) | |
| # and https://devblogs.microsoft.com/cppblog/the-fpcontract-flag-and-changes-to-fp-modes-in-vs2022/ | ||
| # By default, MSVC doesn't enable the /fp:fast option. | ||
| set_cxx_flag("/fp:fast") | ||
| else() | ||
| # Precise model (/fp:precise) should do safe contractions, but we should not trust that (see below). | ||
| set_cxx_flag("/fp:strict") | ||
| endif() | ||
|
|
||
| if (USE_LTO) | ||
|
|
@@ -122,14 +125,34 @@ else() | |
| set_cxx_flag("-O3" RELWITHDEBINFO) | ||
| endif() | ||
|
|
||
| try_cxx_flag(FNO_MATH_ERRNO "-fno-math-errno") | ||
|
|
||
| if (USE_FAST_MATH) | ||
| # By default, GCC uses -ffp-contract=fast with -std=gnu* and uses -ffp-contract=off with -std=c*. | ||
| # See https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html | ||
| # By default, GCC doesn't enable the -ffast-math option. | ||
| set_cxx_flag("-ffast-math -fno-math-errno -ffp-contract=fast") | ||
| try_cxx_flag(FFAST_MATH "-ffast-math") | ||
|
|
||
| # GCC. | ||
| try_cxx_flag(FFP_CONTRACT_FAST "-ffp-contract=fast") | ||
| # Clang. | ||
| try_cxx_flag(FFP_MODEL_FAST "-ffp-model=agressive") | ||
| # ICC. | ||
| try_cxx_flag(FP_MODEL_FAST_2 "-fp-model=fast=2") | ||
| try_cxx_flag(QSIMD_HONOR_FP_MODEL "-qsimd-honor-fp-model") | ||
| else() | ||
| try_cxx_flag(FNO_FAST_MATH "-fno-fast-math") | ||
|
|
||
| # By default, GCC uses -ffp-contract=fast with -std=gnu* and uses -ffp-contract=off with -std=c*. | ||
| # By default, GCC uses -std=gnu* and then enables -ffp-contract=fast even if -ffast-math is not enabled. | ||
| set_cxx_flag("-ffp-contract=off") | ||
| # See https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html | ||
| # GCC fast contractions (-ffp-contract=fast) should be safe, but aren't on arm64 with GCC 12. | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What do you mean by "aren't safe"? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I should have been more verbose at the time because I forgot the exact issue. I guess it meant it doesn't produce the same result as compiling without any fast stuff. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Given the purpose of my patch is to maximize the chance the files are reproducible, I probably noticed that on ARM such option broke the reproducibility. It's probably a similar problem than using x87 instead of SSE on x86, maybe some ARM fused operations break IEEE compliance. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. For sure, since I mentioned a specific GCC version, that was the result of me testing that specific compiler on the said hardware, and I was testing the reproducibility of converted files. GCC 12 is the Debian Bookworm GCC, and I use Debian Bookworm on my Arm boards. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That's unobvious so it should be explained in the comment. As we've discussed in the past, I don't think floating point reproducibility is a good goal -- the language and compilers don't make any attempt to provide such guarantees. Platforms using x87 floating point are an easy example where you're not going to achieve it. But for GCC/Clang those options to disable fast math are good in any case, since fast math makes the software too unreliable. |
||
| # Clang precise contractions (-ffp-contract=precise) should be safe, but aren't on arm64 with Clang 14. | ||
|
|
||
| # GCC. | ||
| try_cxx_flag(FFP_CONTRACT_OFF "-ffp-contract=off") | ||
| # Clang | ||
| try_cxx_flag(FFP_MODEL_STRICT "-ffp-model=strict") | ||
| # ICC. | ||
| try_cxx_flag(FP_MODEL_STRICT "-fp-model=strict") | ||
| try_cxx_flag(QSIMD_HONOR_FP_MODEL "-qsimd-honor-fp-model") | ||
| endif() | ||
|
|
||
| # It should be done at the very end because it copies all compiler flags | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/fp:precisewould be a better default for MSVC. The documentation says/fp:strictis mostly needed if you want floating point exceptions. I did a little test with 57 PNG files and got246.162s(self-reported) with master and259.846swith this branch.