Fix #423: change -ffast-math to -mtune=native #432

ftessier · 2018-04-19T14:52:28Z

Update the default gcc optimization configuration to use -march=native
instead of -ffast-math. The latter causes various floating-point
exceptions on newer cpus and compilers. If the programs are run on a
different cpu, then one should use the corresponding -march option for
that architecture instead of "native", or else use the less aggressive
-mtune=native if the compiling and running cpus are in the same family.

The alternatives -march and -mtune were suggested by @Kawrakow in
issue #174.

See also #85, #91, #174, #204 and #423.

marenaud · 2018-04-19T14:56:52Z

You should be mindful that this change will break some cluster installations of EGS. For example, many users on our cluster install/compile their own version of EGS in their home area, which is mounted on an NFS partition and shared across a heterogeneous cluster (5 nodes, 3 different generations of CPU architectures). Compiling with march=native will produce a binary that is only "valid" for the CPU architecture of the node that the user was logged in on, not other compute nodes.

In this setup, it wouldn't really be possible to compile with different architectures as the binaries would just be overwritten since it's all the same mount. (mtune will work, though I'm curious about the performance penalty of losing ffast-math and only using mtune or doing march=oldest_cpu_in_cluster)

I predict many headaches for people in charge of their department's clusters as new users install EGS from the github repo and have code that runs fine interactively but crashes when they submit batch jobs to their cluster.

ftessier · 2018-04-19T15:50:29Z

Very good point @marenaud. Then I suggest settling for -mtune=native (or even -mtune=generic) which is backward compatible within cpu families, or else just leave out -ffast-math and -m options entirely in the default configuration (and provide some guidance about these optimization options in the installation instructions). At any rate -ffast-math does not seem like a safe default any more.

marenaud · 2018-04-19T15:58:53Z

Agreed, alternatively we could also look at which flag is causing issues. According to the gcc website, -ffast-math is equivalent to:

-fno-trapping-math, -funsafe-math-optimizations, -ffinite-math-only, -fno-errno-math, -fno-signaling-nans, -fno-rounding-math, -fcx-limited-range and -fno-signed-zeros

If we're lucky, maybe only one of these is causing all the trouble. Not sure I would count on that though...

ftessier · 2018-04-19T16:11:19Z

I would not count on that either! I think -mtune=native is a reasonable default, since an environment with different cpu families is unlikely, and one must then anyways handle multiple executables. I will update this pull request with -mtune.

ftessier · 2018-04-19T17:02:33Z

Just added -mtune=native to the default fortran optimization options in the configure script.

mainegra · 2018-09-26T14:13:06Z

@ftessier this must be also added to the config GUI!

ftessier · 2018-09-26T15:44:56Z

I think it has, see changes in egs_tools.cpp. Can you check if I missed anything?

Update the default gcc optimization configuration to -mtune=native instead of -ffast-math. The latter causes various floating-point exceptions on newer cpus and compilers. Note that if everything is compiled and run on identical cpu, then the more aggressive -march=native option should be considered during configuration. Change the default optimization level to -O2 instead of -O3. There have been cases where upgrading to a newer compiler revealed bugs under -O3, and more aggressive optimization does not always lead to increased performance. The -O2 option is a better default, and another level can be selected at configuration time. Also add a test in the Fortran compiler version check to catch the gfortran version string, and fix a duplicate echo for the default fortran debugger flag.

ftessier · 2018-09-26T17:58:12Z

Also changed the default optimization level from -O3 to -O2

ftessier requested review from mainegra, rtownson and blakewalters April 19, 2018 14:52

ftessier self-assigned this Apr 19, 2018

ftessier added installation compilation labels Apr 19, 2018

ftessier force-pushed the fix-fast-math branch 2 times, most recently from b30d5f7 to 2641e83 Compare April 19, 2018 17:01

rtownson approved these changes Apr 19, 2018

View reviewed changes

rtownson changed the title ~~Fix #423: change -ffast-math to -march=native~~ Fix #423: change -ffast-math to -mtune=native Apr 19, 2018

mainegra approved these changes Apr 24, 2018

View reviewed changes

ftessier force-pushed the fix-fast-math branch 2 times, most recently from b3a7f71 to d4c5d10 Compare September 26, 2018 13:41

blakewalters approved these changes Sep 26, 2018

View reviewed changes

ftessier force-pushed the fix-fast-math branch from d4c5d10 to b41079d Compare September 26, 2018 17:51

ftessier merged commit 37650ea into develop Sep 26, 2018

ftessier deleted the fix-fast-math branch September 26, 2018 17:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix #423: change -ffast-math to -mtune=native #432

Fix #423: change -ffast-math to -mtune=native #432

ftessier commented Apr 19, 2018

marenaud commented Apr 19, 2018 •

edited

Loading

ftessier commented Apr 19, 2018 •

edited

Loading

marenaud commented Apr 19, 2018 •

edited

Loading

ftessier commented Apr 19, 2018

ftessier commented Apr 19, 2018

mainegra commented Sep 26, 2018

ftessier commented Sep 26, 2018

ftessier commented Sep 26, 2018

Fix #423: change -ffast-math to -mtune=native #432

Fix #423: change -ffast-math to -mtune=native #432

Conversation

ftessier commented Apr 19, 2018

marenaud commented Apr 19, 2018 • edited Loading

ftessier commented Apr 19, 2018 • edited Loading

marenaud commented Apr 19, 2018 • edited Loading

ftessier commented Apr 19, 2018

ftessier commented Apr 19, 2018

mainegra commented Sep 26, 2018

ftessier commented Sep 26, 2018

ftessier commented Sep 26, 2018

marenaud commented Apr 19, 2018 •

edited

Loading

ftessier commented Apr 19, 2018 •

edited

Loading

marenaud commented Apr 19, 2018 •

edited

Loading