Gerhard Wellein and Georg Hager, ISC13, Fooling the Masses with Performance Results: Old Clasic & Some New Ideas, here. Yahtzee! I have seen all these. Nice work. I have even seen we need a supercomputer to run the overnight build for the C++ framework because it takes so long.
Slow Computing 101
1.Do not use high compiler optimization levels or the latest compiler versions, because of numerical stability
2.Use fancy C++/JAVA/Python/… frameworks – they are much more maintainable and flexible
3.Scalability is still bad? Parallelize short loops with OpenMP and earn some extra bonus for a scalable hybrid code.
Time to solution?
“If I had a bigger machine, I could get the solution as fast as you want. This is of course due to the superior scalability of my code which is ready to scale on exaflop machines…..”