Phoronix picked up Linus Torvalds' providing some not so gentle feedback on GCC 4.9. GCC 4.9 supports OpenMP 4.0. Apparently the latest GNU compiler is doing some silly spilling of CPU registers (including constants!) that caused a random panic in a load balance function with the in-development Linux 3.16 kernel. On a comparative note, GCC just received … [Read more...]
GCC 4.9.1 Adds OpenMP 4.0 Fortran Support for Multicore
Jakub Jelinek <jakub at redhat dot com> posted on July 16th that the GCC 4.9.1 release now supports OpenMP 4.0 in Fortran (as well as C/C++). This is great news for multi-core programmers. GCC looks to be on-track to become the opensource platform that both Intel Xeon Phi and GPU programmers can use to to test pragma based programming. As reported on techEnablement.com, … [Read more...]
GCC likely to support both OpenACC and Intel Xeon Phi Offload Pragmas in 2015
It looks like GCC will be supporting both OpenACC and Intel Xeon Phi offload pragmas in future releases. Perhaps the GNU compiler chain will become the melting pot where OpenACC and OpenMP 4.0 pragmas merge to become a single unified syntax. According to Nathan Sidwell, Director of Sourcery Services at Mentor Embedded, their OpenACC effort is working to “make the underlying … [Read more...]
Farber to Teach All-Day Tutorial At Supercomputing Nov 16 2014
Supercomputing 2014 recently approved my proposal for an all-day class "From 'Hello World' to Exascale Using x86, GPUs and Intel Xeon Phi Coprocessors" (tut106s1), at The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC14). I hope to see you on Sunday November 16, 2014 in New Orleans,! Abstract Both GPUs and Intel Xeon Phi … [Read more...]
Pragma Puzzler – Ambiguous Loop Trip Count in OpenMP and OpenACC
Pragma-based programming can be described as a "negotiation" with the compiler where the compiler has to assume corner-cases that are not apparent to the programmer. So why does the loop count in the OpenMP and OpenACC article, "A First Transparent OpenACC C++ Class" have to be assigned to a separate variable to generate a parallel … [Read more...]