Oak Ridge National Laboratory has announced three GPU Hackathons for 2015. The first will be hosted April 20-24 by the National Center for Supercomputing Applications on the UIUC campus. The second will be hosted by the Swiss National Supercomputing Centre in Lugano, Switzerland, from July 6-10. The final one will be hosted by the Oak Ridge Leadership Computing Facility on … [Read more...]
Free IEEE OpenACC Webinar Using the PGI Compiler
Register here to view the Dec. 11, 2014 IEEE webinar on OpenACC by Michael Wolfe, a compiler engineer at PGI. He presents the latest PGI support for C++ features and looks at the roadmap for more complete PGI OpenACC support. Michael also shows some significant performance enhancements that should benefit all OpenACC programmers. He closes with a short … [Read more...]
XSEDE Feb. 6 OpenACC Workshop Telecast to Various Locations
XSEDE, along with the Pittsburgh Supercomputing Center and the National Center for Supercomputing Applications at the University of Illinois, will present an OpenACC GPU programming workshop on February 6, 2015. OpenACC is the accepted standard that uses compiler directives to allow quick development of GPU-capable codes using standard languages and compilers. It has been … [Read more...]
ORNL Introductory Tutorials On Concurrent Kernels
The OLCF at Oak Ridge National Laboratory (ORNL) is working to educate users about how best to use its computing resources. As part of that process, the OLCF has published two introductory tutorials that teach how to use concurrent kernels on its systems. Part 1 (concurrent kernels) and Part 2 (batched library calls) teach how to launch concurrent kernels using CUDA … [Read more...]
IPMACC – An Open Source OpenACC to CUDA/OpenCL Translator
IPMACC is a research-grade open-source framework for translating OpenACC source code to CUDA or OpenCL. Binary executables can then be created with OpenCL or CUDA compilers. The authors (Ahmad Lashgar - University of Victoria, Alireza Majidi - Texas A&M University, Amirali Baniasadi - University of Victoria) verified correctness and performance using benchmarks from … [Read more...]
AMD and PathScale Join OpenACC Standards Committee
AMD and PathScale announced at SC14 that they have joined the OpenACC standards committee. OpenACC provides an efficient and performance-portable path for developing massively parallel programs across a wide range of accelerators, including GPUs, many-core coprocessors and multi-core CPUs. OpenACC has been gaining traction for parallel programming. Such a move appears … [Read more...]
Analysis of Phylogenetic Tree Code Shows OpenACC Within 10% Of Native CUDA
The paper "Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison", by authors at the University of Barcelona and the Intel Barcelona Research Center, claims that OpenACC can achieve near-CUDA performance (within 10%) when accelerating a phylogenetic tree code based on the popular MrBayes Markov chain Monte Carlo (MCMC) package. Comparing with state-of-art … [Read more...]
Houston Workshop: Directives and Tools for Accelerators: A Seismic Programming Shift
With space for 70-80 participants, those who wish to attend the free University of Houston workshop, "Directives and Tools for Accelerators: A Seismic Programming Shift", must register by October 10th, 2014. The workshop is a full-day event on October 20th, 2014, with a preceding welcome reception on October 19th, 2014. This workshop is organized by the HPC Tools group in the … [Read more...]
OpenACC Compilers Deliver 85% of the Performance of Hand-Optimized Code
Directive-based compilers offer both portability and the ability to optimize code for specific platforms such as GPUs and CPUs. A recent LCPC14 paper, "Directive-Based Compilers for GPUs", by Swapnil Ghike, Ruben Gran, Maria J. Garzaran, and David Padua at the University of Illinois at Urbana-Champaign, found that OpenACC code generated by the PGI and Cray OpenACC compilers achieved … [Read more...]
Mix OpenACC and CUDA (including Thrust)
The NVIDIA Parallel ForAll blog shows how to mix OpenACC and CUDA (including Thrust) with the host_data construct, the deviceptr clause, and the acc_map_data() API function. … [Read more...]
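As a rough illustration of two of the features named above, the following minimal C sketch pairs the host_data construct with the deviceptr clause. It is not taken from the Parallel ForAll post: the function names `scale_on_device` and `scale_array` are hypothetical, and `scale_on_device` merely stands in for a CUDA kernel wrapper or library routine that expects a device pointer. acc_map_data() is not covered here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a CUDA or library routine that expects a
   device pointer. The deviceptr clause tells the OpenACC compiler that
   d_x already holds a device address, so no data movement is generated. */
void scale_on_device(float *d_x, int n, float a)
{
    assert(d_x != NULL);
    #pragma acc parallel loop deviceptr(d_x)
    for (int i = 0; i < n; ++i) {
        d_x[i] *= a;
    }
}

/* Hypothetical driver: copies x to the device, scales it there, and
   copies the result back when the data region ends. */
void scale_array(float *x, int n, float a)
{
    #pragma acc data copy(x[0:n])
    {
        /* Inside host_data, the name x refers to the device copy's
           address, which is what scale_on_device expects. */
        #pragma acc host_data use_device(x)
        {
            scale_on_device(x, n, a);
        }
    }
}
```

Calling `scale_array(v, 4, 2.0f)` on an array holding 1, 2, 3, 4 should leave it holding 2, 4, 6, 8. When the file is built without OpenACC support, the directives are ignored and the loop simply runs serially on the host with the same result.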









