• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Featured news / PGI 14.4 Release Contains Much OpenACC C++ Goodness

PGI 14.4 Release Contains Much OpenACC C++ Goodness

April 25, 2014 by Rob Farber Leave a Comment

PGI released their 14.4 and upcoming 14.7 OpenACC 2.0 roadmap. The expectation is that we will see the 14.4 release in early May and the 14.7 release in early July. Note: these are not official PGI dates.

Analysis:

  • The 14.4 support of atomic operations will enable many low-wait algorithms such as counters and massively parallel stacks.
  • Improved reduction performance in 14.7 should speed many scientific codes including machine learning algorithms and allow OpenACC to deliver similar performance to CUDA such as 13 PF/s on the ORNL Titan supercomputer.

New in PGI 14.4

  • PGI Accelerator Features and Enhancements
    • Expanded OpenACC C++ Support
    • C++ this pointer support
    • C++ member functions
    • C++ support for the Routine directive
    • C++ class member arrays in data clauses
  • Expanded OpenACC 2.0 Features
    • Loop directive collapse clause on deeply nested loops
    • Parallel directive firstprivate clause
    • C structs/Fortran derived type member arrays in data clauses
    • Partial support for Fortran and C/C++ atomic directives
    • Calling C/C++ CUDA-style atomics from OpenACC
    • Fortran common block names in OpenACC data clauses
  • GPU-side debugging in OpenACC with Allinea DDT
  • CUDA Fortran support for CUDA 5.5 batched cuBLAS routines
  • Integrated CUDA 6 Toolkit
  • PGI Multi-core Features and Enhancements
    • 2% improvement in SPEC OMP 2012 performance compared to 14.1 on Intel Sandy Bridge processors
    •  Support for new AVX2 instructions available on the latest Haswell CPUs from Intel; updated Windows assembler
    • New EDG C++ front-end with C++11 support
  •  Other Features and Enhancements
    • Comprehensive support for environment modules
    • New tutorials and expanded set of examples
    • Prebuilt versions of NetCDF and HDF5

Planned for 14.7

  • OpenACC 2.0 Declare directive link clause support
  • OpenACC 2.0 Loop directive tile and auto clause support
  • Support for CUDA managed data in both OpenACC and CUDA Fortran
  • Support for reductions inside Routines
  • Support for reductions of COMPLEX data types
  • Cache clause tuning

 

Share this:

  • Twitter

Filed Under: Featured news, News, News, openacc Tagged With: GPU, machine-learning, NVIDIA, Nvidia Tesla, openacc, x86

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Intel Broadwell Compute Gen8 GPU Architecture
  • Recovering Speech from a Potato-chip Bag Viewed Through Soundproof Glass - Even With Commodity Cameras!
  • MultiOS Gaming, Media, and OpenCL Using XenGT Virtual Machines On Shared Intel GPUs
  • NVIDIA GTC 2015 keynote - Near-term Roadmap is Deep-Learning
  • LibreOffice OpenCL Acceleration for the Masses - Intel vs. AMD GPU performance

Archives

© 2025 · techenablement.com