• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Archives for OpenMP

PathScale Supports OpenMP4 and OpenACC On NVIDIA, AMD, ARMv8, Plus Some CUDA

June 4, 2015 by Rob Farber Leave a Comment

PathScale is capitalizing on over six years of development with the release of ENZO 2015, a compiler suite that allows C/C++/Fortran parallel programs to be built for a variety of hardware platforms from a single OpenACC or OpenMP4 source base. The PathScale compilers can generate executables for AMD GPUs, and NVIDIA GPUs as well as ARMv8 and x86 instruction sets. As a result, … [Read more...]

GCC5 Release Candidate in Early April With OpenACC, Cilk, and OpenMP4 Offload!

March 20, 2015 by Rob Farber Leave a Comment

Richard Biener posted a message to gcc.gnu.org to expect a GCC 5 release candidate the first week of April. GCC5 supports OpenACC, Cilk Plus, and OpenMP 4.0 offload support. "We've come a long way towards the release criteria of zero P1 bugs. There are still a few remaining P1s though and we are targeting for a GCC 5 release candidate in the first week of April (given those … [Read more...]

Concise Comparision Adds OpenMP Versus OpenACC To CUDA Versus OpenCL Debates

March 4, 2015 by Rob Farber Leave a Comment

The debate over OpenMP versus OpenACC for manycore and heterogeneous computing is starting to heat up. Michael Wong (CEO of OpenMP Corp), Barbara Chapman (Univ. Houston), and Yonghong Yan (Assistant Prof. Univ. Oakland and OpenMP ARB representative) have written a nice, quick read, comparative article on HPCWire: "A Comparison of Heterogeneous and Manycore … [Read more...]

Profiling Guided Optimization On Intel Xeon Phi

November 5, 2014 by Rob Farber Leave a Comment

This chapter in High Performance Parallelism Pearls by Andrey Vladimirov focuses on the use of Intel VTune Amplifier XE reports to understand where to apply optimization on matrix transposition, a small and self-contained workload of great practical value. The optimization process applied to the code relies exclusively on programming in a high-level language plus utilization of … [Read more...]

The Unabridged Chapter 1 Introduction To High Performance Parallelism Pearls

October 3, 2014 by Rob Farber Leave a Comment

Following is the full, unabridged text of the chapter 1 introduction  (written by James Reinders) to High Performance Parallelism Pearls. Thanks to Morgan Kaufmann, James Reinders, and Jim Jeffers for giving permission so TechEnablment can make this available. After reading what James wrote, you will see that summarizing the introduction would simply have left out too much … [Read more...]

Teaching The World About Intel Xeon Phi

September 30, 2014 by Rob Farber Leave a Comment

The newest book by James Reinders and Jim Jeffers, “High Performance Parallelism Pearls” distills the experience of sixty-nine HPC experts into twenty-eight chapters designed to teach the world about the performance capabilities of the massively-parallel Intel® Xeon Phi™ family of products. Source code for numerous working examples selected for their educational content, … [Read more...]

Berkeley Online and Onsite 2014 Short Course on Parallel Programming – Aug. 18

August 15, 2014 by Rob Farber Leave a Comment

Monday August 18, 2014 the Berkeley EECS ( Electrical Engineering and Computer Sciences, COLLEGE OF ENGINEERING, UC Berkeley) will provide an on-site and on-line introduction to parallel architectures and programming issues, a thorough exposure to languages and tools for shared memory programming, including hands-on experience, a presentation of high level programming parallel … [Read more...]

Paper Compares AMD, NVIDIA, Intel Xeon Phi CFD Turbulent Flow Mesh Performance Using OpenMP and OpenCL

August 10, 2014 by Rob Farber Leave a Comment

Timely for Siggraph 2014 (because animations use meshes) and food-for-thought for CFD (Computational Fluid Dynamics) research, the paper by A. Gorobets, F.X. Trias, R. Borrell, G. Oyarzún and A. Oliva, "Direct Numerical Simulation of Turbulent Flows with Parallel Algorithms for Various Computing Architectures" considers structured and unstructured meshes for incompressible … [Read more...]

Hot At Siggraph 2014 – Multithreading for Visual Effects

August 9, 2014 by Rob Farber Leave a Comment

The ACM Siggraph conference starts tomorrow, August 10th, in Vancouver, BC. Multithreading for visual effects coupled with massive parallelism has changed the visual effects industry forever. A new book capturing that trend titled, appropriately enough, "Multithreading for Visual Effects", is an excellent source of information about how visual effects are currently performed at … [Read more...]

SC14 Technical Program and Registration – XSEDE/TACC Resources for Farber Tutorial

July 28, 2014 by Rob Farber Leave a Comment

Register early for Supercomputing 2014 in New Orleans and save up to $275. View the Technical Program online (and register for our tutorial!) The Technical Program fee includes  admission to all conference sessions, exhibits, the Monday night Exhibits opening event, Thursday night event, and one copy of the SC14 proceedings. Click here to view the grid showing access to … [Read more...]

Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Face It: AI Gets Personal to Make You Look Better!
  • Intel tutorial shows how to view OpenCL assembly code
  • Intel Xeon Phi Optimization Part 1 of 3: Multi-Threading and Parallel Reduction
  • LibreOffice OpenCL Acceleration for the Masses - Intel vs. AMD GPU performance
  • The CUDA Thrust API Now Supports Streams and Concurrent Tasks

Archives

© 2021 · techenablement.com