• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Featured news / Paper Compares AMD, NVIDIA, Intel Xeon Phi CFD Turbulent Flow Mesh Performance Using OpenMP and OpenCL

Paper Compares AMD, NVIDIA, Intel Xeon Phi CFD Turbulent Flow Mesh Performance Using OpenMP and OpenCL

August 10, 2014 by Rob Farber Leave a Comment

Timely for Siggraph 2014 (because animations use meshes) and food-for-thought for CFD (Computational Fluid Dynamics) research, the paper by A. Gorobets, F.X. Trias, R. Borrell, G. Oyarzún and A. Oliva, “Direct Numerical Simulation of Turbulent Flows with Parallel Algorithms for Various Computing Architectures” considers structured and unstructured meshes for incompressible flows and overlapped MPI communication patterns. The authors are quite clear in their performance conclusions about their new algorithm, “Performance comparison for basic OpenCL kernels of the algorithm on unstructured meshes showed that the different GPUs considered substantially outperform Intel Xeon Phi accelerator. Also, the AMD GPU tends to be more efficient than NVIDIA on heavy computing kernels.” Such performance comparisons -when fairly performed on generally usable kernels  –  provide valuable information to help cut through all the marketing. For more information see http://termofluids.com/.

Abstract

The purpose of the work is twofold. Firstly, it is devoted to the development of efficient parallel algorithms for large-scale simulations of turbulent flows on different
supercomputer architectures. It reports experience with massively-parallel accelerators including graphics processing units of AMD and NVIDIA and Intel Xeon Phi coprocessors.
Secondly, it introduces new series of direct numerical simulations of incompressible turbulent flows with heat transfer performed with the considered algorithms.

CFDersults_fig4

Figure 4 (click to read paper)

Summary (Exerpt)

Parallel finite-volume algorithms for large-scale simulations of turbulent flows have been considered. Simple approaches for OpenMP parallelization aimed at hundreds of threads have
been presented for both structured and unstructured meshes. High internal speedups ensuring good level of parallelism demonstrated on Intel Xeon Phi surprisingly coexist with relatively
poor net performance comparable with 8-core CPU. Implementation of overlapped communications with an OpenCL task scheduler infrastructure has demonstrated promising
results. Performance comparison for basic OpenCL kernels of the algorithm on unstructured meshes showed that the different GPUs considered substantially outperform Intel Xeon Phi
accelerator. Also, the AMD GPU tends to be more efficient than NVIDIA on heavy computing kernels.

For more information:

  • http://termofluids.com/

 

Share this:

  • Twitter

Filed Under: Featured news, News, News, News, OpenCL, Xeon Phi Tagged With: GPU, HPC, Intel Xeon Phi, NVIDIA, OpenCL, OpenMP

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Part 1: Load-Balanced, Strong-Scaling Task-Based Parallelism on GPUs
  • About
  • Rob Farber
  • Teaching The World About Intel Xeon Phi
  • Guide to Get Ubuntu 14.10 Running Natively on Nvidia Shield Tablet

Archives

© 2026 · techenablement.com