TechEnablement

Education, Planning, Analysis, Code

A Many-Core Implementation Of The Direct N-body Problem

October 15, 2014 by Rob Farber

Chapter 9 of High Performance Parallelism Pearls presents several optimizations that are usually necessary to obtain good performance on an Intel Xeon Phi coprocessor: introducing a softening factor, exploring the impact of single- vs. double-precision arithmetic, improving tiling, using an SoA (Structure of Arrays) layout, generating code that does not maintain IEEE precision, and using dynamic OpenMP scheduling to account for cache misses. It also explains why these optimizations were made, so that the chapter applies more broadly than the N-body kernel used to drive the presentation.

Unique to this chapter, the authors discuss how to quickly find a target for the optimization process and demonstrate that the final optimized version of the code achieves 89% of the upper-bound performance. They note that efforts to optimize for the Intel Xeon Phi coprocessor also improved the performance of the identical code running on an Intel Xeon processor.

Chapter Authors

Alejandro Duran

Alejandro Duran has been an Application Engineer for Intel Corporation for the last two years. Previously, Alex was a senior researcher in the Barcelona Supercomputing Center. He holds a PhD from the Polytechnic University of Catalonia. He has been part of the OpenMP Language committee for the past 9 years.

Larry Meadows

Larry Meadows has worked on compilers, tools, and applications software for HPC since 1982. He was a founding member of The Portland Group and has been working for Intel Corporation in Oregon for the last ten years. 

Click to see the overview article “Teaching The World About Intel Xeon Phi” that contains a list of TechEnablement links about why each chapter is considered a “Parallelism Pearl” plus information about the chapter author(s).

 
