• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Featured article / Profiling Guided Optimization On Intel Xeon Phi

Profiling Guided Optimization On Intel Xeon Phi

November 5, 2014 by Rob Farber Leave a Comment

This chapter in High Performance Parallelism Pearls by Andrey Vladimirov focuses on the use of Intel VTune Amplifier XE reports to understand where to apply optimization on matrix transposition, a small and self-contained workload of great practical value. The optimization process applied to the code relies exclusively on programming in a high-level language plus utilization of the OpenMP framework. The result is portable code that can run on both CPU (processor) and MIC (coprocessor) architectures, and can be recompiled for future generations of Intel architectures.

Cover3D-fs8

Through VTune, the performance monitoring functionality of Intel Xeon Phi coprocessors is showcased to not only detect bottlenecks, but also point out overall performance issues and possible resolution methods. Based on the reports from the profiling tool, the programmer learns where to make changes in the source code to improve application performance. In short, this chapter demonstrates the familial relationship between Intel processors and coprocessors plus the Intel software development tools.

VTune can  identify which  branch of compiled code is taken most frequently at runtime

VTune can identify which branch of compiled code is taken most frequently at runtime. (Image courtesy Morgan Kaufmann)

Chapter Author

Andrey Vladimirov

Andrey Vladimirov

Andrey Vladimirov, PhD, is the Head of HPC Research at Colfax International. His primary research interest is the application of modern computing technologies to computationally demanding scientific problems. Prior to joining Colfax, Andrey was involved in theoretical astrophysics research at the Ioffe Institute (Russia), North Carolina State University, and Stanford University (USA), where he studied cosmic rays, collisionless plasmas and the interstellar medium using computer simulations.

Click to see the overview article “Teaching The World About Intel Xeon Phi” that contains a list of TechEnablement links about why each chapter is considered a “Parallelism Pearl” plus information about James Reinders and Jim Jeffers, the editors of High Performance Parallelism Pearls.

 

Share this:

  • Twitter

Filed Under: Featured article, Featured news, News, Xeon Phi Tagged With: HPC, Intel, Intel Xeon Phi, OpenMP, x86

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Depth-Categorizing GPU-Accelerated Deep Neural Networks Perform Fast Semantic Segmentation of RGB-D Scenes
  • Plesiochronous (Loosely Synchronous) Phasing Barriers To Avoid Thread Inefficiencies
  • Guide to Get Ubuntu 14.10 Running Natively on Nvidia Shield Tablet
  • NVIDIA GTC 2015 keynote - Near-term Roadmap is Deep-Learning
  • Free Online OpenACC Course Starting Oct. 1 2015

Archives

© 2026 · techenablement.com