• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Featured news / Intel Xeon Phi Provides Cambridge 30x Speedup in Production COSMOS WALLS Code

Intel Xeon Phi Provides Cambridge 30x Speedup in Production COSMOS WALLS Code

October 10, 2014 by Rob Farber Leave a Comment

Professor Paul Shellard, the COSMOS Director at Cambridge University reports a 30x speedup of the heavily utilized production WALLS code and he notes “Our expectation is that all our cosmological field theory codes, like WALLS, will have similarly large speed-ups when optimized and ported to Xeon Phi.”  Currently the project is transferring a larger portion of the CMB analysis to Intel Xeon Phi coprocessors over the next twelve months.

Professor E.P.S. Shellard

Professor E.P.S. Shellard

COSMOS has proved essential to the CMB, particularly the Planck satellite maps of the entire sky. Professor Shellard states, “The magnitude of data is manageable, but the computational effort required to extract scientific information from it is formidable”. For example, when researchers look at the statistics described by the three-point correlation function, they need to add up all the contributions from all the possible triangles we can draw within the 10 million pixels in a single Planck map. Naively, this is 1021 sets of complex operations! These calculations need to be simulated and repeated many times to test and eliminate systematic experimental effects, so brute force methods are not possible. Just to calculate the three-point correlator data required 3 million core-hours. On a single core that would take over 300 years. We needed to be able to solve it within 300 days.

Using COSMOS, researchers have been able to rapidly develop and implement methods to calculate the complete three-point correlator for the first time, up to a given resolution. This is a statistic that looks at “triangles in the sky”, testing whether there is a connection between sets of three different points in the Universe. Porting the computational “hot spots” in our Planck pipeline to the Intel Xeon Phi coprocessors has greatly shortened turnaround times, which will ensure we substantially improve resolution in the future. We could not begin to analyse the Planck maps, let alone previous generations of experiments, without COSMOS.

 Another major project being undertaken on COSMOS is the study of spectroscopic signatures of particular molecules in exoplanet atmospheres (the EXOMOL project). These signatures can help in the identification of exoplanets and whether or not there is extra-terrestrial life there.

Share this:

  • Twitter

Filed Under: Featured news, News, News, Xeon Phi Tagged With: HPC, Intel, Intel Xeon Phi, x86

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Rob Farber
  • NVIDIA GTC'17 announcements make them a complete 'soup to nuts' solution for specialized deep-learning applications
  • Altera OpenCL Programmable FPGA Talks QPI, HMC, and 100G Optical Interconnect
  • Plesiochronous (Loosely Synchronous) Phasing Barriers To Avoid Thread Inefficiencies
  • AMD Firepro S9150 5 TF/s Single, 2.5 TF/s Double-Precision GPU and OpenCL 1.2 Support

Archives

© 2026 · techenablement.com