The problem of jointly allocating computations and data is a known NP-hard problem. A heuristic proposed by MIT researchers Nathan Beckmann, Po-An Tsai, and Daniel Sanchez recently the best-paper award at the IEEE Symposium on High-Performance Computer Architecture for a place-and-route algorithm that runs in milliseconds and finds a solution that is more than 99 percent as … [Read more...]
Power Profiling Shows Simple Changes To Save Megawatts of Power On Leadership Supercomputers
A challenge with profiling applications lies in how to interpret the profile results. In particular, most programmers do not give the power profile plots more than a cursory glance. Following is an example waterfall plot showing the power utilization for an NWChem run on Intel Xeon Phi coprocessors: My recent column in Scientific Computing, "Using Profile Information for … [Read more...]
Preparing For Knights Landing – Stay in HBM Memory
NERSC published an informative preparatory article for programming the forthcoming Cori supercomputer that notes each Intel Xeon Phi “Knight’s Landing” (KNL) devices will be running in a “self-hosted” mode, meaning that there will be no host/traditional processor. Everything - including the operating system - will run on KNL. This eliminates concerns about data movement as … [Read more...]
MAGMA LU Decompositions, Factorizations, and Eigensolvers for Intel Xeon Phi Coprocessors Released
MAGMA MIC 1.3.1 now provides implementations for MAGMA's one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations, as well as linear and eigenproblem solver for Intel Xeon Phi Coprocessors. The MAGMA MIC 1.3.1 release adds Added orthogonal transformations … [Read more...]
Fine-Tuning Vectorization and Memory Traffic on Intel Xeon Phi Coprocessors
Andrey Vladimirov at ColFax International has posted source code and a paper, "Fine-Tuning Vectorization and Memory Traffic on Intel Xeon Phi Coprocessors: LU Decomposition of Small Matrices" on the ColFax site. Andrey notes, "Benchmarks show that the discussed optimizations improve the application performance on the coprocessor by a factor of 2.8 compared to the unoptimized … [Read more...]
TACC Accepting Summer Internship Applications
TACC is now accepting applications for the 2015 Research Experience for Undergraduates (REU) from June 1 to August 1, 2015. This summer, 10 undergraduate students from across the United States majoring in science and engineering will be immersed in training at UT Austin to become the next generation of ‘game changers.' Participants will explore grand challenges including … [Read more...]
Kriging Interpolation Exhibits Strong Scaling Across GPUs
Geostatistical interpolation (Kriging) can be useful in a great number of applications where high fidelity models are required for mapping spatial effects and making predictions based on observations. It is widely utilized in the domain of spatial analysis and computer experiments and heavily used by the US Air Force and GIS services. The following images by Yang, et. al. … [Read more...]
PathScale EKO 5 Compiler Adds Support for ARM and C++11
PathScale - a well-known provider of one of the industry's highest-performance 64-bit C, C++, and Fortran compilers - has added support for ARM and C++11 in their EKOPath 5 compiler suite. “We are pleased to provide the ARM HPC ecosystem with our new EKOPath compiler,” said Christopher Bergstrom, the PathScale Chief Technology Officer. “Our C/Fortran compiler and BLAS … [Read more...]
Free eBook – Optimizing HPC Applications with Intel Cluster Tools
Amazon is offering a free Kindle edition of the book, “Optimizing HPC Applications with Intel Cluster Tools" by Alexander Supalov, Andrey Semin, Michael Klemm, and Christopher Dahnken. Table of Contents Foreword by Bronis de Supinski, CTO, Livermore Computing, LLNL Introduction Chapter 1: No Time to Read this Book? Chapter 2: Overview of Platform … [Read more...]
Intel Xeon Phi Used in Australia’s DUG Oil and Gas Supercomputer
Equivalent to the 8th largest system in the world, Perth Australia's DownUnder GeoSolutions (DUG) has purchased a large Intel Xeon Phi equipped supercomputer from SGI for Oil and Gas applications. This system augments TF/s capable workstations also equipped with Intel Xeon Phi devices. DownUnder notes that all their software has been rewritten to use coprocessors with 6x - 10x … [Read more...]