For a limited time Intel is selling Intel® Xeon Phi™ Coprocessor 31S1P for under $200. This offer is designed for Software developers to cost-effectively purchase systems or clusters from OEMs to modernize their code for greater levels of performance. See one of the OEMs at this link, or Intel your rep for eligibility requirements. Additionally, as part of this developer … [Read more...]
Morton Order Improves Performance
Author Kerry Evans writes in his High Performance Parallelism Pearls chapter, "There are many facets to performance optimization but three issues to deal with right from the beginning are memory access, vectorization, and parallelization. Unless we can optimize these, we cannot achieve peak performance.” Specifically, this chapter examines a method of mapping multidimensional … [Read more...]
Scalable Out-Of-Core Solvers On A Cluster
This chapters documents the implementation of a parallel distributed memory out-of-core (OOC) solver for performing LU and Cholesky factorizations of a large dense matrix on clusters equipped with Intel Xeon Phi coprocessors. The code was ported from CUDA with high-level library routines in CUBLAS This matches well with the offload model for the coprocessor using the … [Read more...]
Portable Performance with OpenCL On Intel Xeon Phi
This High Performance Parallelism Pearl show the potential for using the OpenCL™ standard parallel programming language to deliver portable performance on Intel Xeon Phi coprocessors, Xeon processors, and many-core devices such as GPUs from multiple vendors. This portable performance can be delivered from a single program without needing multiple versions of the code, an … [Read more...]
High Performance Ray Tracing With Embree On Intel Xeon Phi
Ray tracing is a technique for generating images of synthetic scenes. Because ray tracing simulates the physics of light transport in the real world, it can be used to achieve high quality and even photorealistic results. The chapter authors in High Performance Parallelism Pearls describe how the Intel Embree ray tracing kernel library can be used to achieve high performance … [Read more...]
Register For Lustre’s Brent Gorda Parallel Storage and Big Data HP-Cast
Register here to join Brent Gorda, GM of Intel Corporations High Performance Data division as he presents on the topic of how the Intel® Enterprise Edition for Lustre* software makes parallel storage simpler to manage and more productive for data intensive applications. Big Data has been synonymous with high performance computing for decades, and has become the primary driver … [Read more...]
Performance Optimization Of Black-Scholes Pricing On Intel Xeon Phi
Who would have thought that a mere two hundred lines of code provide so many capabilities! The chapter authors (Iosif Meyerov, Alexander Sysoyev, Nikita Astafiev, and Ilya Burylov) apply their optimization expertise for Intel Xeon and Intel Xeon Phi to calculate the fair prices of a set of European options. They chose this the Black-Scholes calculation for the following … [Read more...]
Efficient Nested Parallelism On Large Scale Systems
Choosing right threading library is critical for application performance, as different threading libraries provide significantly different performance behavior, especially when dealing with the complex computer systems as Intel Xeon Phi coprocessor and NUMA Intel Xeon processor machines. Unfortunately, choosing the right threading library is not enough, addition application … [Read more...]
Nov. 10 Live and Webcast Hands-On Intel Xeon Phi Training
On Monday, November 10, 2014 from 9 AM to 4:30 PM EST Xsede and TACC will present a live AND webcast training event regarding using the Intel Xeon Phis on both the Beacon machine at NICS and the Stampede machine at TACC. There is no charge for registration, but there are only 16 seats for the live training and 25 75 participants allowed into the zoom webcast. The training will … [Read more...]
Free Online OpenCL FPGA Development Courses By Altera
Altera has released a new training course to teach OpenCL programmers how to develop code for FPGAs, "Building Custom Platforms for Altera SDK for OpenCL". This online training will go over all of the necessary steps to create a custom platform compatible with the Altera SDK for OpenCL. Altera recommends completing the following courses: Introduction to Parallel … [Read more...]









