NVIDIA has released the PDF slides and session video recordings from GTC 2015 at http://www.gputechconf.com/attend/sessions. The sessions covered a diverse range of topics: Astronomy & Astrophysics; Augmented & Virtual Reality; Automotive; Big Data Analytics; Computational Physics; Computer Vision; Cloud Computing & HPC; Developer – … [Read more...]
PGI Compiled OpenACC ILP Loop Beats CUDA-7 by 200 GF/s on Deep-learning PCA Example
The PGI OpenACC compiler outperformed a deep-learning-based PCA (Principal Components Analysis) example compiled with NVIDIA's CUDA 7.0 nvcc by 200 GF/s on a K40c, using an ILP (Instruction Level Parallelism) loop structure taught in the TechEnablement classes and the forthcoming Farber OpenACC book. PCA is an important data analysis tool used by data scientists. Sign up for … [Read more...]
GCC5 Release Candidate in Early April With OpenACC, Cilk, and OpenMP4 Offload!
Richard Biener posted a message to gcc.gnu.org saying to expect a GCC 5 release candidate in the first week of April. GCC 5 supports OpenACC, Cilk Plus, and OpenMP 4.0 offload. "We've come a long way towards the release criteria of zero P1 bugs. There are still a few remaining P1s though and we are targeting for a GCC 5 release candidate in the first week of April (given those … [Read more...]
PathScale Reports Highest SPEC-ACCEL OpenACC Results Using AMD S9150 GPU
PathScale, in collaboration with Cirrascale, posted the fastest OpenACC SPEC-ACCEL results to date using an AMD FirePro S9150 GPU. ENZO 2015 is the first major PathScale release to include support for AMD FirePro devices, and it demonstrates the "pragmatic portability" of OpenACC: a single source base compiles and runs on a variety of hardware platforms. The ENZO 2015 compiler … [Read more...]
NVIDIA GTC 2015 keynote – Near-term Roadmap is Deep-Learning
The GTC 2015 keynote by NVIDIA CEO Jen-Hsun Huang showed an intense focus on deep learning through four topic areas: (1) the official Titan X GPU announcement, which noted that the 7 TF/s SP and 0.2 TF/s DP device has an excellent arithmetic performance mix for deep learning, (2) the NVIDIA DIGITS (Deep GPU Training System for Data Scientists) software and custom NVIDIA built … [Read more...]
Intel Xeon Phi Benefits Commercial Engineering Simulation Code
ANSYS and Intel worked closely together to optimize the first commercial engineering simulation package for the multi-core Intel® Xeon® processor E5-2600 v2 and v3 families and the many-core Intel® Xeon Phi™ coprocessor. Results indicate a 3.1x speedup after upgrading from a Xeon v2 to a v3 solution, and a further 1.7x speedup when using an Intel® Xeon Phi™ 7120 coprocessor (or total … [Read more...]
ACM Paper Observes FPGA, GPU, CPU Energy Efficiency Hierarchy
Sparsh Mittal and Jeff Vetter provide a survey of the comparative energy efficiency of GPUs relative to FPGAs and CPUs in their Jan. 2015 ACM Computing Surveys (CSUR) paper, "A Survey of Methods for Analyzing and Improving GPU Energy Efficiency" in which they observe a general energy efficiency hierarchy where, "it is clear that although for majority of works, FPGAs are … [Read more...]
Plan Ahead – SC15 Housing Site Open For Room Reservations!
The SC15 housing site is now open through October. The conference takes place Nov 15-20 in Austin, Texas.
IMPORTANT DATES AND DEADLINES
March 2, 2015 – Hotel reservation system opens
September 20, 2015 – Last day for submitting occupancy lists for exhibitor room blocks
October 16, 2015 – Last day for making hotel reservations
October 16, 2015 – Last day for canceling … [Read more...]
NVIDIA Titan X Powers Games and Virtual Reality
NVIDIA CEO Jen-Hsun Huang announced NVIDIA's latest GPU, the Titan X, in a surprise appearance at the 2015 Game Developers Conference. Jen-Hsun claims it is the most powerful GPU on the planet. The announcement followed a presentation by Epic Games' co-founder Tim Sweeney about the convergence of photorealistic imagery, film, video games, architecture, industrial design, and … [Read more...]
Concise Comparison Adds OpenMP Versus OpenACC To CUDA Versus OpenCL Debates
The debate over OpenMP versus OpenACC for manycore and heterogeneous computing is starting to heat up. Michael Wong (CEO of OpenMP Corp), Barbara Chapman (Univ. Houston), and Yonghong Yan (Assistant Prof. Univ. Oakland and OpenMP ARB representative) have written a nice, quick-read comparative article on HPCWire: "A Comparison of Heterogeneous and Manycore … [Read more...]