StreamComputing launched an educational initiative this week that aims to get more developers to study and use OpenCL in their projects. Within this project, up to 20 collaborators will port as many GEGL operations to OpenCL as possible. There is tons of work to be done, but the benefit is that your code will be running in GIMP. Vincent Hindriksen and Adel Johar … [Read more...]
Intel tutorial shows how to view OpenCL assembly code
In order to better optimize and debug OpenCL kernels, sometimes it is very helpful to look at the underlying assembly. This article shows you the tools available in the Intel® SDK for OpenCL™ Applications that allow you to view assembly generated by the offline compiler for individual kernels, highlight the regions of the assembly code that correspond to OpenCL C code, as well … [Read more...]
SC15 – Accelerator Use in World’s Top Supercomputers
NVIDIA notes more than 100 accelerated systems now on TOP500 list TOP500 Highlights: One-third of FLOPS now powered by accelerators NVIDIA Tesla GPUs sweep 23 of 24 new accelerated supercomputers Tesla supercomputers growing at 48% CAGR over past five years AUSTIN, Texas—SC15—Nov. 16, 2015—Today’s list of the world’s TOP500 supercomputers shows the extent to which … [Read more...]
Pre-order Your $599 Tegra X1 Powered TX1 Developer Kit
NVIDIA is now taking pre-orders for the Tegra X1 powered TX1 development kit at the NVIDIA online store. You can develop OpenACC, CUDA, and OpenCL codes for this 64-bit ARM based system! No word yet on pricing of the actual credit card sized TX1 computer. JETSON TX1 MODULE NVIDIA Maxwell™ GPU with 256 NVIDIA® CUDA® Cores Quad-core ARM® Cortex®-A57 MPCore … [Read more...]
Learn to Make Windows 10 Apps with Free Microsoft Course Then Add GPU Acceleration!
Free Windows courses by themselves are not newsworthy, but those who wish to create Windows 10 apps for the Windows Marketplace - AND exploit the power of CUDA and OpenCL computing via C# should find the Free Microsoft course in combination with the TechEnablement tutorial "Combine C-Sharp With CUDA and OpenCL On Linux, iOS, Android and Windows" an enabling pair of … [Read more...]
No longer a preview – Intel updates the INDE OpenCL Code Analyzer
Intel has updated the new OpenCL™ Code Analyzer, a feature of Intel® INDE OpenCL™ Code Builder, which adds performance analysis capabilities integrated into a Microsoft Visual Studio OpenCL development environment. No longer a preview, the OpenCL Code Builder now supports OpenCL code development, which enables developers to carry on performance optimizations in each step of … [Read more...]
ACM Paper Observes FPGA, GPU, CPU Energy Efficiency Hierarchy
Sparsh Mittal and Jeff Vetter provide a survey of the comparative energy efficiency of GPUs relative to FPGAs and CPUs in their Jan. 2015 ACM Computing Surveys (CSUR) paper, "A Survey of Methods for Analyzing and Improving GPU Energy Efficiency" in which they observe a general energy efficiency hierarchy where, "it is clear that although for majority of works, FPGAs are … [Read more...]
Provisional OpenCL 2.1 Enables Kernels Written Using a Subset of C++14 and Uses SPIR-V
The Khronos™ Group today announced the ratification and public release of the OpenCL™ 2.1 provisional specification viewable at www.khronos.org/opencl/ so developers and implementers can provide feedback before finalization at the OpenCL forums. Comments can be made via https://www.khronos.org/opencl/opencl_feedback_forum. The OpenCL 2.1 C++ kernel language is a static … [Read more...]
OpenCL SPIR Tutorial Teaches Portability Without Shipping Kernel Source
Intel has released an OpenCL tutorial showing how developers can use SPIR (Standard Portable Intermediate Representation) to preserve vendor and device portability without having to ship OpenCL kernel source code. For more information about how SPIR enables commercial OpenCl applications, see our article, "Commercial OpenCL! SPIR 2.0 Protects IP Yet Allows Powerful, Portable, … [Read more...]
OpenCL Programmed FPGAs Claim a 3X Performance-to-Power Advantage at Microsoft
The Microsoft white paper, "Accelerating Deep Convolutional Neural Networks Using Specialized Hardware" describes an OpenCL programmed implementation of Convolutional Neural Networks (CNNs) that touts a conservative estimate of 3x the performance-to-power advantage over NVIDIA GPUs when running on new FPGA hardware. Doug Berger posted on the Inside Microsoft Research … [Read more...]