Cross-platform interest in WebCL is expanding with support for Firefox, Chrome as well as Safari. WebCL provides a tremendous opportunity to exploit parallelism on client-side machines. Thanks for to Antonio Gomes (Twitter @tonikitoo) who brought the Safari webkit-webcl implementation to our attention! The SMAST Computational Laboratory (CMLab) at the School for Marine … [Read more...]
Try Quantum Computing in Your WebGL-enabled Browser
GPUs are wonderful for running energy minimization algorithms where a system relaxes to a low energy state to solve a problem. The 13 PF/s Titan Deep-learning teaching code is a compelling example of this ability. Similarly, Quantum Computing solves a problem (like RSA encryption) by having a quantum system relax to a low energy state. Google has created a WebGL Chrome … [Read more...]
May 2014 Current K1 Development Pathways
NVIDIA Tegra K1 Jetson development kits are now available for purchase from Newegg or Microcenter. The NVIDIA Tegra K1 chip has generated much interest due to the CUDA programmability and power efficiency of the ARM/Kepler ceepee-geepee combination. Upcoming Tegra K1 devices include the Xiaomi MiPad, NVIDIA's reference design tablet, plus the K1 powered Shield 2 gaming device. … [Read more...]
Calling All Authors for an Intel Xeon Phi Gems Book – Rough Submissions Due May 29, 2014
All are invited to contribute to High Performance Parallelism Gems – Successful Approaches for Multicore and Many-core Programming (working title) a contribution-based book that will focus on practical techniques for Intel Xeon processor and Intel Xeon Phi coprocessor parallel computing. Submissions<http://lotsofcores.com/gems> are due by May 29, 2014 in order to … [Read more...]
Firefox WebCL plugin, WebCL Security, and Compliance Tests
Interest in WebCL is expanding as exemplified by the Nokia WebCL project that has released a Firefox plugin to run WebCL apps. Developers now have a choice of running WebCL in Chrome via AMD and Firefox with the Nokia plugin. (Firefox, Chrome and Safari all have some form of WebCL support.) The continued expansion of WebCL proof-of-concept … [Read more...]
The Missing Link in NVlink, or “Hello Pascal” bye-bye PCI bus limitations!
Say hello to NVlink, a new technology by NVIDIA that is not constrained by PCIe bandwidth and latency limitations, but you will have to wait for the Pascal generation of 2016 GPUs to get it. NVlink is NVIDIA's properitary "DRAM speed and latency" class interface for CPU to GPU and GPU to GPU point-to-point communications. The basic building block for NVLink is a high-speed, … [Read more...]
OpenCL + Java Acceleration on Mobile Promises 8x speedup with 3x Less Power
In what will certainly become a flood of papers about GPU acceleration of Java applications on mobile devices, a masters theses by Iype P. Joseph at the University of Ottawa claims 8x performance gains and 3x reductions in power consumption through the use of Java binding with OpenCL 1.1 on a a Freescale i.MX6Q SabreLite board. With NVIDIA entering the programmable mobile GPU … [Read more...]
Understanding the Rational behind 400 GB Flash-based DIMM Memory
On January 24th, SanDisk announced shipments of ULLtraDIMM SSD storage in concert with an IBM announcement rebranding the SanDisk ULLtraDIMMs as eXFlash DIMMs. On March 21, SanDisk's stocks hit a 14-year high. ULLtraDIMM SSD storage puts Flash memory in a standard DIMM form factor that can be plugged into a memory socket. The Linux, Windows, or VMware UEFI/BIOS … [Read more...]
NERSC to Procure “Cori” a Knights Landing Based Cray XC Supercomputer
Scheduled for delivery in mid-2016, NERSC's next-generation supercomputer, a Cray XC, will be named after Gerty Cori, the first American woman to be honored with a Nobel Prize in science. The Cory supercomputer will use Intel’s next-generation Intel® Xeon Phi™ processor –- code-named “Knights Landing” -- a self-hosted, manycore processor with on-package high bandwidth memory … [Read more...]
PGI 14.4 Release Contains Much OpenACC C++ Goodness
PGI released their 14.4 and upcoming 14.7 OpenACC 2.0 roadmap. The expectation is that we will see the 14.4 release in early May and the 14.7 release in early July. Note: these are not official PGI dates. Analysis: The 14.4 support of atomic operations will enable many low-wait algorithms such as counters and massively parallel stacks. Improved reduction performance in … [Read more...]








