Interest in WebCL is expanding as exemplified by the Nokia WebCL project that has released a Firefox plugin to run WebCL apps. Developers now have a choice of running WebCL in Chrome via AMD and Firefox with the Nokia plugin. (Firefox, Chrome and Safari all have some form of WebCL support.) The continued expansion of WebCL proof-of-concept … [Read more...]
The Missing Link in NVlink, or “Hello Pascal” bye-bye PCI bus limitations!
Say hello to NVlink, a new technology by NVIDIA that is not constrained by PCIe bandwidth and latency limitations, but you will have to wait for the Pascal generation of 2016 GPUs to get it. NVlink is NVIDIA's properitary "DRAM speed and latency" class interface for CPU to GPU and GPU to GPU point-to-point communications. The basic building block for NVLink is a high-speed, … [Read more...]
OpenCL Haswell Iris 5200 Performance Results – 800 GF/s Peak Performance
The Intel Haswell chip contains an integrated GPU that delivers significantly better OpenCL performance than an NVIDIA GeForce GT 650M - exceeding 800 GF/s peak performance. Allan MacKinnon at PixelIO has been investigating the OpenCL performance of this device and has been finding a plethora of on-gpu registers but also that the GPU appears to be both power and thermally … [Read more...]
PGI 14.4 is now released with lots of OpenACC C++ Goodness!
PGI 14.4 is now released with lots of OpenACC C++ goodness. Give it a try! Here is the link for or those with existing licenses. If need be, get a 15 day trial license and use some of my OpenACC tutorials. PGI Trial keys Trial license keys are used for evaluating PGI software. They are valid for fifteen days. If you haven't already done so, you … [Read more...]
NVIDIA HBAO+ and TXAA Enhanced Gaming Video
A fun video showing the progress being made in near photo realistic gaming imagery. THe big news in this video is the use of HBAO+ (for ambient occlusion) and TXAA (anti-aliasing) technologies. I imagine such video platforms can be used for small studio animation projects as well. The Watchdog game highlighted in the video will be released May 27, … [Read more...]
OpenCL + Java Acceleration on Mobile Promises 8x speedup with 3x Less Power
In what will certainly become a flood of papers about GPU acceleration of Java applications on mobile devices, a masters theses by Iype P. Joseph at the University of Ottawa claims 8x performance gains and 3x reductions in power consumption through the use of Java binding with OpenCL 1.1 on a a Freescale i.MX6Q SabreLite board. With NVIDIA entering the programmable mobile GPU … [Read more...]
GTC 2014 Presentations Now Available Online to All
The NVIDIA GTC presentations are now available for all to view at http://www.gputechconf.com/gtcnew/on-demand-gtc.php. Of-course, I recommend my 30 minute presentation, "S4178: Killer-app Fundamentals: Massively-parallel data structures, Performance to 13 PF/s, Portability, Transparency, and more " [pdf][video]. My talk covers: Deep-learning to 13 PF/s on the ORNL … [Read more...]
Understanding the Rational behind 400 GB Flash-based DIMM Memory
On January 24th, SanDisk announced shipments of ULLtraDIMM SSD storage in concert with an IBM announcement rebranding the SanDisk ULLtraDIMMs as eXFlash DIMMs. On March 21, SanDisk's stocks hit a 14-year high. ULLtraDIMM SSD storage puts Flash memory in a standard DIMM form factor that can be plugged into a memory socket. The Linux, Windows, or VMware UEFI/BIOS … [Read more...]
NERSC to Procure “Cori” a Knights Landing Based Cray XC Supercomputer
Scheduled for delivery in mid-2016, NERSC's next-generation supercomputer, a Cray XC, will be named after Gerty Cori, the first American woman to be honored with a Nobel Prize in science. The Cory supercomputer will use Intel’s next-generation Intel® Xeon Phi™ processor –- code-named “Knights Landing” -- a self-hosted, manycore processor with on-package high bandwidth memory … [Read more...]
Opportunities to Run on Jetson, the Latest Tegras, and ORNL Titan
Following Jen-Hsun's strategy to enable those who wish to use NVIDIA chips, developers can win a Jetson K1, get free access to the latest Tegra GPUs. Also those with big computations can submit INCITE proposals to run on the ORNL Titan supercomputer. Ends today (4/30/14) to possibly win a Jetson K1 (link) merely by submitting an idea via … [Read more...]








