Mobile is big money unless you are playing catch up. Sean Hollister at The Verge relays a report that the Intel Mobile division lost $3.15 Billion in 2013 and that losses in Q1 2014 are already $929 million. In my article, "Mobile Tech between a Rock and a Hard Place", I noted: Intel’s chief executive Brian Krzanich admitted at the firm’s November 2013 annual investor … [Read more...]
(4/24 update) Signals from Nvidia’s Sumit Gupta
Sumit Gupta is a busy man. Named by HPCwire as a 2013 "Person to Watch", Sumit does not idly take time to create a blog post unless it conveys a message about the NVIDIA Tesla development and marketing effort. His recent blog, "Fostering an Explosion of Innovation in the Data Center", posted by Steve Hamm, recognizes how the data-center is going to be supporting mobile … [Read more...]
Calling all Android Developers Interested in Using Microsoft Visual Studio on NVIDIA Tegra
The NVIDIA Developer Tools team is conduction a survey to assess interest in Microsoft Visual Studio as a development tool for Tegra Android devices. Not a registered developer? No problem! click here. NVIDIA is working on making development on Tegra platforms the best possible environment for Android application development. We are conducting interest from our Tegra developer … [Read more...]
Proof-of-Concept WebCL Chrome Browser Available from AMD
AMD has been working on implementing WebCL inside a Chrome browser to enable web programmer's access to OpenCL acceleration plus WebCL and WebGL interoperability. (Firefox, Chrome and Safari all have some form of WebCL support.) The following video shows the potential: http://youtu.be/dGD9NpipcrE Hands on experience can be found through the Chromium-WebCL github project, … [Read more...]
Inside NVIDIA’s Unified Memory: Multi-GPU Limitations and the Need for a cudaMadvise API Call
The CUDA 6.0 Unified Memory offers a “single-pointer-to-data” model that is similar to CUDA’s zero-copy mapped memory. Both make it trivially easy for the programmer to access memory on the CPU or GPU, but applications that use mapped memory have to perform a PCI bus transfer occur every time a memory access steps outside of a cache line while a kernel running in a Unified … [Read more...]
Battery Powered Supercomputing for the Masses: First Impression of the NVIDIA Jetson TK1 board
GTC 2014 demonstrated that we have now entered the "Battery Powered Supercomputing for the Masses" era. I had the opportunity to experience a Jetson TK1 board running ubuntu 13.04 at the hands-on lab. First impressions were very positive with a snappy response to the Ubuntu window system.. The GTC hands-on labs are oriented for techies and not the press. They provide a very … [Read more...]
Micron’s New Automata Processor
Adding computation to memory is a fantastic way to accelerate applications and real-time solutions. Content addressable memory (CAM) is a widespread and compelling example of how hardware can speed table lookups. (Most virtual memory computers utilize CAM to perform page lookups.) Micron recently announced the Automata Processor (AP) that implements an NFA (Non-deterministic … [Read more...]
Deep-learning Teaching Code Achieves 13 PF/s on the ORNL Titan Supercomputer
The deep-learning teaching code described in my book, "CUDA Application Design and Development" [Chapters 2, 3, and 9] plus online tutorials achieved 13 PF/s average sustained performance using 16,384 GPUs on the Oakridge Titan supercomputer. Full source code for my teaching code can be found on github in the farbopt directory. Nicole Hemsoth at HPCwire noted these CUDA … [Read more...]
OpenCL 2.0 Conformance Test Suite
The adage with OpenCL is "write once - test everywhere" is being addressed by the Khronos organization through the release of the OpenCL 2.0 test suite. The Khronos™ Group today announced the availability of the official conformance test suite for the OpenCL 2.0 specification, making it possible for implementers to certify that their implementations are officially conformant … [Read more...]
WebCL 1.0 specification released
We all know that browser accelerated 3D graphics are coming and that this technology solution - however instantiated - is going to be a tremendous money maker. WebCL is a technology to watch for browser accelerated 3D graphics. The release of the webCL 1.0 specification is the latest evolution in the Khrnos effort to bring 3D browser acceleration to the Internet. WebCL 1.0 … [Read more...]









