TechEnablement

Education, Planning, Analysis, Code

The Missing Link in NVlink, or “Hello Pascal” bye-bye PCI bus limitations!

May 13, 2014 by Rob Farber

Say hello to NVLink, a new technology from NVIDIA that is not constrained by PCIe bandwidth and latency limitations, but you will have to wait for the Pascal generation of GPUs in 2016 to get it. NVLink is NVIDIA's proprietary "DRAM speed and latency" class interface for CPU-to-GPU and GPU-to-GPU point-to-point communications. The basic building block for NVLink is a high-speed, … [Read more...]

OpenCL Haswell Iris 5200 Performance Results – 800 GF/s Peak Performance

May 12, 2014 by Rob Farber

The Intel Haswell chip contains an integrated GPU that delivers significantly better OpenCL performance than an NVIDIA GeForce GT 650M, exceeding 800 GF/s peak performance. Allan MacKinnon at PixelIO has been investigating the OpenCL performance of this device and has found a plethora of on-GPU registers, but also that the GPU appears to be both power and thermally … [Read more...]

OpenCL + Java Acceleration on Mobile Promises 8x speedup with 3x Less Power

May 6, 2014 by Rob Farber

In what will certainly become a flood of papers about GPU acceleration of Java applications on mobile devices, a master's thesis by Iype P. Joseph at the University of Ottawa claims 8x performance gains and 3x reductions in power consumption through the use of Java bindings with OpenCL 1.1 on a Freescale i.MX6Q SabreLite board. With NVIDIA entering the programmable mobile GPU … [Read more...]

GTC 2014 Presentations Now Available Online to All

May 5, 2014 by Rob Farber

The NVIDIA GTC presentations are now available for all to view at http://www.gputechconf.com/gtcnew/on-demand-gtc.php. Of course, I recommend my 30-minute presentation, "S4178: Killer-app Fundamentals: Massively-parallel data structures, Performance to 13 PF/s, Portability, Transparency, and more" [pdf][video]. My talk covers: Deep-learning to 13 PF/s on the ORNL … [Read more...]

Proof-of-Concept WebCL Chrome Browser Available from AMD

April 21, 2014 by Rob Farber

AMD has been working on implementing WebCL inside a Chrome browser to give web programmers access to OpenCL acceleration plus WebCL and WebGL interoperability. (Firefox, Chrome and Safari all have some form of WebCL support.) The following video shows the potential: http://youtu.be/dGD9NpipcrE Hands-on experience can be found through the Chromium-WebCL GitHub project, … [Read more...]

OpenCL 2.0 Conformance Test Suite

April 18, 2014 by Rob Farber

The OpenCL adage of "write once, test everywhere" is being addressed by the Khronos organization through the release of the OpenCL 2.0 test suite. The Khronos™ Group today announced the availability of the official conformance test suite for the OpenCL 2.0 specification, making it possible for implementers to certify that their implementations are officially conformant … [Read more...]

WebCL 1.0 specification released

April 18, 2014 by Rob Farber

We all know that browser-accelerated 3D graphics are coming and that this technology, however instantiated, is going to be a tremendous money maker. WebCL is a technology to watch for browser-accelerated 3D graphics. The release of the WebCL 1.0 specification is the latest evolution in the Khronos effort to bring 3D browser acceleration to the Internet. WebCL 1.0 … [Read more...]

TechEnablement Adds Study Guides for CUDA, OpenACC, OpenCL, and Intel Xeon Phi

April 17, 2014 by Rob Farber

Today TechEnablement.com has provided study guides to help students "learn to change the world" with supercomputing for the masses. The study guides cover: CUDA, OpenACC, OpenCL, and Intel Xeon Phi … [Read more...]

Intel Releases OpenCL™ 1.2 Support for Xeon Phi™ Coprocessors

April 16, 2014 by Rob Farber

The Intel press room announced that OpenCL support is now available (link). The new SDK broadens options for developers on Intel® architecture and includes tools, optimization guides and training. The SDK helps OpenCL developers improve performance and efficiency on Intel® Xeon Phi™ coprocessors and Intel® Xeon® processors. For those interested in using OpenCL to program … [Read more...]

Part 1: OpenCL™ – Portable Parallelism

April 15, 2014 by Rob Farber

This first article in a series on portable multithreaded programming using OpenCL™ briefly discusses the thought behind the standard and demonstrates how to download and use the ATI Stream software development kit (SDK) to build and run an OpenCL program. View at The Code Project (http://www.codeproject.com/Articles/110685/Part-OpenCL-Portable-Parallelism). The thought … [Read more...]
