• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Archives for GPU

Guide to Unlock the NVIDIA Shield Tablet Bootloader – First Step to a Dual-boot Device

September 14, 2014 by Rob Farber Leave a Comment

Many people wish to run Ubuntu, or Ubuntu touch on the recently released NVIDIA Shield Tablet. The first step is to unlock the bootloader, which can be performed according to the instructions by Abdur Rehman in his post, "How To Unlock/Relock Bootloader on NVIDIA Shield Tablet [Guide]". Note that unlocking the bootloader does void the warranty - even if you relock it! Ubuntu … [Read more...]

Intel Broadwell Compute Gen8 GPU Architecture

September 13, 2014 by Rob Farber Leave a Comment

Attention OpenCL programmers! Intel has released a detailed description of the Gen8 Broadwell GPU compute architecture, "The Compute Architecture of Intel® Processor Graphics Gen8". Broadwell is a 14nm die shrink of Intel’s microarchitecture incorporating significant reworking of the Intel HD 5000-series (Iris) Gen 7.5 GPU found in Haswell including: (1)  throughput for 32-bit … [Read more...]

Houston Workshop: Directives and Tools for Accelerators: A Seismic Programming Shift

September 11, 2014 by Rob Farber Leave a Comment

With space for 70-80 participants, those who wish to attend the FREE University of Houston workshop, "Directives and Tools for Accelerators: A Seismic Programming Shift" must register by October 10th, 2014. The workshop is a fill-day event on October 20th, 2014, with a preceding welcome reception on October 19th, 2014. THis workshop is organized by the HPC Tools group in the … [Read more...]

GPUs Power Over 90% of ImageNet Deep-Learning Visual Recognition Challenge Entries

September 7, 2014 by Rob Farber Leave a Comment

Over 90 percent of the participating teams and three of the four winners in the prestigious 2014 ImageNet Large Scale Visual Recognition Challenge used GPUs to enable their deep learning work. Deep learning is a fast-growing segment of machine learning that involves the creation of sophisticated, multi-level or “deep” neural networks. These networks enable powerful … [Read more...]

New PyFR Paper “Heterogeneous Computing on Mixed Unstructured Grids with PyFR”

September 6, 2014 by Rob Farber Leave a Comment

Peter Vincent's original PyFR post on TechEnablement has been extremely popular. Readers should be happy to hear that the PyFR team has published a new paper, "Heterogeneous Computing on Mixed Unstructured Grids with PyFR", showing this Python framework can perform high-order accurate unsteady simulations of flow on mixed unstructured grids using heterogeneous multi-node … [Read more...]

Dongarra Gives Deep-Learning a Python Interface With RaPyDLI

September 5, 2014 by Rob Farber Leave a Comment

An NSF-funded project called "Rapid Python Deep Learning Infrastructure", or RaPyDLI received nearly $1 million in NSF grants. The project led by supercomputing luminaries Jack Dongarra (University of Tennessee) and Geoffrey Fox (Indiana University) along with Andrew Ng (Stanford, Baidu and Coursera) will allow users to program deep learning models in Python and port them to … [Read more...]

OpenACC Compilers Deliver 85% The Performance Of Hand-Optimized Code

September 4, 2014 by Rob Farber Leave a Comment

Directive-based compilers offer both portability and the ability to optimized code for specific platforms such as GPUs and CPUs.  A recent LCPC14 paper, "Directive-Based Compilers for GPUs",  by Swapnil Ghike, Ruben Gran, Maria J. Garzaran, David Padua at the University of Illinois at Urbana-Champaign found OpenACC code generated by the PGI and Cray OpenACC compilers achieved … [Read more...]

CUDA 340.29 Driver Significantly Boosts GPU Performance (100s GF/s For Machine-Learning)

August 24, 2014 by Rob Farber Leave a Comment

Reports are now coming in about  performance boosts that are the result of the CUDA 6.5 production release. The Blender project reports faster rendering time with CUDA-6.5. As can be seen in the graphs below that report performance on the farbopt deep-learning teaching code, CUDA-6.5 with the NVIDIA 340.29 driver have increased performance on linear problems (PCA analysis from … [Read more...]

A Try-Before-You-Code Linear Regression Method Claims 32% Error Predicting GPU Perf

August 23, 2014 by Rob Farber Leave a Comment

The paper, "Estimating GPU Speedups for Programs Without Writing a Single Line of GPU Code" by Newsha Ardalani, Karthikeyan Sankaralingam, Xiaojin Zhu at the University of Wisconsin Madison claims a linear regression model can deliver  a robust "automated tool that programmers can use to estimate potential GPU speedup before writing any GPU code". According to their study a … [Read more...]

PyFR: A GPU-Accelerated Next-Generation Computational Fluid Dynamics Python Framework

August 22, 2014 by admin Leave a Comment

PyFR is an open-source 5,000 line Python based framework for solving fluid-flow problems that can exploit many-core computing hardware such as GPUs! Computational simulation of fluid flow, often referred to as Computational Fluid Dynamics (CFD), plays an critical role in the aerodynamic design of numerous complex systems, including aircraft, F1 racing cars, and wind turbines. … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • SC15 - Accelerator Use in World’s Top Supercomputers
  • Learn how to program IBM's 'Deep-Learning' SyNAPSE chip
  • NVIDIA Tegra K1 Powered Shield Should Soon Be Available
  • OpenACC Adoption Continues to Gain Momentum in 2016
  • Guide to Get Ubuntu 14.10 Running Natively on Nvidia Shield Tablet

Archives

© 2026 · techenablement.com