• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Archives for CUDA

PyFR – Python/GPU Combustion Code Shortlisted for Several HPCWire Readers Choice Awards

September 26, 2015 by Rob Farber Leave a Comment

PyFR, the Python-based GPU accelerated CFD solver PyFR managed by TechEnablement contributor Peter Vincent has been shortlisted for several HPCWire Readers Choice Awards this year: 12. Best HPC Software Product or Technology 18. Best HPC Collaboration Between Academia & Industry 20. Top 5 New Products or Technologies to Watch If you would like to support them, … [Read more...]

Port Some CUDA Codes To Intel Xeon Phi Simply and Efficiently

May 15, 2015 by Rob Farber Leave a Comment

This tutorial shows that it relatively easy to port many CUDA C/C++ source codes to OpenMP. In the past, such efforts were not generally considered worthwhile because of the large performance difference between multicore processors (that use OpenMP) and GPUs. The introduction of teraflop/s Intel Xeon Phi coprocessors eliminated that performance difference, which makes it much … [Read more...]

PGI Compiled OpenACC ILP Loop Beats CUDA-7 by 200 GF/s on Deep-learning PCA Example

March 23, 2015 by Rob Farber Leave a Comment

The PGI OpenACC compiler beat the performance of a CUDA 7.0 NVIDIA nvcc compiled deep-learning based PCA (Principal Components Analysis) example by 200 GF/s on a K40c using an ILP (Instruction Level Parallelism) loop structure taught in the TechEnablement classes and forthcoming Farber OpenACC book. PCA is an important data analysis tool utilized by data scientists. Sign up for … [Read more...]

CUDA 7 Released

March 20, 2015 by Rob Farber Leave a Comment

NVIDIA released CUDA 7 for all to use! Download here for Windows, Linux x86, Linux Power 8,  and MacOSX: Productivity and Performance Improvements C++11 support makes it easier for C++ developers to accelerate their applications Write less code with ‘auto’ and ‘lambda’, especially when using the Thrust template library. New cuSOLVER library of dense and sparse direct … [Read more...]

ACM Paper Observes FPGA, GPU, CPU Energy Efficiency Hierarchy

March 10, 2015 by Rob Farber Leave a Comment

Sparsh Mittal and Jeff Vetter provide a survey of the comparative energy efficiency of GPUs relative to FPGAs and CPUs in their Jan. 2015 ACM Computing Surveys (CSUR) paper, "A Survey of Methods for Analyzing and Improving GPU Energy Efficiency" in which they observe a general energy efficiency hierarchy where, "it is clear that although for majority of works, FPGAs are … [Read more...]

NVIDIA Titan X Powers Games and Virtual Reality

March 5, 2015 by Rob Farber Leave a Comment

NVIDIA CEO Jen-Hsun Huang announced NVIDIA's latest GPU, the Titan X, in a surprise appearance at the 2015 Game Developers Conference. Jen-Hsun claims it is the most powerful GPU on the planet. The announcement followed a presentation by Epic Games' co-founder Tim Sweeney  about the convergence of photorealistic imagery, film, video games, architecture, industrial design, and … [Read more...]

Multiple OpenACC Hackathons Scheduled Around the World

February 23, 2015 by Rob Farber Leave a Comment

OakRidge National Laboratory has announced three GPU Hackathons for 2015. The first will be hosted April 20-24 by the National Center for Supercomputing Applications on the UIUC Campus. The second will be hosted by the Swiss National Supercomputing Centre in Lugano, Switzerland from July 6-10. The final one will be hosted by the Oak Ridge Leadership Computing Facility on … [Read more...]

TACC Accepting Summer Internship Applications

January 28, 2015 by Rob Farber Leave a Comment

TACC is now accepting applications for the 2015 Research Experience for Undergraduates (REU) from June 1 to August 1, 2015. This summer, 10 undergraduate students from across the United States majoring in science and engineering will be immersed in training at UT Austin to become the next generation of ‘game changers.' Participants will explore grand challenges including … [Read more...]

Facebook Open-Sources Torch for Deep-Learning Neural Networks

January 19, 2015 by Rob Farber Leave a Comment

Facebook has made Torch, an open source development environment for numerics, machine learning, and computer vision, with a particular emphasis on deep learning and convolutional nets available to everyone. The latest release includes GPU-optimized modules for large convolutional nets (ConvNets), as well as networks with sparse activations that are commonly used in Natural … [Read more...]

Kriging Interpolation Exhibits Strong Scaling Across GPUs

January 17, 2015 by Rob Farber Leave a Comment

Geostatistical interpolation (Kriging) can be useful in a great number of applications where high fidelity models are required for mapping spatial effects and making predictions based on observations. It is widely utilized in the domain of spatial analysis and computer experiments and heavily used by the US  Air Force and GIS services. The following images by Yang, et. al. … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Part 1: OpenCL™ – Portable Parallelism
  • Pascal upgrade and Intel Xeon Phi at the Swiss National Supercomputing Centre
  • OpenACC Study Guide
  • ARM64 with CUDA Early Access Boards Now Available
  • Altera OpenCL Programmable FPGA Talks QPI, HMC, and 100G Optical Interconnect

Archives

© 2026 · techenablement.com