TechEnablement

Education, Planning, Analysis, Code


GTC 2015 GPU Computing Videos and Slides Now Available Online

April 6, 2015 by Rob Farber

NVIDIA has released the PDF slides and session video recordings from GTC 2015 at http://www.gputechconf.com/attend/sessions. The sessions covered a diverse range of topics: Astronomy & Astrophysics, Augmented & Virtual Reality, Automotive, Big Data Analytics, Computational Physics, Computer Vision, Cloud Computing & HPC, Developer – … [Read more...]

PGI Compiled OpenACC ILP Loop Beats CUDA-7 by 200 GF/s on Deep-learning PCA Example

March 23, 2015 by Rob Farber

The PGI OpenACC compiler outperformed a deep-learning-based PCA (Principal Components Analysis) example compiled with NVIDIA's CUDA 7.0 nvcc by 200 GF/s on a K40c, using an ILP (Instruction Level Parallelism) loop structure taught in the TechEnablement classes and the forthcoming Farber OpenACC book. PCA is an important data analysis tool used by data scientists. Sign up for … [Read more...]

GCC5 Release Candidate in Early April With OpenACC, Cilk, and OpenMP4 Offload!

March 20, 2015 by Rob Farber

Richard Biener posted a message to gcc.gnu.org saying to expect a GCC 5 release candidate in the first week of April. GCC 5 supports OpenACC, Cilk Plus, and OpenMP 4.0 offload. "We've come a long way towards the release criteria of zero P1 bugs. There are still a few remaining P1s though and we are targeting for a GCC 5 release candidate in the first week of April (given those … [Read more...]

PathScale Reports Highest SPEC-ACCEL OpenACC Results Using AMD s9150 GPU

March 18, 2015 by Rob Farber

PathScale, in collaboration with Cirrascale, posted the fastest OpenACC SPEC-ACCEL results to date using an AMD FirePro S9150 GPU. ENZO 2015 is the first major PathScale release to support AMD FirePro devices, and it demonstrates the "pragmatic portability" of OpenACC: a single source base compiles and runs on a variety of hardware platforms. The ENZO 2015 compiler … [Read more...]

NVIDIA GTC 2015 keynote – Near-term Roadmap is Deep-Learning

March 17, 2015 by Rob Farber

The GTC 2015 keynote by NVIDIA CEO Jen-Hsun Huang showed an intense focus on Deep-Learning through four topic areas: (1) the official Titan X GPU announcement, which noted that the 7 TF/s SP and 0.2 TF/s DP device has an excellent arithmetic performance mix for deep-learning, (2) the NVIDIA DIGITS (Deep GPU Training System for Data Scientists) software and custom NVIDIA-built … [Read more...]

Intel Xeon Phi Benefits Commercial Engineering Simulation Code

March 12, 2015 by Rob Farber

ANSYS and Intel worked closely together to optimize the first commercial engineering simulation package for the multi-core Intel® Xeon® processor E5-2600 v2 and v3 families and the many-core Intel® Xeon Phi™ coprocessor. Results indicate a 3.1x speedup after upgrading from a Xeon V2 to a V3 solution, and a further 1.7x speedup when using an Intel® Xeon Phi™ 7120 coprocessor (or total … [Read more...]

ACM Paper Observes FPGA, GPU, CPU Energy Efficiency Hierarchy

March 10, 2015 by Rob Farber

Sparsh Mittal and Jeff Vetter survey the energy efficiency of GPUs relative to FPGAs and CPUs in their Jan. 2015 ACM Computing Surveys (CSUR) paper, "A Survey of Methods for Analyzing and Improving GPU Energy Efficiency," in which they observe a general energy-efficiency hierarchy: "it is clear that although for majority of works, FPGAs are … [Read more...]

Plan Ahead – SC15 Housing Site Open For Room Reservations!

March 9, 2015 by Rob Farber

The SC15 housing site is now open through October. The conference takes place Nov 15-20 in Austin, Texas.

IMPORTANT DATES AND DEADLINES

  • March 2, 2015 – Hotel reservation system opens
  • September 20, 2015 – Last day for submitting occupancy lists for exhibitor room blocks
  • October 16, 2015 – Last day for making hotel reservations
  • October 16, 2015 – Last day for canceling … [Read more...]

NVIDIA Titan X Powers Games and Virtual Reality

March 5, 2015 by Rob Farber

NVIDIA CEO Jen-Hsun Huang announced NVIDIA's latest GPU, the Titan X, in a surprise appearance at the 2015 Game Developers Conference. Huang claims it is the most powerful GPU on the planet. The announcement followed a presentation by Epic Games co-founder Tim Sweeney about the convergence of photorealistic imagery, film, video games, architecture, industrial design, and … [Read more...]

Concise Comparison Adds OpenMP Versus OpenACC To CUDA Versus OpenCL Debates

March 4, 2015 by Rob Farber

The debate over OpenMP versus OpenACC for manycore and heterogeneous computing is starting to heat up. Michael Wong (CEO of OpenMP Corp), Barbara Chapman (Univ. Houston), and Yonghong Yan (Assistant Prof. Univ. Oakland and OpenMP ARB representative) have written a concise comparative article on HPCWire: "A Comparison of Heterogeneous and Manycore … [Read more...]

© 2026 · techenablement.com