• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Archives for Rob Farber

PGI Compiled OpenACC ILP Loop Beats CUDA-7 by 200 GF/s on Deep-learning PCA Example

March 23, 2015 by Rob Farber Leave a Comment

The PGI OpenACC compiler beat the performance of a CUDA 7.0 NVIDIA nvcc compiled deep-learning based PCA (Principal Components Analysis) example by 200 GF/s on a K40c using an ILP (Instruction Level Parallelism) loop structure taught in the TechEnablement classes and forthcoming Farber OpenACC book. PCA is an important data analysis tool utilized by data scientists. Sign up for … [Read more...]

April 3 Application Deadline for Argonne Extreme Scale Training

March 21, 2015 by Rob Farber Leave a Comment

Doctoral students, postdocs, and computational scientists are encouraged to apply to the intensive Argonne Extreme Scale (ATPESC)  two-week course. ATPESC provides advanced training to 60 participants. Qualified applicants must: Have substantial experience in MPI and/or OpenMP programming Have used at least one HPC system for a resonable complex application Plan to … [Read more...]

GCC5 Release Candidate in Early April With OpenACC, Cilk, and OpenMP4 Offload!

March 20, 2015 by Rob Farber Leave a Comment

Richard Biener posted a message to gcc.gnu.org to expect a GCC 5 release candidate the first week of April. GCC5 supports OpenACC, Cilk Plus, and OpenMP 4.0 offload support. "We've come a long way towards the release criteria of zero P1 bugs. There are still a few remaining P1s though and we are targeting for a GCC 5 release candidate in the first week of April (given those … [Read more...]

CUDA 7 Released

March 20, 2015 by Rob Farber Leave a Comment

NVIDIA released CUDA 7 for all to use! Download here for Windows, Linux x86, Linux Power 8,  and MacOSX: Productivity and Performance Improvements C++11 support makes it easier for C++ developers to accelerate their applications Write less code with ‘auto’ and ‘lambda’, especially when using the Thrust template library. New cuSOLVER library of dense and sparse direct … [Read more...]

PathScale Reports Highest SPEC-ACCEL OpenACC Results Using AMD s9150 GPU

March 18, 2015 by Rob Farber Leave a Comment

PathScale in collaboration with Cirrascale posted the fastest OpenACC SPEC-ACCEL results to date using an AMD Firepro s9150 GPU. ENZO 2015 is the first major PathScale release to include support for AMD Firepro devices and demonstrates the "pragmatic portability" of OpenACC to compile and run on a variety of hardware platforms using a single source base. The ENZO 2015 compiler … [Read more...]

NVIDIA GTC 2015 keynote – Near-term Roadmap is Deep-Learning

March 17, 2015 by Rob Farber Leave a Comment

The GTC 2015 Keynote by NVIDIA CEO Jen-Hsun Huang showed an intense focus on Deep-Learning through four topic areas: (1) The official Titan X GPU announcement that noted the 7 TF/s SP and 0.2 TF/s DP device has an excellent arithmetic performance mix for deep-learning, (2) The NVIDIA DIGITS (Deep GPU Training System for Data Scientists) software and custom NVIDIA built … [Read more...]

Google’s Ingress Location Based Game Continues to Grow in Popularity

March 16, 2015 by Rob Farber Leave a Comment

Ingress, a multi-player GPS based location game created Google' Naintic labs continues to grow. If you like geocaching, chances are you will like Ingress. Gameplay consists of establishing "portals" at places of public art, landmarks, monuments, etc., and linking them to create virtual triangular fields over geographic areas. Progress in the game is measured by the number of … [Read more...]

Intel Xeon Phi Benefits Commercial Engineering Simulation Code

March 12, 2015 by Rob Farber Leave a Comment

ANSYS and Intel worked closely together to optimize the first commercial engineering simulation package multi-core Intel® Xeon® processor E5-2600 v2 and v3 families and the many-core Intel® Xeon Phi™ coprocessor.  Results indicate a 3.1x speedup after upgrading from a Xeon V2 to V3 solution, and a further 1.7x speedup when using an Intel® Xeon Phi™ 7120 coprocessor (or total … [Read more...]

ACM Paper Observes FPGA, GPU, CPU Energy Efficiency Hierarchy

March 10, 2015 by Rob Farber Leave a Comment

Sparsh Mittal and Jeff Vetter provide a survey of the comparative energy efficiency of GPUs relative to FPGAs and CPUs in their Jan. 2015 ACM Computing Surveys (CSUR) paper, "A Survey of Methods for Analyzing and Improving GPU Energy Efficiency" in which they observe a general energy efficiency hierarchy where, "it is clear that although for majority of works, FPGAs are … [Read more...]

Plan Ahead – SC15 Housing Site Open For Room Reservations!

March 9, 2015 by Rob Farber Leave a Comment

The SC15 housing site is now open through October. The conference takes place Nov 15-20 in Austin, Texas. IMPORTANT DATES AND DEADLINES March 2, 2015 – Hotel reservation system opens September 20, 2015 – Last day for submitting occupancy lists for exhibitor room blocks October 16, 2015 – Last day for making hotel reservations October 16, 2015 – Last day for canceling … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Learn how to program IBM's 'Deep-Learning' SyNAPSE chip
  • Run CUDA without Recompilation on x86, AMD GPUs, and Intel Xeon Phi with gpuOcelot
  • Lucid VR brings 3D cameras for Virtual Reality to the masses
  • OpenCL Haswell Iris 5200 Performance Results - 800 GF/s Peak Performance
  • Third-Party Use Cases Illustrate the Success of CPU-based Visualization

Archives

© 2026 · techenablement.com