• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / News

PGI OpenACC 14.7 Available

September 4, 2014 by Rob Farber Leave a Comment

Release 14.7 of the PGI compiler is now available for download. PGI Accelerator Features and Enhancements Support for CUDA managed data in CUDA Fortran Expanded OpenACC C++ Support Expanded OpenACC Features C global (extern) variables in OpenACC declare directives Fortran module variables in OpenACC declare directives Full support for the atomic … [Read more...]

Beowulf 20 Year Workshop To Also Honour Thomas Sterling

September 3, 2014 by Rob Farber Leave a Comment

The Center for Research in Extreme Scale Technologies (CREST) at the University of Indiana will organize a workshop to celebrate 20 years of Beowulf and to honour Professor Thomas Sterling for his 65th birthday. The workshop will be held October 13-14, 2014.  Visit the 20 Years of Beowulf website to register or learn more. The Beowulf project introduced clusters build from … [Read more...]

SC14 Paid US and Foreign Travel Grants – Deadline Sept. 7

September 2, 2014 by Rob Farber Leave a Comment

SIGHPC offers travel grants to help undergraduate and graduate students attend the annual SC conference. Who's eligible:  Students at the undergraduate or early graduate level, enrolled in an accredited university in any country, and who are members of SIGHPC.  Applicants do not have to be making a presentation at SC, but they need to describe why they will benefit from … [Read more...]

Shared Memory is Simple on Intel Xeon Phi – supports STL!

September 2, 2014 by Rob Farber Leave a Comment

Shared memory on Intel Xeon Phi, in OpenCL, and CUDA (via managed memory) greatly simplifies programming by eliminating the need to explicitly define all data transfers between host and device memory. Once these implementations mature, it is likely they will become the standard API that programmers use to access data on both Intel Xeon Phi and GPUs. (They also naturally support … [Read more...]

Deep-Learning Behind Microsoft Cross-Language Real-Time Skype Translator

August 26, 2014 by Rob Farber Leave a Comment

Deep-learning lies at the heart of the Microsoft Skype translator,  a near real-time speech-to-speech machine translation tool that enables voice conversations between individuals speaking different languages. The claim is that a beta  version of Skype Translator will be released sometime in 2014. According to Microsoft, machine-learning of large datasets culled from social … [Read more...]

Intel SPMD Compiler (ISPC) for Xeon and Xeon Phi

August 25, 2014 by Rob Farber Leave a Comment

When #pragma SIMD is not enough, companies and projects such as Intel, Embree,  Dreamworks, Pixar, Autodesk, and SURFsara are looking to the freely available Intel SPMD Program Compiler (ispc) to achieve high vector performance on Xeon and Intel Xeon Phi devices.  ISPC is  LLVM-based and designed to exploit Intel SIMD architectures … [Read more...]

Robobrain.me

August 25, 2014 by Rob Farber Leave a Comment

The Cornell Robo Brain (http://robobrain.me) is a large-scale learning system that attempts to learn from publicly available Internet resources, computer simulations, and real-life robot trials.  The idea is to associate objects in images with text in order to correlate how they relate to human language, behavior and usage. Applications include prototyping for robotics … [Read more...]

CUDA 340.29 Driver Significantly Boosts GPU Performance (100s GF/s For Machine-Learning)

August 24, 2014 by Rob Farber Leave a Comment

Reports are now coming in about  performance boosts that are the result of the CUDA 6.5 production release. The Blender project reports faster rendering time with CUDA-6.5. As can be seen in the graphs below that report performance on the farbopt deep-learning teaching code, CUDA-6.5 with the NVIDIA 340.29 driver have increased performance on linear problems (PCA analysis from … [Read more...]

AAA-Battery Powered Water Splitter Poses Computational Opportunity

August 23, 2014 by Rob Farber Leave a Comment

Stanford Scientists have developed a water splitter that runs on an ordinary AAA battery and uses commonly available nickel and iron - no exotic materials required! The "save the planet" potential is clearly there as such a low-voltage, low-amperage conversion opens the potential for solar conversion of water to hydrogen. The discovery was made by Stanford graduate student Ming … [Read more...]

A Try-Before-You-Code Linear Regression Method Claims 32% Error Predicting GPU Perf

August 23, 2014 by Rob Farber Leave a Comment

The paper, "Estimating GPU Speedups for Programs Without Writing a Single Line of GPU Code" by Newsha Ardalani, Karthikeyan Sankaralingam, Xiaojin Zhu at the University of Wisconsin Madison claims a linear regression model can deliver  a robust "automated tool that programmers can use to estimate potential GPU speedup before writing any GPU code". According to their study a … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • AMD Firepro S9150 5 TF/s Single, 2.5 TF/s Double-Precision GPU and OpenCL 1.2 Support
  • Rob Farber
  • Heterogeneous MPI Optimization With ITAC
  • Implications of OS Jitter on Real-Time Applications for FPGAs, GPUs, and Intel
  • Preparing For Knights Landing - Stay in HBM Memory

Archives

© 2026 · techenablement.com