• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Archives for GPU

Paper Compares AMD, NVIDIA, Intel Xeon Phi CFD Turbulent Flow Mesh Performance Using OpenMP and OpenCL

August 10, 2014 by Rob Farber Leave a Comment

Timely for Siggraph 2014 (because animations use meshes) and food-for-thought for CFD (Computational Fluid Dynamics) research, the paper by A. Gorobets, F.X. Trias, R. Borrell, G. Oyarzún and A. Oliva, "Direct Numerical Simulation of Turbulent Flows with Parallel Algorithms for Various Computing Architectures" considers structured and unstructured meshes for incompressible … [Read more...]

OpenACC OLCF Hackathon – Applications Accepted Aug 15 – 29

August 9, 2014 by Rob Farber Leave a Comment

Oakridge National Laboratory announces the OLCF’s first ever hackathon will take place October 27-31, 2014. Apply between 15 - 29. Contact Fernanda Foertter. So what is this all about? Teams of developers will attend a 5-day intensive hands-on workshop guided by OpenACC mentors. These experts come from labs, universities and vendors and will guide your team from start to … [Read more...]

Breadth-First Graph Search Uses 2D Domain Decomposition – 400 GTEPS on 4096 GPUs

August 9, 2014 by Rob Farber Leave a Comment

Parallel Breadth-First Search is a standard benchmark and the basis of many other graph algorithms. The challenge lies in partitioning the graph across multiple nodes in a cluster while avoiding load-imbalance and communications delays. The authors of the paper, "Parallel Breadth First Search on the Kepler Architecture" utilize an interesting 2D decomposition of the graph … [Read more...]

Deep-learning Webinar Demonstrates Handwriting Recognition and Efforts to Teach Drone to Fly Down a Wooded Path

August 8, 2014 by Rob Farber Leave a Comment

Deep-learning is a computational expensive but rewarding method to solve many complex pattern recognition problems. The recent NVIDIA webinar by Dan Claudiu Cireșan, Senior Researcher at the Dalle Molle Institute for Artificial Intelligence (IDSIA) in Switzerland highlighted some of the capabilities of deep-learning for image recognition problems such as handwriting recognition … [Read more...]

SC14 – Fast Hybrid GPU Betweenness Centrality Code Achieves Nearly Ideal Scaling to 192 GPUs

August 8, 2014 by Rob Farber Leave a Comment

Don't miss the SC14 presentation Wednesday Nov. 19 in room 388-89-90, for the presentation of the McLaughlin and Bader paper "Scalable and High Performance Betweenness Centrality on the GPU". The authors report nearly ideal scaling to 192 GPUs and billions of edges traversed per step (GTEP). The paper can be downloaded here and their software can be  downloaded from … [Read more...]

Intel Paper Detailing the Gen 7.5 GPU Architecture – Attention OpenCL Programmers !

August 7, 2014 by Rob Farber Leave a Comment

Intel has labeled the Haswell graphic core as Gen 7.5 and the Ivy Bridge GPU as Gen.7. The latest Gen 7.5 Architecture paper, "The Compute Architecture of Intel® Processor Graphics Gen7.5" has now been released by Intel.  This paper is written for  those who wish to know the "architecture characteristics relevant to running compute applications on Intel® Processor … [Read more...]

SC14 WACCPD Workshop on Accelerator Programming Using Directives

August 7, 2014 by Rob Farber Leave a Comment

Call For Papers for the SC14 Workshop on Accelerator Programming Using Directives (WACCPD), which  brings together leading researchers and software designers at the forefront of the application of high-level directives to program accelerator-based architectures. Using directives improve productivity, and program portability with minimal changes to the applications while … [Read more...]

University of Houston Call To Participate in Oil and Gas Workshop Oct 20

August 7, 2014 by Rob Farber Leave a Comment

The University of Houston Center for Advanced Computing & Data Systems is reaching out to Oil and Gas domain scientists/researchers who are keen to hear about alternative high-level programming models used to port seismic codes to use accelerators. High-level models such as OpenACC/OpenMP proposes to be more portable and more vendor neutral, and may be complementary to … [Read more...]

Robots that See Through Solid Walls Using Wi-Fi

August 6, 2014 by Rob Farber Leave a Comment

For the past few years Yasamin Mostofi, professor of electrical and computer engineering at UCSB and her team have been busy realizing the dream of X-ray vision.by enabling robots to see objects and humans behind thick walls through the use of radio frequency signals. The project page is here. Dr Mostofi received the Presidential Early Career Award in … [Read more...]

AMD Firepro S9150 5 TF/s Single, 2.5 TF/s Double-Precision GPU and OpenCL 1.2 Support

August 6, 2014 by Rob Farber Leave a Comment

AMD just announced the Firepro S9150 GPU, the first to break 2.0 TF/s double-precision performance. No word yet on pricing or availability. Specifications: Cooling/Power/Form Factor Max Power: 235W Bus Interface: PCIe® x16 Slots: Two Form Factor: Full height/ Full length Cooling: Passive heat sink Memory Size/Type: 16GB GDDR5 Interface: … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Fine-Tuning Vectorization and Memory Traffic on Intel Xeon Phi Coprocessors
  • Intel Xeon Phi Study Guide
  • OpenACC Adoption Continues to Gain Momentum in 2016
  • Dynamic Load Balancing using OpenMP 4.0
  • Register For Lustre's Brent Gorda Parallel Storage and Big Data HP-Cast

Archives

© 2026 · techenablement.com