• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Featured news / Intel Broadwell Compute Gen8 GPU Architecture

Intel Broadwell Compute Gen8 GPU Architecture

September 13, 2014 by Rob Farber Leave a Comment

Attention OpenCL programmers! Intel has released a detailed description of the Gen8 Broadwell GPU compute architecture, “The Compute Architecture of Intel® Processor Graphics Gen8“. Broadwell is a 14nm die shrink of Intel’s microarchitecture incorporating significant reworking of the Intel HD 5000-series (Iris) Gen 7.5 GPU found in Haswell including: (1)  throughput for 32-bit integer computation has doubled, (2)  native 16-bit floating-point support, (3)  the write bandwidth from GTI has doubled on some devices, (4) coherent shared virtual memory between CPU cores and Intel® Processor Graphics enabling seamless sharing of pointer rich data structures, (5)  increased L3 data cache capacity and improved local bandwidth between EUs  (Execution Units) and L3 data cache.

Another potential product design that instantiates the compute architecture of Intel® Processor Graphics  Gen8. This design is composed of two slices, of three subslices each for a total of 48 EUs.

Another potential product design that instantiates the compute architecture of Intel® Processor Graphics Gen8. This design is composed of two slices, of three subslices each for a total of 48 EUs. (From The Compute Architecture of Intel® Processor Graphics Gen8)

 

Share this:

  • Twitter

Filed Under: Featured news, News, News, OpenCL Tagged With: GPU, Intel, OpenCL

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Face It: AI Gets Personal to Make You Look Better!
  • CUDA Study Guide
  • Apache Spark Claims 10x to 100x Faster than Hadoop MapReduce
  • PyFR: A GPU-Accelerated Next-Generation Computational Fluid Dynamics Python Framework
  • Paper Compares AMD, NVIDIA, Intel Xeon Phi CFD Turbulent Flow Mesh Performance Using OpenMP and OpenCL

Archives

© 2023 · techenablement.com