• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / News

Optimizing Gather/Scatter Patterns On Intel Xeon Phi

October 14, 2014 by Rob Farber Leave a Comment

Many modern microarchitectures rely on single-instruction multiple-data (SIMD) execution to provide high compute capabilities in an energy efficient manner. Such microarchitectures including those employed by the most recent Intel Xeon processors and Intel Xeon Phi coprocessors are optimized and/or better suited to dealing with contiguous loads and stores than non-contiguous … [Read more...]

Deep-­Learning And Numerical Optimization

October 13, 2014 by Rob Farber Leave a Comment

The massively parallel mapping and code described in this chapter is generic and can be applied to a broad spectrum of numerical optimization and machine-learning algorithms ranging from neural networks to support vector machines to expectation maximization and independent components analysis. Many of these techniques are heavily used in lucrative data-mining and social media … [Read more...]

Seattle LibreFest Oct. 26 2014 In-Person And Via IRC

October 12, 2014 by Rob Farber Leave a Comment

Representatives of the LibreOffice QA and Developer Teams will be on hand at the UW Communications Building, Rm 242, Seattle, WA to teach interested users how to get involved with the LibreOffice community. They will be providing food and t-shirts for all LibreFest participants, so grab your laptop, roll up your sleeves, and  dig in! Can't make it to Seattle?  ... then  join … [Read more...]

HTML5 Progress – Confirmed Netflix Works With Chrome And Ubuntu 14.04LTS

October 11, 2014 by Rob Farber Leave a Comment

Ubuntu 14.04 users can now simply load Netflix and start viewing - no beta builds, wine emulation or other work-arounds. Hopefully other services like Amazon Instant Video will follow suite. … [Read more...]

Intel Xeon Phi Provides Cambridge 30x Speedup in Production COSMOS WALLS Code

October 10, 2014 by Rob Farber Leave a Comment

Professor Paul Shellard, the COSMOS Director at Cambridge University reports a 30x speedup of the heavily utilized production WALLS code and he notes "Our expectation is that all our cosmological field theory codes, like WALLS, will have similarly large speed-ups when optimized and ported to Xeon Phi."  Currently the project is transferring a larger portion of the CMB analysis … [Read more...]

Parallel Evaluation Of Fault Tree Expressions

October 10, 2014 by Rob Farber Leave a Comment

Readers are guided through a progression from a scalar fault tree code to one mapped effectively to Intel Xeon Phi with the open-source ispc (Intel SPMD Program Compiler). Fault trees express failure relationships between systems using Boolean logic to evaluate the vulnerability of systems based on component reliability, system redundancy, physical protection, and other — … [Read more...]

Plesiochronous (Loosely Synchronous) Phasing Barriers To Avoid Thread Inefficiencies

October 9, 2014 by Rob Farber Leave a Comment

Jim Dempsey bests expert Intel programmers by 40% - 50% simply by using a little bit of ingenuity, along with a slightly different programming technique. He notes that, "a substantial portion of previously lost thread barrier wait time" can be recovered simply by using loosely synchronous (plesiochronous) barriers instead of strictly synchronous barriers.  Jim points out that, … [Read more...]

Micron Automata Processor SDK Now Available – Includes Online Demo!

October 8, 2014 by Rob Farber Leave a Comment

Click here to try a web-enabled intro simulation or visit micronautomata.com to sign up for a full preview of the Micron Automata Processor SDK (Software Development Kit) that includes a visual development environment, compiler, design rules checker, regular expression to automata generator and AP simulator. The SDK is also available through membership with the Center for … [Read more...]

Optimizing for Reacting Navier‐Stokes Equations

October 8, 2014 by Rob Farber Leave a Comment

Antonio Valles and Weiqun Zhang note the optimizations discussed in their High Performance Parallelism Pearls chapter that, "significantly improved concurrency on both Intel Xeon Phi coprocessors and Intel Xeon processors" by transforming a fine-grain thread parallel approach to a more coarse-grain, memory allocation considerate approach plus improving vectorization. They … [Read more...]

Microsoft Roomalive Creates Augmented Reality Room At Home

October 7, 2014 by Rob Farber Leave a Comment

Ditch the AR goggles! In their paper, "RoomAlive: Magical Experiences Enabled by Scalable, Adaptive Projector-Camera Units" Microsoft researchers discuss a proof-of-concept prototype that transforms any room into an immersive, augmented entertainment experience.  The basic building blocks are projector-depth camera units that create a unified model of the room with no … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Intel Xeon Phi Study Guide
  • Netflix Provides 4K Content - Good News for Low-Power, High-Flop GPU Designs
  • AMD Firepro S9150 5 TF/s Single, 2.5 TF/s Double-Precision GPU and OpenCL 1.2 Support
  • Rob Farber
  • Heterogeneous MPI Optimization With ITAC

Archives

© 2026 · techenablement.com