The newest book by James Reinders and Jim Jeffers, “High Performance Parallelism Pearls” distills the experience of sixty-nine HPC experts into twenty-eight chapters designed to teach the world about the performance capabilities of the massively-parallel Intel® Xeon Phi™ family of products. Source code for numerous working examples selected for their educational content, … [Read more...]
Analysis of Phylogenetic Tree Code Shows OpenACC Within 10% Of Native CUDA
The paper, "Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison" by University of Barcelona and Intel Barcelona Research Center claim near-CUDA performance for OpenACC - within 10% - that can be achieved when accelerating a Phylogenetic Tree code based on the popular MrBayes Markov chain Monte Carlo (MCMC) package. Comparing with state-of-art … [Read more...]
MSI WS60 Mobile Workstation – Awesome CUDA-Capable, Linux, and Window Mobility
The recently released MSI mobile workstation (WS60 20 OJ 3K-004US) provides a no-compromise laptop for those who wish a thin-and-light desktop replacement at work and when traveling. This device is now my work machine of choice (that relegated a wonderful HP Z800 workstation to a remotely accessed resource). I have found that the WS60 provides a well-designed and … [Read more...]
Sony Demos Project Morpheus VR Headset on the PS4
Sony had demos for its Project Morpheus VR headset on the PS4 at the Tokyo Game Show. Competition in the VR headset space is getting intense with Sony's Project Morpheus is "85 percent" complete while Oculus is showing off it's Crescent Bay prototype. The current version has a 90 degree Field Of View (FOV) but this is likely to change. (Occulus has a 110 degree FOV). For … [Read more...]
Sony’s AR SmartEyeglass SDK Available Now – Purchase March 2015
Sony is releasing a Google Glass competitor in March 2014. Start developing augmented reality apps for Android now with the Sony SmartEyeglass SDK (Developer Preview) with emulator that extends the Sony SmartExtension framework by defining a set of Java classes and data structures that enable Android app to interact with and control the SmartEyeglass device. Sample projects … [Read more...]
Free Book From Altera Includes OpenCL on FPGA Section
Altera has just released a free ebook “FPGAs for dummies” simply by registering to get a copy. The OpenCL sections are very high-level, composed mainly of a single block-diagram and a few paragraphs of discussion. Also included is a Black-Scholes Monte Carlo case study that includes lines of code, performance, and performance per watt comparisons against an unspecified CPU and … [Read more...]
AI Researchers Talk Up Benefits of GPUs for Deep Learning
With the ability to deliver TF/s to PF/s of performance even on nonlinear problems, deep-learning researchers who participated in the ImageNet competition are espousing the charms of GPU computing technology. At the European Conference on Computer Vision (ECCV), held last week in Zurich, teams from Adobe, U.C. Berkeley, the National University of Singapore, Oxford University … [Read more...]
GPUdb Filters And Displays Billions Of Tweets In Seconds On Parallel Devices
Twitter is a fantastic news source and provider of billions of noisy, needle-in-the-haystack tweets to confound data-scientists and delight analysts plus commercial marketing efforts. Interactivity with billions of data items is key to developing, understanding, and validating analysis. (Validation is emphasized as Google was recently fingered by IEEE Spectrum for biased … [Read more...]
Latest Intel SDE Emulates New ISA Instructions For Knights Landing
Intel has released a new version of the Intel SDE (Software Development Emulator) so that customers can start working with upcoming instruction set extensions like AVX-512 for Knights Landing. The SDE can be downloaded after accepting a user agreement and used on Windows, Linux, and OS. It can also be used with the GNU gcc. The current version is 7.2 released on July 29, … [Read more...]
FabricEngine Leverages Python and LLVM For Digital Content Creation Everywhere
A company called FabricEngine (http://fabricengine.com) is leveraging the power of LLVM and Python to compile optimized code for Digital Content Creation (DCC) that can run on multicore and GPUs. Products like these demonstrate the acceptance of Python and LLVM into commercial/enterprise high-performance HPC applications just like PyFR and deep-learning. Integrated with … [Read more...]









