John Stone (Research Staff, The Beckman Institute) points out that improvements in the AVX-512 instruction set in the Intel Xeon Phi (and latest generation Intel Xeon processors) can deliver significant performance improvements for some time consuming molecular visualization kernels over most existing Intel Xeon CPUs. Based on his recent results using the Intel Xeon Phi … [Read more...]
SURFsara Achieves Accuracy and Performance Breakthroughs for Both Deep Learning and Wide Network Training
Sponsored Content SURFsara posted the best accuracy and an under 40 minute training time on some popular deep learning architectures and data sets to establish new single-model state-of-the-art results using only general-purpose CPU-based hardware, as opposed to special accelerators. Specifically SURFsara reports under 40 minutes to train the ResNet50 model on the ImageNet-1k … [Read more...]
NVIDIA – “[Intel] Should Get Their Facts Straight” on Machine Learning Benchmarks
NVIDIA responds to the machine learning benchmark results presented by Intel at ISC'16, "It’s great that Intel is now working on deep learning. This is the most important computing revolution with the era of AI upon us and deep learning is too big to ignore. But they should get their facts straight." (Source: NVIDIA) NVIDIA notes further that, "While we can correct each of … [Read more...]
Faster Deep Learning with the Intel® Scalable System Framework: Next Generation Processors
Sponsored Post This is the first in a multi-part series on machine-learning that examines the impact of Intel Scalable System Framework (Intel® SSF) technology on the valuable HPC field of deep-learning. In this article we will focus on Intel® Xeon® and Intel® Xeon Phi™ processors. Follow-on articles will discuss other Intel SSF components including networking and storage with … [Read more...]
OpenACC Highlights at GTC 2016
GTC 2016, the upcoming GPU Technology Conference (GTC), which is being held in San Jose, CA April 4-7, 2016 contains several OpenACC talks of note. If you are attending, check out the following OpenACC related activities. If not, NVIDIA will put the videos online within a short period after the conference. S6524 - Enabling the Electronic Structure Program Gaussian on GPGPUs … [Read more...]
3D XPoint Memory Poised to Revolutionize System Memory and Storage
3D XPoint™ memory (pronounced 3d cross-point) is a new nonvolatile, memory technology co-developed by Intel and Micron that can deliver, “Up to 4x system memory capacity at significantly lower cost than DRAM”, a hundred times lower latency than today’s best performing NAND, and write cycle durability that is 1000x that of NAND. The 3D XPoint memory technology is … [Read more...]
PGI/NVIDIA To Develop LLVM-based Open-Source Fortran Compiler for NNSA and Possibly OpenPower
PGI (The Portland Group) is working with the NNSA to create an open-source Fortran compiler designed to integrate with the widely used LLVM compiler infrastructure. Recently, PGI announced comparable x86 multicore OpenACC performance as compared the the Intel compiler using OpenMP. This move ensure that PGI will have access to the latest and best information about the AVX-512 … [Read more...]
Los Alamos Orders 1000+ Qubit D-Wave 2X Quantum Computer
Los Alamos National Laboratory (LANL) is continuing an aggressive program to stretch the limits of current computational technology. LANL is a participant in the NNSA Trinity supercomputer procurement with hardware already installed. In addition, LANL has just announced the purchase of a D-Wave 2X Quantum Computer. The D-Wave 2X system is expected to be installed at Los Alamos … [Read more...]
Learn to Modernize your code at IDF
Code modernization is easily the most beneficial, significant, and long-lasting investment the HPC community can make to capitalize on current and future hardware investments. The upcoming Intel IDF15 sees itself as partnership between Intel and code developers, makers, and technologists to bring software into the age of lotsofcores and massive parallelism. Register today … [Read more...]
Intel Extends Barcelona Supercomputing Center
Intel extended it's research collaboration with the Barcelona Supercomputing Center (BSC) – one of four Intel exascale labs in Europe. Begun in 2011 and now extended to September 2017, the Intel-BSC work focuses on scalability issues as parallel applications move towards the exascale. BSC has been making HPC tools such as an instrumentation package (Extrae), a performance data … [Read more...]