Register early for Supercomputing 2014 in New Orleans and save up to $275. View the Technical Program online (and register for our tutorial!) The Technical Program fee includes admission to all conference sessions, exhibits, the Monday night Exhibits opening event, Thursday night event, and one copy of the SC14 proceedings. Click here to view the grid showing access to … [Read more...]
Part 2: No Idle Time CUDA Task Parallelism Across Eight GPUs
Part 1 in this tutorial series showed that task-based parallelism using concurrent kernels can accelerate applications simply by plugging more GPUs into a system - just as the GPU strong scaling execution model can accelerate applications simply by installing a newer GPU containing more SMX (Streaming Multiprocessors). No recompilation required! NVIDIA nvvp timelines show very … [Read more...]
Pre-order Your NVIDIA Shield Tablet Now! (available July 29 in US)
The Tegra K1-powered NVIDIA Shield Tablet is here - available July 29 in the US! Pre-order here! For more information on the shield tablet, or go to shield.nvidia.com. http://youtu.be/VohrddwVQqg Those who are adventurous, Caonical has a dual-boot mode that will allow Ubuntu to run on tablets, and potentially other devices bringing full Linux and CUDA … [Read more...]
NVIDIA Shield 2 will be a tablet (with stylus) as well as a gaming device!
It's looks like it is true (and it can now be ordered), the K1-powered Shield 2 Gaming console will also be an 8" tablet! What a cool idea! This will be a CUDA/Gamers/Tablet/(cellphone?) honey of a device. (July 22: It's available now!). NVIDIA has created a tweet-based game "Ultimate Quest" in what is believed to be a promotion for the Shield 2. Click on the image below … [Read more...]
GCC 4.9.1 Adds OpenMP 4.0 Fortran Support for Multicore
Jakub Jelinek <jakub at redhat dot com> posted on July 16th that the GCC 4.9.1 release now supports OpenMP 4.0 in Fortran (as well as C/C++). This is great news for multi-core programmers. GCC looks to be on-track to become the opensource platform that both Intel Xeon Phi and GPU programmers can use to to test pragma based programming. As reported on techEnablement.com, … [Read more...]
Extreme Test – NVIDIA Shield Portals Two 9-year Old Boys Through 20-hour Road Trip
The Tegra 4 NVIDIA Shield is a powerful, portable handheld gaming console that can play demanding first-person video games like Valve's Portal with long battery life. A recent extreme, real-world test with two active nine-year old boys on a cumulative twenty-hour round-trip-road trip demonstrated in-car happiness by all occupants. The combination of an Xbox like controller … [Read more...]
South Africa Team Wins Their Second Student Supercomputing Competition At ISC14
Congratulations to the South African students who won their second ISC14 Student Supercomputing Competition! In 2013 the South African students were considered the underdog due to their youth and lack of competitive experience. This year the team from the South African Centre for High Performance Computing won the overall 2014 competition. To win, students have to build a … [Read more...]
Farber to Teach All-Day Tutorial At Supercomputing Nov 16 2014
Supercomputing 2014 recently approved my proposal for an all-day class "From 'Hello World' to Exascale Using x86, GPUs and Intel Xeon Phi Coprocessors" (tut106s1), at The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC14). I hope to see you on Sunday November 16, 2014 in New Orleans,! Abstract Both GPUs and Intel Xeon Phi … [Read more...]
Intel Knights Landing: Claimed 4x An NVIDIA K40 (on some applications)
Joe Curley (Director of Marketing in the Technical Computing Group at Intel Corporation) just completed his webinar on BrightTalk, "The Faster Path to Discovery: New Details on the Intel® Xeon Phi™ Product Family" that disclosed new details on the upcoming Knights Landing massively=parallel chip including a claimed 4x performance improvement over the NVIDIA K40 on some … [Read more...]
ARM64 with CUDA Early Access Boards Now Available
The X-Gene™ ARM64 and CUDA Development Platform for High Performance Computing (HPC) is now available to order from Cirrascale, the US Applied Micro (APM) integration partner. This board represents an entry point for ARM64 + CUDA into the enterprise markets as well. The X-Gene™ board features custom high-performance ARM v8 processor based on an advanced 64-bit ARM … [Read more...]









