• Home
  • News
  • Tutorials
  • Analysis
  • About
  • Contact

TechEnablement

Education, Planning, Analysis, Code

  • CUDA
    • News
    • Tutorials
    • CUDA Study Guide
  • OpenACC
    • News
    • Tutorials
    • OpenACC Study Guide
  • Xeon Phi
    • News
    • Tutorials
    • Intel Xeon Phi Study Guide
  • OpenCL
    • News
    • Tutorials
    • OpenCL Study Guide
  • Web/Cloud
    • News
    • Tutorials
You are here: Home / Archives for CUDA

Metagenomic Sequence Clustering using CUDA-enabled GPUs

August 15, 2014 by admin Leave a Comment

There have been huge advances in DNA sequencing technologies in recent years; e.g. Illumina has just announced the HiSeq X system which can sequence human genomes at a cost of only $1000 per genome. Besides population-scale human genome sequencing another important application of sequencing technologies is environmental sequencing (so called metagenomics). Metagenomic studies … [Read more...]

Nvidia Talks About ARM64 and 64-bit K1 SoC

August 13, 2014 by Rob Farber Leave a Comment

The Hot Chips 2014 conference conveyed some hot information this week about Nvidia's 64-bit Tegra K1 -the first 64-bit ARM processor for Android devices that pairs the dual-core "Project Denver" CPU with Nvidia's 192-core Kepler GPU  (a ceepee geepee). The ARM-based Denver CPU was custom designed by Nvidia and is compatible with ARM's 64-bit ARMv8-A architecture. The chip is … [Read more...]

Acer K1-powered Chromebook $279 for Pre-Order – Dual-boot Linux?

August 11, 2014 by Rob Farber Leave a Comment

The Acer Chromebook 13, priced at $279,  is the first Chromebook to use an NVIDIA Tegra K1 processor. It offers customers fast graphics and a 13-hour battery  in an ultra-mobile form factor. Available for presale now at Amazon.com and BestBuy.com. Spec from Amazon.com: Screen Size 13.3 inches Max Screen Resolution 1366 x 768 pixels Processor 2.1 GHz … [Read more...]

GPU Accelerated Genetic Algorithm Can Plan Drone Missions

August 11, 2014 by Rob Farber Leave a Comment

Both the military and commercial organizations like Amazon will be interested in the GPU accelerated genetic algorithm (GA) proposed in the paper "UAV Path Planning with Parallel Genetic Algorithms on CUDA architecture" to create flight plans for drones. The authors noted "The experiments in this study show that the results reach up to 24 times speedup comparing to the CPU … [Read more...]

Accelerating the Traveling Salesman Problem with GPUs and Intel Xeon Phi

August 11, 2014 by Rob Farber Leave a Comment

The traveling salesman problem (TSP) is an important computer science optimization problem with numerous real-world applications. There is a huge body of literature on TSP solutions. Following are a few GPU and Intel Xeon Phi accelerated solutions. TSPgpu TSPGPU v2.1 is a GPU-accelerated heuristic solver for the symmetric Traveling Salesman Problem with up to 32767 … [Read more...]

Breadth-First Graph Search Uses 2D Domain Decomposition – 400 GTEPS on 4096 GPUs

August 9, 2014 by Rob Farber Leave a Comment

Parallel Breadth-First Search is a standard benchmark and the basis of many other graph algorithms. The challenge lies in partitioning the graph across multiple nodes in a cluster while avoiding load-imbalance and communications delays. The authors of the paper, "Parallel Breadth First Search on the Kepler Architecture" utilize an interesting 2D decomposition of the graph … [Read more...]

Deep-learning Webinar Demonstrates Handwriting Recognition and Efforts to Teach Drone to Fly Down a Wooded Path

August 8, 2014 by Rob Farber Leave a Comment

Deep-learning is a computational expensive but rewarding method to solve many complex pattern recognition problems. The recent NVIDIA webinar by Dan Claudiu Cireșan, Senior Researcher at the Dalle Molle Institute for Artificial Intelligence (IDSIA) in Switzerland highlighted some of the capabilities of deep-learning for image recognition problems such as handwriting recognition … [Read more...]

SC14 – Fast Hybrid GPU Betweenness Centrality Code Achieves Nearly Ideal Scaling to 192 GPUs

August 8, 2014 by Rob Farber Leave a Comment

Don't miss the SC14 presentation Wednesday Nov. 19 in room 388-89-90, for the presentation of the McLaughlin and Bader paper "Scalable and High Performance Betweenness Centrality on the GPU". The authors report nearly ideal scaling to 192 GPUs and billions of edges traversed per step (GTEP). The paper can be downloaded here and their software can be  downloaded from … [Read more...]

Start Developing on the ARM-Powered NVIDIA SHIELD Tablet Now!

July 29, 2014 by Rob Farber Leave a Comment

NVIDIA posted a development kit for the NVIDIA Shield Tablet including  a banquet of languages (CUDA, Java, C/C++, ...) and developer tools. A complete list of features can be found at Tegra Android Development Pack and are available for download under the NVIDIA GameWorks Download Center. Develop on the latest and fastest mobile hardware available with the all-new NVIDIA® … [Read more...]

SC14 Technical Program and Registration – XSEDE/TACC Resources for Farber Tutorial

July 28, 2014 by Rob Farber Leave a Comment

Register early for Supercomputing 2014 in New Orleans and save up to $275. View the Technical Program online (and register for our tutorial!) The Technical Program fee includes  admission to all conference sessions, exhibits, the Monday night Exhibits opening event, Thursday night event, and one copy of the SC14 proceedings. Click here to view the grid showing access to … [Read more...]

« Previous Page
Next Page »

Tell us you were here

Recent Posts

Farewell to a Familiar HPC Friend

May 27, 2020 By Rob Farber Leave a Comment

TechEnablement Blog Sunset or Sunrise?

February 12, 2020 By admin Leave a Comment

The cornerstone is laid – NVIDIA acquires ARM

September 13, 2020 By Rob Farber Leave a Comment

Third-Party Use Cases Illustrate the Success of CPU-based Visualization

April 14, 2018 By admin Leave a Comment

More Tutorials

Learn how to program IBM’s ‘Deep-Learning’ SyNAPSE chip

February 5, 2016 By Rob Farber Leave a Comment

Free Intermediate-Level Deep-Learning Course by Google

January 27, 2016 By Rob Farber Leave a Comment

Intel tutorial shows how to view OpenCL assembly code

January 25, 2016 By Rob Farber Leave a Comment

More Posts from this Category

Top Posts & Pages

  • Part 1: OpenCL™ – Portable Parallelism
  • Pascal upgrade and Intel Xeon Phi at the Swiss National Supercomputing Centre
  • OpenACC Study Guide
  • ARM64 with CUDA Early Access Boards Now Available
  • Altera OpenCL Programmable FPGA Talks QPI, HMC, and 100G Optical Interconnect

Archives

© 2026 · techenablement.com