Back to Table of contents

Primeur weekly 2019-03-25

Special

Meeting the challenges of tomorrow - at scale ...

FocusCoE to support HPC Centres of Excellence for outreach to the broader European HPC community ...

Focus

University of Maribor and XENYA signed contract for an HPC system as part of the national research infrastructure project HPC - RIVR, readying Slovenia for EuroHPC participation ...

Exascale supercomputing

U.S. Department of Energy and Intel to deliver first exascale supercomputer ...

Argonne deepens the Department of Energy's footprint in high-impact computing ...

Crowd computing

NVIDIA announces Jetson Nano: $99 tiny, yet mighty NVIDIA CUDA-X AI computer that runs all AI models ...

Quantum computing

In a new quantum simulator, light behaves like a magnet ...

Long-distance quantum information exchange - success at the nanoscale ...

Middleware

NERSC and NVIDIA to partner on compiler development for Perlmutter system ...

Hardware

DDN to unveil transformative solutions for accelerating AI, analytics and Deep Learning at NVIDIA GTC 2019 ...

NVIDIA introduces new breed of high-performance workstations for millions of data scientists ...

Global computer companies announce NVIDIA-powered enterprise servers optimized for data science ...

Mellanox HDR 200G InfiniBand deep learning acceleration engines demonstrates two times higher performance for Artificial Intelligence (AI) platforms with NVIDIA ...

Altair OptiStruct runs up to 10x faster on NVIDIA GPUs ...

Pure Storage unveils NVIDIA-powered solutions for full spectrum of AI initiatives ...

Preferred Networks builds MN-2, a state-of-the-art supercomputer powered with NVIDIA GPUs ...

Supermicro offers end-to-end portfolio of NVIDIA GPU systems ...

Applications

H5 Data Centers completes a High-Performance Compute (HPC) suite in partnership with Forced Physics Data Center Technology ...

Supercomputers to help supercharge ceramic matrix composite manufacturing ...

A surprising, cascading earthquake ...

Brain-inspired AI inspires insights about the brain and vice versa ...

Supercomputer simulations shed light on how liquid drops combine ...

Novel MD simulation sheds light on mystery of hydrated electron's structure ...

Kicking neural network automation into high gear ...

DOE extends University PPPL contract ...

The Cloud

NVIDIA teams with Amazon Web Services to bring AI to millions of connected devices ...

Mellanox HDR 200G InfiniBand deep learning acceleration engines demonstrates two times higher performance for Artificial Intelligence (AI) platforms with NVIDIA

18 Mar 2019 Sunnyvale, Yokneam - Mellanox Technologies Ltd.'s HDR 200G InfiniBand with the "Scalable Hierarchical Aggregation and Reduction Protocol" (SHARP) technology has set new performance records, doubling deep learning operations performance. The combination of Mellanox In-Network Computing SHARP with NVIDIA V100 Tensor Core GPU technology and Collective Communications Library (NCCL) deliver leading efficiency and scalability to deep learning and artificial intelligence applications.

The combination of the state-of-the-art NVIDIA GPUs, Mellanox's InfiniBand, GPUDirect RDMA and NCCL to train neural networks has already become a de-facto standard when scaling out deep learning frameworks, such as Caffe, Caffe2, Chainer, MXNet, TensorFlow, and PyTorch. With the Mellanox SHARP technology and HDR InfiniBand, deep learning training's data aggregation operations can be offloaded and accelerated by the InfiniBand network, resulting in improving their performance by two times.

The joint effort with NVIDIA and testing performed in Mellanox's performance labs, using the Mellanox HDR InfiniBand Quantum connecting four system hosts, each with eight NVIDIA V100 Tensor Core GPUs with NVLink interconnect technology and a single ConnectX-6 HDR adapter per host, have achieved an effective reduction bandwidth of 19.6GB/s by integrating SHARP's native streaming aggregation capability with NVIDIA's latest NCCL 2.4 library, which now takes full advantage of the bi-directional bandwidth available from the Mellanox interconnect. This implementation is effectively two times higher bandwidth than NVIDIA's current tree-based implementation using the same hardware configuration.

In the more common setup for this configuration, four HCAs in each system host are used for balanced performance across a variety of workloads where the initial SHARP and NCCL results yielded an expected 70.3GB/s. For more densely populated GPU-based systems, like NVIDIA DGX-2, which houses 16 NVIDIA V100 Tensor Core GPUs with NVLink in each system node, the in-network capabilities and available bidirectional bandwidth of the Mellanox fabric can be fully leveraged.

"Our long-standing collaboration with NVIDIA has again delivered a robust solution that takes full advantage of the best-of-breed capabilities from Mellanox InfiniBand, including GPUDirect RDMA and now extending in-network computing to NCCL, which delivers two times better performance for AI", stated Gilad Shainer, Vice President of Marketing at Mellanox Technologies. "HDR InfiniBand in-network computing acceleration engines, including the SHARP technology, provide the highest performance and scalability for HPC and AI workloads."

"Mellanox solutions amplify NVIDIA's unmatched CUDA-X acceleration libraries using NCCL, our open source collective communication library", stated Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA. "Together, we offer solutions that ensure the most demanding AI applications in the data centre benefit from cutting-edge performance and scaling efficiency."
Source: Mellanox

Back to Table of contents

Primeur weekly 2019-03-25

Special

Meeting the challenges of tomorrow - at scale ...

FocusCoE to support HPC Centres of Excellence for outreach to the broader European HPC community ...

Focus

University of Maribor and XENYA signed contract for an HPC system as part of the national research infrastructure project HPC - RIVR, readying Slovenia for EuroHPC participation ...

Exascale supercomputing

U.S. Department of Energy and Intel to deliver first exascale supercomputer ...

Argonne deepens the Department of Energy's footprint in high-impact computing ...

Crowd computing

NVIDIA announces Jetson Nano: $99 tiny, yet mighty NVIDIA CUDA-X AI computer that runs all AI models ...

Quantum computing

In a new quantum simulator, light behaves like a magnet ...

Long-distance quantum information exchange - success at the nanoscale ...

Middleware

NERSC and NVIDIA to partner on compiler development for Perlmutter system ...

Hardware

DDN to unveil transformative solutions for accelerating AI, analytics and Deep Learning at NVIDIA GTC 2019 ...

NVIDIA introduces new breed of high-performance workstations for millions of data scientists ...

Global computer companies announce NVIDIA-powered enterprise servers optimized for data science ...

Mellanox HDR 200G InfiniBand deep learning acceleration engines demonstrates two times higher performance for Artificial Intelligence (AI) platforms with NVIDIA ...

Altair OptiStruct runs up to 10x faster on NVIDIA GPUs ...

Pure Storage unveils NVIDIA-powered solutions for full spectrum of AI initiatives ...

Preferred Networks builds MN-2, a state-of-the-art supercomputer powered with NVIDIA GPUs ...

Supermicro offers end-to-end portfolio of NVIDIA GPU systems ...

Applications

H5 Data Centers completes a High-Performance Compute (HPC) suite in partnership with Forced Physics Data Center Technology ...

Supercomputers to help supercharge ceramic matrix composite manufacturing ...

A surprising, cascading earthquake ...

Brain-inspired AI inspires insights about the brain and vice versa ...

Supercomputer simulations shed light on how liquid drops combine ...

Novel MD simulation sheds light on mystery of hydrated electron's structure ...

Kicking neural network automation into high gear ...

DOE extends University PPPL contract ...

The Cloud

NVIDIA teams with Amazon Web Services to bring AI to millions of connected devices ...