Media Summary: OFI 2.0 Update” Jianxin Xiong, Intel 9:00-9:30 am PT. "Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters" Hari ... "High Performance & Scalable MPI library over Broadcom RoCE" Mustafa Abduljabbar, The Ohio State University; Hemal Shah, ...
Ofa Virtual Workshop 2024 Day - Detailed Analysis & Overview
OFI 2.0 Update” Jianxin Xiong, Intel 9:00-9:30 am PT. "Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters" Hari ... "High Performance & Scalable MPI library over Broadcom RoCE" Mustafa Abduljabbar, The Ohio State University; Hemal Shah, ... "OFI Integrated Shared Memory Offload" Speakers: Alexia Ingerson, Intel; Shi Jin, Amazon; and Amir Shehata, Oak Ridge National ... "Scaling Large Language Model Training using Hybrid GPU-based Compression in MVAPICH" Speakers: Aamir Shafi and Lang ... Status of OpenFabrics Interfaces (OFI) Support in MPICH” Yanfei Guo, Argonne National Laboratory 9:45-10:15 am PT.
"An Integrated Deep Reinforcement Learning Agent for Sunfish and HPC Workload Manager Composable Disaggregated ... Opening Remarks Phil Cayton, Intel 8:00-8:05 am PT. "Managing Composable Disaggregated Infrastructure With "Designing In-Network Computing Aware Reduction Collectives in MPI" Speakers: Dhabaleswar Panda and Bharath Ramesh, ... "RecoNIC: RDMA-enabled Compute Offloading on FPGA-based SmartNIC" Speaker: Guanwen Zhong, AMD 10:45-11:15 am PT. "Optimized All-to-all Connection Establishment for High-Performance MPI Libraries over InfiniBand" Mustafa Abduljabbar and ...
"How to setup RDMA CI using the FSDP cluster" and "How to do manual RDMA testing using the FSDP cluster" Doug Ledford, ... "System Composability Using CXL" Kurtis Bowman, CXL Consortium MWG Co-Chair 9:45-10:15 am PT.