This lecture discusses the optimization of hardware interconnects, focusing on RDMA and GPU integration. Topics include memory scalability, storage disaggregation, heterogeneous device interconnection, RDMA in GPU-enabled servers, and execution traits in heterogeneous servers. It also covers observability, load balancing, offloading tasks to remote nodes, reducing data transfer trips, and representing operations in plans. The experimental setup with CPU-only and GPU-only servers is explored, along with the performance comparison in SSB benchmarks. The lecture concludes with strategies for moving complexity to the GPU side of the interconnect and utilizing CPU as a data cache.
This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.
Watch on Mediaspace