[2020 VLDB] PyTorch Distributed: Experiences on Accelerating Data Parallel Training
One-line Summary
Paper Structure Outline
Background & Motivation
System Design
API
Gradient Reduction


Collective Communication
Implementation Details
Evaluation






Discussion
New Vocabulary
Links
Previous[2020 EuroSys] AlloX: Compute Allocation in Hybrid ClustersNext[2020 NetAI] Is Network the Bottleneck of Distributed Training?
Last updated