[2018 NIPS] Dynamic Space-Time Scheduling for GPU Inference
Summary
Background & Motivation
Approach
Utilization
Performance (throughput/latency)
Predictability/Performance Isolation
Design & Implementation

Links & References
Previous[2018 SIGCOMM] Chameleon: Scalable Adaptation of Video Analytics via Temporal and Cross-camera ...Next[2019 ATC] Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads
Last updated