Presentation
Poster 83: ETL: Elastic Training Layer for Deep Learning
SessionResearch Posters Display
Event Type
Posters
Research Posters
TP
EX
EXH
TimeThursday, 21 November 20198:30am - 5pm
LocationE Concourse
DescriptionDue to the rising of deep learning, clusters for deep learning training are widely deployed in production. However, static task configuration and resource fragmentation problems in existing clusters result in low efficiency and poor quality of service. We propose ETL, an elastic training layer for deep learning, to help address them once for all. ETL adopts many novel mechanisms, such as lightweight and configurable report primitive and asynchronous, parallel and IO-free state replication, to achieve both high elasticity and efficiency. The evaluation demonstrates the low overhead and high efficiency of these mechanisms and reveals the advantages of elastic deep learning supported by ETL.
Archive