Skip to content
Papyros
Archive
Graph
Builders
Notes
Join
The Archive
model-parallelism
2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism