Running 3.55k The Ultra-Scale Playbook π 3.55k The ultimate guide to training LLM on large GPU Clusters
shenzhi-wang/Gemma-2-9B-Chinese-Chat Text Generation β’ 9B β’ Updated Jul 4, 2024 β’ 611 β’ β’ 78