Article • What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2 • Aug 8 • 6
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 79 • 9
Article • Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 • Jul 1 • 130
Running • The Ultra-Scale Playbook 🌌 • 3.55k • The ultimate guide to training LLM on large GPU Clusters
Article • From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages • Feb 11 • 33
Article • From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning • Feb 4 • 16
gaochangkuan/whisper-large-v2_FT_model_checkpoints Automatic Speech Recognition • 2B • Updated Sep 29, 2024 • 4