Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling.
AI & ML interests
None defined yet.
Recent Activity
Papers
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
Step-Audio-EditX
-
stepfun-ai/StepFun-Formalizer-7B
Text Generation • 8B • Updated • 87 • 6 -
stepfun-ai/StepFun-Formalizer-32B
Text Generation • 33B • Updated • 71 • 8 -
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion
Paper • 2508.04440 • Published • 9 -
stepfun-ai/StepFun-Formalizer-Training
Viewer • Updated • 188k • 154 • 3
Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS
Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling.
Step-Audio-EditX
-
stepfun-ai/StepFun-Formalizer-7B
Text Generation • 8B • Updated • 87 • 6 -
stepfun-ai/StepFun-Formalizer-32B
Text Generation • 33B • Updated • 71 • 8 -
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion
Paper • 2508.04440 • Published • 9 -
stepfun-ai/StepFun-Formalizer-Training
Viewer • Updated • 188k • 154 • 3
Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS