ChatBench Datasets and Simulators (same prompt + fine-tuning set-up) from the ChatBench paper.
Papers
DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions
Spaces (24)
MageBench Leaderboard 🥇 • pinned • Running • 14 • This is a leaderboard for magebench
Phi 4 Mini 🌍 • Running • 46 • Demos for Phi-4-mini-instruct model
ThoughtsOrganizer 🔥 • Running • 37 • Transform your spoken thoughts into organized insights
TRELLIS 🏢 • Runtime error • Featured • 4.78k • Scalable and Versatile 3D Generation from images
PhineSpeechTranslator 👀 • Running • Featured • 34 • Break the language barrier
StoriesComeAlive 🏆 • Build error • 9 • Transform handwritten moments into spoken memories
Models (428)
microsoft/GRIN-MoE • Text Generation • 42B • 24.6k • 196
microsoft/Phi-3-mini-4k-instruct-onnx • Text Generation • 878 • 143
microsoft/Phi-4-mini-flash-reasoning • Text Generation • 4B • 3.18k • 254
microsoft/Phi-4-mini-instruct • Text Generation • 4B • 329k • 640
microsoft/Phi-4-mini-reasoning • Text Generation • 4B • 15.7k • 212
microsoft/Phi-3.5-vision-instruct • Image-Text-to-Text • 4B • 480k • 718
microsoft/Phi-3-vision-128k-instruct • Text Generation • 4B • 17.2k • 969
microsoft/Phi-3.5-MoE-instruct • Text Generation • 42B • 113k • 566
microsoft/Phi-3.5-mini-instruct • Text Generation • 4B • 391k • 934
microsoft/Phi-3-mini-4k-instruct-gguf • Text Generation • 4B • 94.2k • 546
Datasets (79)
microsoft/SIMORD • 72 • 4
microsoft/WebTailBench • Preview • 234 • 14
microsoft/SWE-Sharp-Bench • Viewer • 150 • 163
microsoft/sigmacollab • 70 • 1
microsoft/SYNUR • Preview • 64 • 4
microsoft/PatientSafetyBench • Viewer • 466 • 125 • 2
microsoft/claimify-dataset • Viewer • 6.49k • 69 • 5
microsoft/LiveDRBench • Viewer • 110 • 154 • 6
microsoft/CoSAlign-Train • Viewer • 125k • 107 • 2
microsoft/CoSApien • Viewer • 200 • 149 • 2