My favorites
-
Text-to-Image âĒ Updated âĒ 10.4k âĒ âĒ 4.87k -
meta-llama/Llama-3.3-70B-Instruct
Text Generation âĒ 71B âĒ Updated âĒ 410k âĒ âĒ 2.59k -
NyxKrage/Microsoft_Phi-4
15B âĒ Updated âĒ 31 âĒ 55 -
cnfusion/Microsoft_Phi-4-mlx-8bit
Text Generation âĒ 4B âĒ Updated âĒ 28 -
cnfusion/Microsoft_Phi-4-mlx-4bit
Text Generation âĒ 2B âĒ Updated âĒ 20 -
Imagen 3
Paper âĒ 2408.07009 âĒ Published âĒ 62 -
deepseek-ai/DeepSeek-V3
Text Generation âĒ 685B âĒ Updated âĒ 706k âĒ âĒ 4k -
HKUSTAudio/xcodec2
Audio-to-Audio âĒ 0.8B âĒ Updated âĒ 23.1k âĒ 91 -
deepseek-ai/DeepSeek-R1
Text Generation âĒ 685B âĒ Updated âĒ 1.21M âĒ âĒ 12.9k -
MagicQuill
ðŠķ2.19kGenerate edited images using scribble inputs
-
deepseek-ai/Janus-Pro-7B
Any-to-Any âĒ Updated âĒ 51.7k âĒ 3.53k -
InstantX/InstantIR
Image-to-Image âĒ Updated âĒ 2 âĒ 180 -
hexgrad/Kokoro-82M
Text-to-Speech âĒ Updated âĒ 3.94M âĒ âĒ 5.37k -
microsoft/OmniParser-v2.0
Updated âĒ 800 âĒ 1.3k -
Magic 1-For-1: Generating One Minute Video Clips within One Minute
Paper âĒ 2502.07701 âĒ Published âĒ 35 -
MiniMaxAI/MiniMax-Text-01
Text Generation âĒ 456B âĒ Updated âĒ 1.48k âĒ 650 -
Chat With Janus-Pro-7B
ð2.01kA unified multimodal understanding and generation model.
-
Wan2.1
ðŧ1.93kWan: Open and Advanced Large-Scale Video Generative Models
-
FLUX LoRA DLC
ðĨģ1.16k270+ Impressive LoRAs for Flux.1
-
Alpha-VLLM/Lumina-Image-2.0
Text-to-Image âĒ Updated âĒ 1.53k âĒ âĒ 347 -
Lumina Image 2.0
ðžGenerate high-quality images from text prompts
-
InstructPix2Pix
ð1.54kTransform images based on text instructions
-
tencent/Tencent-Hunyuan-Large
Text Generation âĒ Updated âĒ 271 âĒ 615 -
deepseek-ai/DeepSeek-V3-0324
Text Generation âĒ 685B âĒ Updated âĒ 142k âĒ âĒ 3.08k -
InfiniteYou-FLUX
ðļ1.09kFlexible Photo Recrafting While Preserving Your Identity
-
OmniGen
ðž701Image generator/identifier/reposer
-
simplescaling/s1.1-32B
Text Generation âĒ 33B âĒ Updated âĒ 2.33k âĒ âĒ 96 -
meta-llama/Llama-4-Scout-17B-16E-Instruct
Any-to-Any âĒ 109B âĒ Updated âĒ 209k âĒ 1.15k -
stepfun-ai/stepvideo-ti2v
Image-to-Video âĒ Updated âĒ 24 âĒ 83 -
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model
Paper âĒ 2503.07703 âĒ Published âĒ 37 -
Dia 1.6B
ðŊ1.72kGenerate realistic dialogue from a script, using Dia!
-
Parakeet-TDT-0.6b-V2
Â449Transcribe audio to text with timestamps
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition âĒ Updated âĒ 879k âĒ 1.38k -
Lightricks/LTX-Video
Image-to-Video âĒ Updated âĒ 270k âĒ âĒ 2.06k -
Pyramid Flow
âą673Generate videos from text prompts and optional images
-
DreamO
ðĻ597A Unified Framework for Image Customization
-
Image Arena Leaderboard
ð554Image Generation and Image Editing Arena & Leaderboard
-
BAGEL
ð215Demo for BAGEL
-
HiDream-ai/HiDream-I1-Full
Text-to-Image âĒ Updated âĒ 22.5k âĒ âĒ 980 -
Finegrain Image Enhancer
ðž1.91kClarity AI Upscaler Reproduction
-
UGI Leaderboard
ðĒ1.3kUncensored General Intelligence Leaderboard
-
deepseek-ai/DeepSeek-R1-0528
Text Generation âĒ 685B âĒ Updated âĒ 423k âĒ âĒ 2.39k -
tencent/HunyuanVideo-Avatar
Image-to-Video âĒ Updated âĒ 312 -
ResembleAI/chatterbox
Text-to-Speech âĒ Updated âĒ 701k âĒ âĒ 1.31k -
maya-research/Veena
Text-to-Speech âĒ 4B âĒ Updated âĒ 2.64k âĒ 214 -
Sesame CSM
ðą852Conversational speech generation
-
Meigen MultiTalk
ð264Audio-Driven Multi-Person Conversational Video Generation
-
Wan-AI/Wan2.2-TI2V-5B
Text-to-Video âĒ Updated âĒ 3.33k âĒ âĒ 453 -
Wan-AI/Wan2.2-T2V-A14B
Text-to-Video âĒ Updated âĒ 5.56k âĒ âĒ 366 -
FLUX.1 Krea Dev
ð363Generate images from text prompts
-
BAAI/MTVCraft
Text-to-Video âĒ Updated âĒ 147 âĒ 36 -
facebook/MobileLLM-R1-950M
Text Generation âĒ 0.9B âĒ Updated âĒ 937 âĒ 352 -
lodestones/Chroma1-HD
Text-to-Image âĒ Updated âĒ 13k âĒ 298