xziayro

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

upvoted a paper 1 day ago

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

liked a model 2 days ago

DiffSynth-Studio/Z-Image-i2L

Organizations

upvoted a paper about 13 hours ago

JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

Paper • 2601.22143 • Published 1 day ago • 2

upvoted a paper 1 day ago

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

Paper • 2601.17950 • Published 5 days ago • 3

upvoted 4 papers 5 days ago

SAMTok: Representing Any Mask with Two Words

Paper • 2601.16093 • Published 8 days ago • 41

SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

Paper • 2601.16515 • Published 8 days ago • 15

VideoMaMa: Mask-Guided Video Matting via Generative Prior

Paper • 2601.14255 • Published 10 days ago • 13

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published 10 days ago • 73

upvoted an article 11 days ago

Article

Unlocking Longer Generation with Key-Value Cache Quantization

May 16, 2024

upvoted 3 papers 13 days ago

upvoted a paper 17 days ago

SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

Paper • 2601.08303 • Published 18 days ago • 16

upvoted a paper 18 days ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published 18 days ago • 51

upvoted 2 papers 19 days ago

GenCtrl -- A Formal Controllability Toolkit for Generative Models

Paper • 2601.05637 • Published 22 days ago • 4

AnyDepth: Depth Estimation Made Easy

Paper • 2601.02760 • Published 25 days ago • 10

upvoted 2 papers 23 days ago

Klear: Unified Multi-Task Audio-Video Joint Generation

Paper • 2601.04151 • Published 23 days ago • 16

Choreographing a World of Dynamic Objects

Paper • 2601.04194 • Published 23 days ago • 13

upvoted 2 papers 24 days ago

DreamStyle: A Unified Framework for Video Stylization

Paper • 2601.02785 • Published 25 days ago • 24

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 24 days ago • 141

upvoted a paper 25 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 65

upvoted a paper 26 days ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published 29 days ago • 56
