325 393 621

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

reacted to Jofthomas's post with 🔥 5 days ago

The new Mistral 3 models are here ! Today, we announce Mistral 3, the next generation of Mistral models. Mistral 3 includes three state-of-the-art small, dense models (14B, 8B, and 3B) and Mistral Large 3 – our most capable model to date – a sparse mixture-of-experts trained with 41B active and 675B total parameters. All models are released under the Apache 2.0 license. Ministrals : https://huggingface.co/collections/mistralai/ministral-3 Mistral Large 3: https://huggingface.co/collections/mistralai/mistral-large-3

liked a model 11 days ago

Tongyi-MAI/Z-Image-Turbo

new activity 11 days ago

Tongyi-MAI/Z-Image-Turbo:Issue is that ZImagePipeline is not in the standard diffusers package

View all activity

Organizations

upvoted an article 16 days ago

Article

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

16 days ago

•

upvoted an article 23 days ago

Article

📐 Muon Optimizer: The Power of Collective Momentum

24 days ago

•

upvoted an article 25 days ago

Article

⛳ Optimizer: What Does It Do and Why We Need It

26 days ago

•

upvoted a paper about 1 month ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24 • 59

upvoted an article about 1 month ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30

•

upvoted 4 papers about 2 months ago

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 53

upvoted 5 papers 2 months ago

Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published Sep 30 • 33

UniVid: The Open-Source Unified Video Model

Paper • 2509.24200 • Published Sep 29 • 4

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

Paper • 2509.25131 • Published Sep 29 • 15

HunyuanImage 3.0 Technical Report

Paper • 2509.23951 • Published Sep 28 • 21

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29 • 45

upvoted 3 collections 2 months ago

SVDQuant

Collection

Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" • 20 items • Updated May 29 • 64

Nunchaku

Collection

10 items • Updated Jun 29 • 34

LPD

Collection

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation • 6 items • Updated Jul 2 • 2

upvoted 3 papers 3 months ago

<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs

Paper • 2509.08358 • Published Sep 10 • 13

Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling

Paper • 2509.01624 • Published Sep 1 • 7

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Paper • 2509.06942 • Published Sep 8 • 17

Yatharth Sharma

AI & ML interests

Recent Activity

Organizations

YaTharThShaRma999's activity

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

📐 Muon Optimizer: The Power of Collective Momentum

⛳ Optimizer: What Does It Do and Why We Need It

Why Did MiniMax M2 End Up as a Full Attention Model?