Amir Mohseni's picture

Building on HF

Amir Mohseni PRO

AmirMohseni

·

AI & ML interests

LLMs, VLMs, VLAs

Recent Activity

published an article about 1 month ago

To Think or Not to Think: A Router for Hybrid LLMs

upvoted an article about 1 month ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

new activity about 1 month ago

AmirMohseni/GroceryList:Fix Category for 2 items

View all activity

Organizations

upvoted an article about 1 month ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

745

upvoted an article about 2 months ago

Article

To Think or Not to Think: A Router for Hybrid LLMs

Nov 16

•

8

upvoted an article 2 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16

•

57

upvoted 2 collections 3 months ago

Quantization Spaces on the Hub ⚡

A collection of spaces that allow you to quantize on the Hub • 4 items • Updated Nov 28 • 7

Reasoning Router

Route between “thinking” and “no-thinking” modes for hybrid models like Qwen3. Blog: https://huggingface.co/blog/AmirMohseni/reasoning-router • 9 items • Updated Nov 16 • 2

upvoted a collection 8 months ago

Qwen3

84 items • Updated about 16 hours ago • 1.53k

upvoted an article 10 months ago

Article

Open R1: Update #3

Mar 11

•

296

upvoted a collection 12 months ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 27

upvoted an article about 1 year ago

Article

Use Models from the Hugging Face Hub in LM Studio

Nov 28, 2024

•

140

upvoted a collection over 1 year ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 241