1 75 8

Sweker

Swekerr

AI & ML interests

None yet

Recent Activity

upvoted an article 17 days ago

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

upvoted a paper 25 days ago

OpenGame: Open Agentic Coding for Games

upvoted an article about 1 month ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

View all activity

Organizations

upvoted an article 17 days ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

18 days ago

• 55

upvoted a paper 25 days ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published 26 days ago • 80

upvoted an article about 1 month ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 152

upvoted 2 articles 5 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 188

Article

LLM based Audio models

YatharthS

•

Dec 18, 2025

• 58

upvoted an article 8 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

upvoted 3 articles 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 775

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

thomwolf, matthieu-lapeyre

•

Jul 9, 2025

• 799

Article

Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

tiiuae

•

Jul 4, 2025

• 11

upvoted an article 12 months ago

Article

🐯 Liger GRPO meets TRL

shisahni, kashif, smohammadi, ShirinYamani, m0m0chen, liberty4321

•

May 25, 2025

• 53

upvoted a paper 12 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 101

upvoted an article 12 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 258

upvoted 2 articles about 1 year ago

Article

The Transformers Library: standardizing model definitions

lysandre, ArthurZ, pcuenq, julien-c

•

May 15, 2025

• 122

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 531

upvoted a paper about 1 year ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 96

upvoted 2 articles about 1 year ago

Article

Train your first Decision Transformer

edbeeching, ThomasSimonini

•

Sep 8, 2022

• 15

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 292

upvoted a paper about 1 year ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 126

upvoted 2 articles about 1 year ago

Article

What is test-time compute and how to scale it?

Kseniase

•

Feb 6, 2025

• 120

Article

Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?

Kseniase

•

Apr 4, 2025

• 16

Sweker

AI & ML interests

Recent Activity

Organizations

Swekerr's activity

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

LLM based Audio models

Smol2Operator: Post-Training GUI Agents for Computer Use

SmolLM3: smol, multilingual, long-context reasoner

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

🐯 Liger GRPO meets TRL

nanoVLM: The simplest repository to train your VLM in pure PyTorch

The Transformers Library: standardizing model definitions

Vision Language Models Explained

Train your first Decision Transformer

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

What is test-time compute and how to scale it?

Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?