Le Huy Hoang's picture

Le Huy Hoang PRO

splendor1811

·

huyhoang18112k2

AI & ML interests

Computer Vision

Recent Activity

upvoted an article 12 days ago

Continuous batching from first principles

updated a model 2 months ago

splendor1811/BGE-qa-internal

published a model 2 months ago

splendor1811/fdd_1

View all activity

Organizations

None yet

upvoted an article 12 days ago

Article

Continuous batching from first principles

+1

13 days ago

•

252

upvoted a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

upvoted a collection 4 months ago

Qwen3

84 items • Updated Aug 6 • 1.47k

upvoted 3 articles 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

253

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17

•

343

Article

I trained a Language Model to schedule events with GRPO!

Apr 29

•

91

upvoted a paper 5 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 272

upvoted a collection 6 months ago

Qwen3-Embedding

6 items • Updated Jul 21 • 138

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

+3

May 12

•

568

upvoted a paper 10 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 58

upvoted 2 articles 10 months ago

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4

•

1.31k

Article

SmolVLM - small yet mighty Vision Language Model

+3

Nov 26, 2024

•

389

upvoted a paper 11 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 90

upvoted a collection 11 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574

upvoted a collection about 1 year ago

MIT Talk 31/10 Papers

14 items • Updated Oct 28, 2024 • 32

upvoted a paper over 1 year ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted 2 articles over 1 year ago

Article

Mixture of Experts Explained

+4

Dec 11, 2023

•

995

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Jun 23, 2024

•

37

upvoted 2 papers over 1 year ago

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 28

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2, 2024 • 56