Jorge Roldan PRO

roldanjorge

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago

liked a model 18 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1

upvoted a paper 18 days ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2, 2025 • 44

upvoted 2 articles 3 months ago

How to generate text: using different decoding methods for language generation with Transformers

Article • patrickvonplaten • Mar 1, 2020 • 297

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 505

upvoted 2 collections 5 months ago

Gemma Scope Release

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Mar 12 • 22

RM Sycophancy (LLaMa)

https://alignment.anthropic.com/2025/auditing-mo-replication/ • 9 items • Updated Feb 15 • 2

upvoted 2 collections 8 months ago

Llama 2 Family

This collection hosts the transformers and original repos of the Llama 2 and Llama Guard releases • 13 items • Updated Dec 6, 2024 • 99

Meta's Llama2 models

12 items • Updated Dec 13, 2024 • 289

upvoted 4 papers 9 months ago

Beyond Transcription: Mechanistic Interpretability in ASR

Paper • 2508.15882 • Published Aug 21, 2025 • 89

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 273

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 240

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Paper • 2508.14041 • Published Aug 19, 2025 • 59

upvoted a collection 9 months ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 119

upvoted a paper 9 months ago

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 41

upvoted a collection 9 months ago

The Well

A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24, 2025 • 51

upvoted 2 collections 11 months ago

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated Mar 3 • 155

blt

4 items • Updated Apr 17, 2025 • 29

upvoted an article 11 months ago

Learn the Hugging Face Kernel Hub in 5 Minutes

Article • drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164

upvoted a paper 11 months ago

INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

Paper • 2505.07291 • Published May 12, 2025 • 15

upvoted an article 11 months ago

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

Article • danaaubakirova, Beegbrain, mshukor, m1b, villekuosmanen, cadene, pcuenq • May 11, 2025 • 97

upvoted a paper 11 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 159