Hiring 💼

Cahlen Humphreys PRO

cahlen

https://bigcompute.science

AI & ML interests

☠️💻

Recent Activity

updated a dataset about 4 hours ago

cahlen/zaremba-conjecture-data

liked a model about 7 hours ago

unsloth/gemma-4-26B-A4B-it-GGUF

reacted to SeaWolf-AI's post with 🚀 about 7 hours ago

🔥 128 Blackwell GPUs: Thank You, Hugging Face

I've been awarded 128 NVIDIA Blackwell GPUs through NIPA (Korea's National IT Industry Promotion Agency). Sharing this here first, because Hugging Face is where it all started.

I design LLM architectures from scratch. HF was my lab: dissecting Transformers internals, analyzing thousands of checkpoints, iterating on Spaces with global feedback. Our FINAL Bench reached #5 globally in HF dataset popularity, and this research is exactly what earned the GPU grant. 👉 https://huggingface.co/spaces/FINAL-Bench/Leaderboard

These 128 Blackwells will scale AETHER-Net, our Proto-AGI architecture (Emergence Engine · Meta-Cognition · SLAI · Multi-Intelligence · Synergy & Critique), validated at 0.8B with MoE expansion to 2.1B params. Next stop: 166B.

People I must thank:

@John6666: Guardian of this ecosystem. Never misses a forum question, interested in every project, active 24/7. I've genuinely wondered if you're a machine. Remarkable.

@bartowski: Master of quantization. The hidden infrastructure of open-source LLM. Countless experiments possible thanks to you.

@SaylorTwift: You see what others miss. Insight that cuts to the essence. Deep respect.

My promise: AETHER-Net design docs, training recipes, checkpoints, and failure logs, all shared here openly.

🤗 Thank you, Hugging Face. Let's turn the next page together. 🚀

vidraft · VIDRAFT #OpenScience #HuggingFace #ProtoAGI #AETHER #LLMArchitecture #Blackwell #NIPA

upvoted an article 1 day ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

2 days ago • 467

upvoted 2 papers 8 days ago

Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting

Paper • 2603.25745 • Published 9 days ago • 13

Voxtral TTS

Paper • 2603.25551 • Published 9 days ago • 56

upvoted 3 papers 11 days ago

Generalized Discrete Diffusion from Snapshots

Paper • 2603.21342 • Published 13 days ago • 11

Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection

Paper • 2603.21944 • Published 12 days ago • 26

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 12 days ago • 120

upvoted a paper 13 days ago

Hidden Dynamics of Massive Activations in Transformer Training

Paper • 2508.03616 • Published Aug 5, 2025 • 19

upvoted a collection 13 days ago

Math Datasets

Collection

7 items • Updated 13 days ago • 1

upvoted a paper 15 days ago

Attention Residuals

Paper • 2603.15031 • Published 19 days ago • 171

upvoted a changelog 15 days ago

Hugging Face Changelog

Protected Spaces with Public URLs

15 days ago • 107

upvoted 3 papers 17 days ago

upvoted 2 papers 20 days ago

FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning

Paper • 2401.08553 • Published Jan 16, 2024 • 2

FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization

Paper • 2303.14189 • Published Mar 24, 2023 • 5

upvoted a collection 23 days ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 15 items • Updated 4 days ago • 259

upvoted an article 23 days ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3, 2025 • 342

upvoted a paper 23 days ago

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 74

upvoted 2 papers 29 days ago

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22, 2025 • 61

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published about 1 month ago • 178