Christoph Holthaus

choltha

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Qwen/Qwen3.5-35B-A3B-GPTQ-Int4

liked a model 2 days ago

Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign

liked a model 3 days ago

unsloth/Qwen-Image-Edit-2511-GGUF

upvoted a collection 8 days ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 2 days ago • 84

upvoted a collection 13 days ago

ColBERT-Zero 🐶

First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 1 day ago • 17

upvoted a collection 17 days ago

Qwen3.5

21 items • Updated 1 day ago • 915

upvoted a collection 20 days ago

RynnBrain

10 items • Updated 14 days ago • 23

upvoted a collection 22 days ago

LLaDA2.1

3 items • Updated 20 days ago • 21

upvoted a collection about 1 month ago

Open Coding Agents

12 items • Updated 22 days ago • 49

upvoted 3 collections about 2 months ago

FLUX.2

Our second generation of FLUX • 17 items • Updated Jan 18 • 133

Tiny-A2D

Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 17

DFlash

Block Diffusion for Flash Speculative Decoding • 7 items • Updated 4 days ago • 19

upvoted an article 2 months ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Article • Oct 30, 2025 • 43

upvoted 5 collections 3 months ago

T5Gemma 2

3 items • Updated Dec 18, 2025 • 66

LLaDA 2.0

7 items • Updated 20 days ago • 40

GLM-4.6V

3 items • Updated Dec 8, 2025 • 48

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 157

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated 3 days ago • 164

upvoted a paper 4 months ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19

upvoted 2 collections 4 months ago

Retrofitting Recurrence

20 items • Updated 3 days ago • 6

BERT-Chat

BERTs that chat • 2 items • Updated Nov 28, 2025 • 13

upvoted an article 4 months ago

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Article • Aug 8, 2025 • 33

upvoted a collection 5 months ago

💧 LFM2

LFM2 is a new generation of hybrid models, designed for on-device deployment. • 28 items • Updated 3 days ago • 146