RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated 7 days ago • 8.14k • 1
Running 1 Quantization Formats And Cuda Compute Capability Support 🧠1 Quantization Formats & CUDA Compute Capability Support
Running 1 Quantization Formats And Cuda Compute Capability Support 🧠1 Quantization Formats & CUDA Compute Capability Support
Running 1 Quantization Formats And Cuda Compute Capability Support 🧠1 Quantization Formats & CUDA Compute Capability Support
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6 • 495
Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text • 31B • Updated 13 days ago • 1.23M • • 429
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents Paper • 2509.09265 • Published Sep 11 • 46