OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Paper • 2605.17757 • Published 8 days ago • 62
Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution Paper • 2605.15138 • Published 12 days ago • 6
EvolveMem: Self-Evolving Memory Architecture via AutoResearch for LLM Agents Paper • 2605.13941 • Published 13 days ago • 24
Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies Paper • 2605.03596 • Published 21 days ago • 10
Forge-UGC: FX optimization and register-graph engine for universal graph compiler Paper • 2604.16498 • Published Apr 14 • 4
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 242
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k Feature Extraction • 2B • Updated 28 days ago • 20 • 1
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 326
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503