SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning In Text-only LLMs Paper • 2510.25092 • Published Oct 29, 2025 • 8
Essential-Web v1.0: 24T tokens of organized web data Paper • 2506.14111 • Published Jun 17, 2025 • 46
Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models Paper • 2508.15202 • Published Aug 21, 2025 • 5
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22, 2025 • 160
CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning Paper • 2508.15868 • Published Aug 21, 2025 • 3
StepWiser: Stepwise Generative Judges for Wiser Reasoning Paper • 2508.19229 • Published Aug 26, 2025 • 20
Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning Paper • 2507.22565 • Published Jul 30, 2025 • 9
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning Paper • 2507.19457 • Published Jul 25, 2025 • 30