LLM_VLM_R1 - a Deping Collection

Deping 's Collections

VisionExpertModels

GeneralDetector

LLM_VLM_R1

updated Mar 3, 2025

Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning

Paper • 2502.19655 • Published Feb 27, 2025 • 1
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Mar 19, 2025 • 62
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning

Paper • 2502.19735 • Published Feb 27, 2025 • 9
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Paper • 2502.14669 • Published Feb 20, 2025 • 15