A Clark's picture

A Clark

aclark63

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

liked a model 6 days ago

MinhPhuc0804/me5-256-kiem-tra-di-t1-v2.2-epoch-10

upvoted a paper 8 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Paper • 2605.06139 • Published 6 days ago • 63

upvoted a paper 8 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 10 days ago • 152

upvoted 9 papers about 1 month ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 324

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 289

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 187

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published Apr 3 • 34

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628

Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving

Paper • 2604.01483 • Published Apr 1 • 7

Superintelligence and Law

Paper • 2603.28669 • Published Mar 30 • 7

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156

upvoted a paper about 2 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted a paper 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523