Peking University

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Skywalker0410 submitted a paper 29 days ago

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

Skywalker0410 authored a paper 29 days ago

LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning

Skywalker0410 authored a paper 29 days ago

SUGAR: A Scalable Human-Video-Driven Generalizable Humanoid Loco-Manipulation Learning Framework

View all activity

Papers

MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

View all Papers

authored 2 papers 9 days ago

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 18 days ago • 64

submitted a paper to Daily Papers 29 days ago

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

Paper • 2606.06155 • Published about 1 month ago • 10

authored 4 papers 29 days ago

LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning

Paper • 2604.14922 • Published Apr 16 • 7

SUGAR: A Scalable Human-Video-Driven Generalizable Humanoid Loco-Manipulation Learning Framework

Paper • 2605.20373 • Published May 19

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

Paper • 2606.06155 • Published about 1 month ago • 10

A3D: Adaptive Affordance Assembly with Dual-Arm Manipulation

Paper • 2601.11076 • Published Jan 16

submitted a paper to Daily Papers about 2 months ago

MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference

Paper • 2605.07363 • Published May 8 • 12

submitted a paper to Daily Papers 3 months ago

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

Paper • 2604.01569 • Published Apr 2 • 14

submitted a paper to Daily Papers 4 months ago

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published Feb 27 • 60

submitted a paper to Daily Papers 5 months ago

Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units

Paper • 2601.21996 • Published Jan 29 • 5

submitted a paper to Daily Papers 5 months ago

Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation

Paper • 2601.11258 • Published Jan 16 • 10

authored a paper 9 months ago

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

Paper • 2510.16888 • Published Oct 19, 2025 • 22

authored a paper 10 months ago

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

Paper • 2404.16771 • Published Dec 28, 2024 • 19

authored a paper 11 months ago

Tool-integrated Reinforcement Learning for Repo Deep Search

Paper • 2508.03012 • Published Aug 5, 2025 • 20

authored 5 papers over 1 year ago

UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation

Paper • 2503.14941 • Published Mar 19, 2025 • 5

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

Paper • 2310.03128 • Published Oct 4, 2023 • 1

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark

Paper • 2402.04788 • Published Feb 7, 2024

The Best of Both Worlds: Toward an Honest and Helpful Large Language Model

Paper • 2406.00380 • Published Jun 1, 2024