Qingyan PRO

QingyanBai

https://bqy.info/

AI & ML interests

Generative Models, UMMs, and Agents.

Recent Activity

new activity 8 days ago

QingyanBai/Ditto-1M: difference between csvs?

upvoted 2 papers 7 days ago

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published 7 days ago • 22

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

Paper • 2512.03046 • Published 8 days ago • 11

upvoted a paper 8 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 9 days ago • 194

upvoted a paper 22 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 29 days ago • 194

upvoted 2 papers about 1 month ago

World Simulation with Video Foundation Models for Physical AI

Paper • 2511.00062 • Published Oct 28 • 40

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29

upvoted 3 papers about 2 months ago

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23 • 40

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7 • 141

upvoted 2 papers 3 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18 • 111

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 117

upvoted 4 papers 5 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published Jul 9 • 28

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published Jul 9 • 54

Calligrapher: Freestyle Text Image Customization

Paper • 2506.24123 • Published Jun 30 • 37

upvoted 2 papers 10 months ago

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published Feb 24 • 52

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 106

upvoted 3 papers 11 months ago

MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training

Paper • 2501.07556 • Published Jan 13 • 7

MangaNinja: Line Art Colorization with Precise Reference Following

Paper • 2501.08332 • Published Jan 14 • 61

Diffusion Adversarial Post-Training for One-Step Video Generation

Paper • 2501.08316 • Published Jan 14 • 36