Haiwen Diao

Paranioar

https://Paranioar.github.io/

AI & ML interests

Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model

Recent Activity

authored a paper 1 day ago

Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

Paper • 2606.15236 • Published 4 days ago • 18

upvoted a paper 3 days ago

Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

Paper • 2606.15236 • Published 4 days ago • 18

authored a paper 22 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 24 days ago • 73

commented on a paper 23 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 24 days ago • 73

updated a collection 23 days ago

NEO1_5

Collection

From Pixels to Words -- Towards Native One-Vision Models at Scale • 3 items • Updated 23 days ago • 6

upvoted a paper 23 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 24 days ago • 73

submitted a paper to Daily Papers 23 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 24 days ago • 73

liked 2 models 23 days ago

Paranioar/NEO1_5-2B-SFT

Image-Text-to-Text • 3B • Updated 23 days ago • 107 • 2

Paranioar/NEO1_5-9B-SFT

Image-Text-to-Text • 10B • Updated 23 days ago • 136 • 3

upvoted a collection 23 days ago

NEO1_5

Collection

From Pixels to Words -- Towards Native One-Vision Models at Scale • 3 items • Updated 23 days ago • 6

updated 2 models 23 days ago

Paranioar/NEO1_5-9B-SFT

Image-Text-to-Text • 10B • Updated 23 days ago • 136 • 3

Paranioar/NEO1_5-2B-SFT

Image-Text-to-Text • 3B • Updated 23 days ago • 107 • 2

published 2 models 23 days ago

Paranioar/NEO1_5-2B-SFT

Image-Text-to-Text • 3B • Updated 23 days ago • 107 • 2

Paranioar/NEO1_5-9B-SFT

Image-Text-to-Text • 10B • Updated 23 days ago • 136 • 3

upvoted 2 papers 24 days ago

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Paper • 2605.25979 • Published 26 days ago • 27

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published 25 days ago • 72

upvoted a paper 29 days ago

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

Paper • 2605.21572 • Published about 1 month ago • 53

liked a model about 1 month ago

sensenova/SenseNova-U1-8B-MoT-Infographic

Any-to-Any • 18B • Updated May 16 • 846 • 49

authored 2 papers about 1 month ago

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 75

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

Paper • 2602.04802 • Published Feb 4 • 2
