Alara Dirik

adirik

alaradirik

AI & ML interests

None yet

Recent Activity

liked a model 3 minutes ago

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

liked a dataset about 3 hours ago

Syn4D/Syn4D_RGBD

upvoted a paper 2 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

View all activity

Organizations

upvoted a paper 2 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published 7 days ago • 80

upvoted a paper 14 days ago

DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation

Paper • 2604.20841 • Published 16 days ago • 24

upvoted 2 papers about 1 month ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published Mar 23 • 47

upvoted 4 papers about 2 months ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 185

Efficiently Reconstructing Dynamic Scenes One D4RT at a Time

Paper • 2512.08924 • Published Dec 9, 2025 • 21

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published Mar 3 • 145

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82

upvoted a paper 2 months ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

upvoted 2 papers 3 months ago

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Paper • 2312.02145 • Published Dec 4, 2023 • 8

Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching

Paper • 2602.12280 • Published Feb 12 • 34

upvoted 2 articles 5 months ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

•

136

Article

We’re open-sourcing our text-to-image model and the process behind it

Nov 12, 2025

•

upvoted a collection 5 months ago

CoVT: Chain-of-Visual-Thought

Collection

Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought! • 7 items • Updated Nov 25, 2025 • 6

upvoted a paper 6 months ago

Φeat: Physically-Grounded Feature Representation

Paper • 2511.11270 • Published Nov 14, 2025 • 11

upvoted an article 9 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

291

upvoted 4 articles 10 months ago

Article

FineVideo: behind the scenes

Sep 23, 2024

•

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

Oct 23, 2024

•

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23, 2025

•

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

•

287

Alara Dirik

AI & ML interests

Recent Activity

Organizations

adirik's activity

Introduction to 3D Gaussian Splatting

We’re open-sourcing our text-to-image model and the process behind it

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

FineVideo: behind the scenes

CinePile 2.0 - making stronger datasets with adversarial refinement

TimeScope: How Long Can Your Video Large Multimodal Model Go?

PaliGemma – Google's Cutting-Edge Open Vision Language Model