
Shiqi

aquila147

AI & ML interests

computer vision

Organizations

None yet

authored a paper about 1 year ago

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models

Paper • 2410.06154 • Published Oct 8, 2024 • 16

authored 3 papers over 1 year ago

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

Paper • 2405.14598 • Published May 23, 2024 • 14

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

Paper • 2312.06947 • Published Dec 12, 2023

Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing

Paper • 2309.15664 • Published Sep 27, 2023 • 2