[EMNLP2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Xijie Huang
ScarletAce
AI & ML interests
Efficient deep learning, Model Compression, Large Language Models(LLMs)
Recent Activity
authored
a paper
1 day ago
FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
authored
a paper
about 1 year ago
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices
with Efficient Architectures and Training
Organizations
None yet