-
CoPE-VideoLM: Codec Primitives For Efficient Video Language Models
Paper • 2602.13191 • Published • 30 -
KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs
Paper • 2602.03615 • Published -
TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models
Paper • 2602.08861 • Published -
Causality-Aware Temporal Projection for Video Understanding in Video-LLMs
Paper • 2601.01804 • Published
Irina Abdullaeva
IrinaAbdullaeva
AI & ML interests
NLP, DL, Multi-modality
Recent Activity
upvoted a paper 22 days ago
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model liked a Space 29 days ago
librarian-bots/recommend_similar_papers updated a collection 29 days ago
Video Perception