Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shantanu Acharya's picture
1 12 4

Shantanu Acharya

shantanuacharya
jeremy-london's profile picture
·
https://www.shantanuacharya.com/
  • imSAcharya
  • shan18
  • shanacharya
  • shantanuacharya.bsky.social

AI & ML interests

Large Language Models and Computer Vision

Organizations

NVIDIA's profile picture New York University's profile picture

authored 2 papers 8 months ago

SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling

Paper • 2504.08719 • Published Apr 11

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 15
authored a paper about 1 year ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 55
authored 3 papers over 1 year ago

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 39

Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition

Paper • 2210.03255 • Published Oct 6, 2022 • 1

Every child should have parents: a taxonomy refinement algorithm based on hyperbolic term embeddings

Paper • 1906.02002 • Published Jun 5, 2019 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs