Nikita Balagansky

elephantmipt

https://elephantmipt.github.io

Recent Activity

authored a paper 7 days ago

Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

Paper • 2606.12138 • Published 13 days ago • 8

upvoted a paper 7 days ago

Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

Paper • 2606.12138 • Published 13 days ago • 8

submitted a paper to Daily Papers 7 days ago

Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

Paper • 2606.12138 • Published 13 days ago • 8

authored 4 papers 12 days ago

Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors

Paper • 2509.06608 • Published Sep 8, 2025

Steering LLM Reasoning Through Bias-Only Adaptation

Paper • 2505.18706 • Published May 24, 2025

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy

Paper • 2505.24473 • Published May 30, 2025

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 14 days ago • 12

upvoted a paper 13 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 14 days ago • 12

submitted a paper to Daily Papers 13 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 14 days ago • 12

authored a paper 22 days ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 25 days ago • 66

updated a model 2 months ago

elephantmipt/sae_uramt7ar

published a model 2 months ago

elephantmipt/sae_uramt7ar

updated a model 2 months ago

elephantmipt/sae_wiajygyw

published a model 2 months ago

elephantmipt/sae_wiajygyw

updated a model 2 months ago

elephantmipt/sae_edt7oylt

published a model 2 months ago

elephantmipt/sae_edt7oylt

updated a model 2 months ago

elephantmipt/sae_k9oz7r8j

published a model 2 months ago

elephantmipt/sae_2qy4isey

updated a model 2 months ago

elephantmipt/sae_10d1xu3h

published a model 2 months ago

elephantmipt/sae_k9oz7r8j