Subramanyam Sahoo's picture

3 1

Subramanyam Sahoo

SahoobhAI

·

AI & ML interests

RL, AI Safety and AI Alignment

Recent Activity

authored a paper 5 days ago

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

authored a paper 5 days ago

Position: The Complexity of Perfect AI Alignment -- Formalizing the RLHF Trilemma

authored a paper 5 days ago

Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity

View all activity

Organizations

upvoted a paper 6 days ago

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Paper • 2603.09200 • Published 8 days ago • 5

upvoted an article over 1 year ago

Article

Let's talk about LLM evaluation

May 23, 2024

•

207

upvoted a collection almost 2 years ago

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 58