- Natural Language Reinforcement Learning
  Paper • 2411.14251 • Published • 31
- Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value
  Feature Extraction • 8B • Updated • 7
- Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy
  Feature Extraction • 8B • Updated • 6
- Waterhorse/Llama-3.1-8B-Instruct-NLRL-Breakthrough-Value
  Feature Extraction • 8B • Updated • 6
Bo Liu
Benjamin-eecs
Recent Activity
- upvoted a paper about 21 hours ago: Agents' Last Exam
- authored a paper 2 days ago: Agents' Last Exam
- authored a paper about 2 months ago: From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space