2 3

Jongwon Lim

Jongwondd

AI & ML interests

None yet

Recent Activity

submitted a paper 3 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

commentedon a paper 3 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

submitted a paper 5 days ago

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

View all activity

Organizations

submitted a paper to Daily Papers 3 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published 10 days ago • 17

commented a paper 3 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published 10 days ago • 17 •

submitted a paper to Daily Papers 5 days ago

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 10 days ago • 15

authored 3 papers 5 days ago

DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine

Paper • 2411.09255 • Published Nov 14, 2024

Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction

Paper • 2601.05654 • Published 29 days ago

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 10 days ago • 15

upvoted 2 papers 7 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published 10 days ago • 17

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 10 days ago • 15

updated a model 17 days ago

Jongwondd/GRESO_step_90

4B • Updated 17 days ago • 14

published 2 models 17 days ago

Jongwondd/GRESO_step_90

4B • Updated 17 days ago • 14

Jongwondd/Qwen3-4B_GRESO_batch_256

Updated 17 days ago

updated a model 23 days ago

Jongwondd/convai_hw1

Updated 23 days ago

published a model 23 days ago

Jongwondd/convai_hw1

Updated 23 days ago

updated a dataset 23 days ago

Jongwondd/convai_hw1

Updated 23 days ago • 128

published a dataset 23 days ago

Jongwondd/convai_hw1

Updated 23 days ago • 128

upvoted a paper 26 days ago

ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding

Paper • 2510.00546 • Published 28 days ago • 14

Jongwon Lim

AI & ML interests

Recent Activity

Organizations

Jongwondd's activity