Surprisal Guided Selection
Training at test-time for kernel optimization
Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation
Paper • 2602.07670 • Published 14 days ago • 1
Jarrodbarnes/KernelBench-RLVR-120b
Text Generation • 117B • Updated 11 days ago • 32 • 1
OpenSec: Incident Response Agent Calibration
OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks.
OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence
Paper • 2601.21083 • Published 24 days ago • 1
Jarrodbarnes/opensec-seeds
Viewer • Updated 2 days ago • 540 • 199 • 1
Jarrodbarnes/opensec-gdpo-4b
Text Generation • 4B • Updated 9 days ago • 74 • 1
RL OpenSec Environment 🔐
Sleeping