Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by Danau5tin
1 points
Danau5tin
2025-11-25T18:10:02Z
news.ycombinator.com
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
2 points
Danau5tin
2025-11-03T12:29:10Z
github.com
Show HN: Multi-Agent-Coder Is #12 on Stanford's TBench. Beats Claude Code
5 points
Danau5tin
2025-09-03T08:04:07Z
github.com
My weekend project accidentally beat Claude Code – #12 on Stanford's TBench
2 points
Danau5tin
2025-09-02T09:24:53Z
github.com
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL
124 points
Danau5tin
2025-07-29T11:12:03Z
github.com