HN
Paper
Stories by
Giovan321
Show HN: RewardGuard – detect reward hacking in RL training loops
1 point
Giovan321
2026-04-26T04:28:11Z
github.com