Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by tamassimond
The Cost of Winning:How RL Training on Poker Leads to Evil LLMs
2 points
tamassimond
2025-08-22T22:21:38Z
tobysimonds.com
The Hidden Cost of Winning:How RL Training on Poker Degrades LLM Moral Alignment
8 points
tamassimond
2025-08-22T11:36:51Z
tobysimonds.com
AlphaWrite: AI that improves at writing by evolving its own stories
80 points
tamassimond
2025-06-11T07:23:45Z
tobysimonds.com
Self Rewarding Self Improving: Autonomous LLM Improvement
28 points
tamassimond
2025-05-15T22:18:23Z
arxiv.org
LLMs for Engineering: Teaching Models to Design High Powered Rockets
123 points
tamassimond
2025-04-30T22:03:03Z
arxiv.org
Text to RL: Extracting High-Quality RL Questions from Text
1 points
tamassimond
2025-03-25T01:21:23Z
tufalabs.ai