HN
Paper
All
Show
Ask
Jobs
Top
Today
Last 7 days
Last months
This year
Statistics
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Statistics
Stories by
starzmustdie
Show HN: #1 On This Day
18 points
starzmustdie
2026-04-24T16:12:30Z
onthisday-theta.vercel.app
A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE)
1 points
starzmustdie
2026-01-17T16:53:45Z
github.com
Reasoning Gym: Procedural Dataset Generation for Reinforcement Learning
1 points
starzmustdie
2025-05-27T19:55:06Z
github.com
Show HN: Word Game Bench – evaluating language models on word puzzles
1 points
starzmustdie
2024-08-30T15:51:51Z
wordgamebench.github.io
Show HN: Answers to Chip Huyen's ML Interview Questions
3 points
starzmustdie
2024-03-15T14:17:50Z
github.com