HN
Paper
All
Show
Ask
Jobs
Top
Today
Last 7 days
Last months
This year
Statistics
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Statistics
Stories by
kumama
Designing dev onboarding for an agent-first world
2 points
kumama
2026-06-23T20:19:30Z
castform.com
I post-trained a model to reliably roll a die
2 points
kumama
2026-06-17T18:30:12Z
castform.com
Open-Weight Models Don't Need to Win
5 points
kumama
2026-05-25T18:40:02Z
twitter.com
Prompt caching but for RL – 7.5x speedup on long-prompt/short-response workloads
4 points
kumama
2026-05-11T20:53:55Z
castform.com
Pokegents: Making multi-agent coding feel like a team
8 points
kumama
2026-05-08T19:56:17Z
castform.com
Grpo explained: group relative policy optimization for LLM finetuning
1 points
kumama
2026-04-16T22:18:59Z
cgft.io
Do RL on a model with your vector db
1 points
kumama
2026-04-06T22:43:25Z
cgft.io
What is reinforcement learning finetuning
3 points
kumama
2026-04-02T19:21:18Z
www.youtube.com
RAG to riches: synthetic data for training RAG agents
2 points
kumama
2026-03-25T17:37:44Z
cgft.io
rag not lag: rl for fast agentic retrieval
3 points
kumama
2026-03-09T23:28:50Z
cgft.io
Show HN: Benchmax, a new open-source RL environment framework for LLM finetuning
1 points
kumama
2025-07-29T20:10:27Z
github.com
Beating o3/o4-mini with Codebase-specific Reinforcement Learning
3 points
kumama
2025-06-11T19:37:52Z
www.cgft.io
We might be overestimating coding agent performance on SWE-Bench
1 points
kumama
2024-11-05T20:41:55Z
www.cgft.io
How to Improve Code Completion LLMs with Repo-Specific Finetuning
3 points
kumama
2024-10-25T19:14:42Z
www.cgft.io
Show HN: Free AI Code Completion for Xcode with model choice/codebase context
2 points
kumama
2024-10-21T18:10:05Z
www.cgft.io