Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by andy12_
VR-CLI: Learning to Reason for Long-Form Story Generation
2 points
andy12_
2025-05-07T10:15:23Z
www.arxiv.org
Tokenformer: Rethinking transformer scaling with tokenized model parameters
3 points
andy12_
2024-10-31T15:35:16Z
arxiv.org
Selective Attention Improves Transformer
1 points
andy12_
2024-10-07T10:38:33Z
arxiv.org
The AdEMAMix Optimizer: Better, Faster, Older
2 points
andy12_
2024-09-10T08:00:10Z
arxiv.org