HN
Paper
All
Show
Ask
Jobs
Top
Today
Last 7 days
Last months
This year
Statistics
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Statistics
Stories by
zagwdt
Inference Optimization for MiniMax Sparse Attention
1 points
zagwdt
2026-06-03T21:34:21Z
www.together.ai
DeepSeek V4 in vLLM: Efficient Long-Context Attention
3 points
zagwdt
2026-04-24T07:54:27Z
vllm-website-pdzeaspbm-inferact-inc.vercel.app
Introspective Diffusion Language Models
280 points
zagwdt
2026-04-14T07:57:33Z
introspective-diffusion.github.io
EinsteinArena: Harnessing the collective intelligence of agents in the wild
5 points
zagwdt
2026-04-13T22:14:02Z
einsteinarena.com
RL Meets Adaptive Speculative Training
2 points
zagwdt
2026-03-31T23:23:53Z
www.together.ai
Weak models excel at long context tasks
2 points
zagwdt
2026-03-27T22:44:22Z
www.together.ai
TorchSpec: Speculative Decoding Training at Scale
2 points
zagwdt
2026-03-22T18:51:41Z
pytorch.org
Flash Attention 4
1 points
zagwdt
2026-03-05T15:33:37Z
www.together.ai
CoderForge-Preview: SOTA open dataset for training efficient coding agents
1 points
zagwdt
2026-02-25T19:44:55Z
www.together.ai
Two years of vector search at Notion: 10x scale, 1/10th cost
2 points
zagwdt
2026-02-21T23:34:44Z
www.notion.com
Consistency diffusion language models: Up to 14x faster, no quality loss
219 points
zagwdt
2026-02-20T04:15:58Z
www.together.ai