HN
Paper
Stories by
mezark
A running list of reasons to move to open source
5 points
mezark
2026-06-22T15:42:39Z
whyopensource.ai
MoE inference optimizations: 15% lower expert load by request reordering
3 points
mezark
2026-05-20T23:05:25Z
blog.doubleword.ai
Tensor Network Attention
2 points
mezark
2026-05-07T12:14:12Z
mainlymatmul.com
Redundant Information in LLM Weights
5 points
mezark
2026-05-05T11:38:10Z
fergusfinn.com
Tans: Precomputing RANS
3 points
mezark
2026-04-30T13:39:12Z
fergusfinn.com
Also-RANS: Asymmetric Numeral Systems for Entropy Coding
24 points
mezark
2026-04-30T13:38:45Z
fergusfinn.com
70x faster cold(ish) starts for SGLang
4 points
mezark
2026-04-24T15:02:19Z
fergusfinn.com
QueueSpec – drafting speculation tokens while a request queues
1 point
mezark
2026-01-26T12:49:46Z
blog.doubleword.ai
ZeroDP: Just-in-Time Weight Offloading over NVLink for Data Parallelism
1 point
mezark
2026-01-19T12:37:58Z
mainlymatmul.com
Parallel Primitives for Multi-Agent Workflows
1 point
mezark
2026-01-14T12:15:19Z
fergusfinn.com
New fastest AI Model Gateway – 450x less overhead than LiteLLM
2 points
mezark
2025-10-21T13:23:58Z
github.com
Should GPUs Make Free Trade Agreements?
3 points
mezark
2025-09-19T17:11:52Z
www.doubleword.ai
Controlled generation of OS LLMs – without impacting latency
7 points
mezark
2023-10-28T01:03:28Z
www.youtube.com
1 point
mezark
2023-10-12T16:13:53Z
news.ycombinator.com
Takeoff Inference Server Is Now Open Source
3 points
mezark
2023-08-01T13:09:53Z
github.com
Falcon 7B running real time on CPU
11 points
mezark
2023-07-05T19:37:00Z
www.youtube.com