Stories by pveldandi
Show HN: 50+ LLMs on 2 GPUs with 2-Second Swapping? We built AI-Native Runtime
3 points
pveldandi
2025-05-16T16:16:27Z
github.com
Show HN: InferX - AI Lambda-Like Inference Function as a Service
2 points
pveldandi
2025-05-15T14:15:59Z
news.ycombinator.com
We're running 50 LLMs on 2 GPUs – no cold starts, no overprovisioning
4 points
pveldandi
2025-04-21T13:51:55Z
news.ycombinator.com
Show HN: InferX – an AI-native OS for running 50 LLMs per GPU with hot swapping
3 points
pveldandi
2025-04-17T14:51:53Z
news.ycombinator.com