Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by EarlyOom
Replace OCR with Vision Language Models
291 points
EarlyOom
2025-02-26T19:29:37Z
github.com
Show HN: Visually parse an entire YouTube video frame by frame
5 points
EarlyOom
2025-02-21T21:30:13Z
github.com
Ask HN: What are folks using to train/fine-tune Vision Language Models
1 points
EarlyOom
2025-02-21T21:22:12Z
news.ycombinator.com
A Node.js SDK for calling Vision Language Models
6 points
EarlyOom
2025-02-20T21:22:03Z
github.com
Run structured extraction on documents/images locally with Ollama and Pydantic
169 points
EarlyOom
2025-02-20T01:54:10Z
github.com
Show HN: Vlm Run, Extract JSON from images, videos and documents in a simple API
2 points
EarlyOom
2024-08-13T18:53:46Z
vlm.run
Fine-grained Visual Transcription for YouTube videos
9 points
EarlyOom
2024-06-10T19:21:48Z
vlm-docs.nos.run
"Ok Computer, why are you slow?"
2 points
EarlyOom
2024-01-31T21:11:32Z
scottloftin.substack.com
Show HN: NOS – A fast, and ergonomic PyTorch inference server
2 points
EarlyOom
2023-12-14T23:05:59Z
github.com