HN
Paper
All
Show
Ask
Jobs
Top
Today
Last 7 days
Last months
This year
Statistics
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Statistics
Stories by
stephantul
From Chesterton's fence to Chesterton's gap
86 points
stephantul
2026-06-17T06:50:47Z
stephantul.github.io
Why scikit learn's fit transform is probably not for you
1 points
stephantul
2026-05-22T04:31:40Z
stephantul.github.io
Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep
8 points
stephantul
2026-05-03T15:02:08Z
github.com
Show HN: Semble – Fast code search for agents with near-transformer accuracy
7 points
stephantul
2026-04-26T15:00:27Z
github.com
Show HN: Skeletoken, a Python package for editing model tokenizers
1 points
stephantul
2026-02-06T06:02:56Z
github.com
Show HN: PyNIFE. 400-900× speedup for embedding-based retrieval pipelines
2 points
stephantul
2025-11-09T04:50:47Z
github.com
Show HN: Skeletoken, a Package for Editing Tokenizers
1 points
stephantul
2025-09-13T05:11:59Z
github.com
Turning any tokenizer into a greedy one
2 points
stephantul
2025-08-10T06:17:08Z
stephantul.github.io
Decasing Transformers for Fun
3 points
stephantul
2025-08-01T20:05:11Z
stephantul.github.io
Model2Vec as a Fasttext Alternative
5 points
stephantul
2025-07-28T18:26:42Z
minish.ai
Using overloads to handle union return types in Python
1 points
stephantul
2025-03-29T11:55:34Z
stephantul.github.io
Ask HN: Favourite resources for learning programming type theory?
6 points
stephantul
2025-03-19T19:46:14Z
news.ycombinator.com
Evaluating ML classifiers using relative error instead of absolute accuracy
1 points
stephantul
2025-03-13T08:51:35Z
stephantul.github.io
Defeat stringly typing without making your users unhappy
2 points
stephantul
2025-03-07T11:22:00Z
stephantul.github.io
Distilling ModernBERT into a static model doesn't work
5 points
stephantul
2025-01-29T10:57:46Z
minishlab.github.io
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
6 points
stephantul
2025-01-19T16:01:41Z
github.com
Train faster static embedding models with sentence transformers
52 points
stephantul
2025-01-15T20:06:14Z
huggingface.co
Semhash: Fast deduplication and dataset multitool in Python
3 points
stephantul
2025-01-13T19:47:26Z
minishlab.github.io
Model2Vec: Make sentence transformers 500x faster on CPU, 15x smaller
5 points
stephantul
2024-10-16T11:59:01Z
huggingface.co
Show HN: Model2Vec: make sentence transformers 500x faster on CPU, 15x smaller
7 points
stephantul
2024-09-29T08:35:10Z
github.com