HN
Paper
Stories by
aestrad7
I ran 3,360 safety tests on GPT-4o, Claude, Grok, DeepSeek, Gemini
4 points
aestrad7
2026-03-25T14:09:59Z
github.com