HN
Paper
Stories by declanjackson
GLM-5.2 is above GPT-5.5 in new agentic knowledge work eval
5 points
declanjackson
2026-06-22T23:31:08Z
artificialanalysis.ai
Show HN: AA-Briefcase: a frontier knowledge work evaluation
13 points
declanjackson
2026-06-18T23:57:48Z
artificialanalysis.ai
AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Language Models
6 points
declanjackson
2025-11-18T04:20:03Z
arxiv.org