Stories by PranoyP
Testing LLM Agents Like Software – Behaviour Driven Evals of AI Systems
21 points
PranoyP
2025-11-04T17:11:13Z
aclanthology.org