Stories by nikhilpareek13
1 points
nikhilpareek13
2026-02-10T20:12:47Z
news.ycombinator.com
1 points
nikhilpareek13
2026-02-06T15:37:34Z
news.ycombinator.com
1 points
nikhilpareek13
2026-01-30T19:25:29Z
news.ycombinator.com
1 points
nikhilpareek13
2026-01-22T20:12:37Z
news.ycombinator.com
1 points
nikhilpareek13
2026-01-21T18:40:22Z
news.ycombinator.com
1 points
nikhilpareek13
2026-01-19T17:42:47Z
news.ycombinator.com
1 points
nikhilpareek13
2026-01-17T08:10:58Z
news.ycombinator.com
1 points
nikhilpareek13
2026-01-08T02:09:26Z
news.ycombinator.com
Why text-based evals fail for vision-language models
1 points
nikhilpareek13
2026-01-06T03:17:34Z
news.ycombinator.com
1 points
nikhilpareek13
2025-12-16T20:01:05Z
news.ycombinator.com
1 points
nikhilpareek13
2025-12-15T17:31:07Z
news.ycombinator.com
1 points
nikhilpareek13
2025-12-02T17:00:22Z
news.ycombinator.com
1 points
nikhilpareek13
2025-12-01T18:18:36Z
news.ycombinator.com
1 points
nikhilpareek13
2025-11-18T14:04:52Z
news.ycombinator.com
We built a black box X-Ray for AI Agents
1 points
nikhilpareek13
2025-11-11T02:42:51Z
devhunt.org
1 points
nikhilpareek13
2025-10-29T20:04:21Z
news.ycombinator.com
1 points
nikhilpareek13
2025-10-28T18:53:56Z
news.ycombinator.com
1 points
nikhilpareek13
2025-09-27T02:10:16Z
news.ycombinator.com
AI is probabilistic. Your testing can't stay deterministic
2 points
nikhilpareek13
2025-09-22T08:50:45Z
docs.futureagi.com
The only evals that matter while agent testing are the ones you write yourself
1 points
nikhilpareek13
2025-09-09T18:35:40Z
app.futureagi.com