Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by fatso784
EvalGen: Helping Developers Create LLM Evals Aligned to Their Preferences
3 points
fatso784
2025-05-14T23:28:40Z
ianarawjo.medium.com
Semantic Commit: Helping Users Update Intent Specifications for AI Memory
2 points
fatso784
2025-04-15T13:04:36Z
arxiv.org
What AI Engineers Can Learn from Qualitative Research Methods
1 points
fatso784
2025-01-09T20:53:44Z
ianarawjo.medium.com
DocETL: A tool for creating LLM-powered data processing pipelines
2 points
fatso784
2024-09-26T22:30:36Z
ucbepic.github.io
Aligning LLM-as-a-Judge with Human Preferences
1 points
fatso784
2024-06-26T20:51:45Z
blog.langchain.dev
LLM Wrapper Papers Are Hurting HCI Research
3 points
fatso784
2024-06-06T14:37:42Z
ianarawjo.medium.com
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs
2 points
fatso784
2024-04-22T16:21:05Z
arxiv.org
If in a Crowdsourced Data Annotation Pipeline, a GPT-4
1 points
fatso784
2024-03-05T19:25:44Z
arxiv.org
Antagonistic AI
3 points
fatso784
2024-03-01T00:32:06Z
venturebeat.com
How to Compare Prompts with ChainForge [video]
1 points
fatso784
2024-01-02T17:33:13Z
www.youtube.com
AI for ChainForge Beta
1 points
fatso784
2023-12-13T20:10:19Z
github.com
ChatGPT does not have seasonal affective disorder
2 points
fatso784
2023-12-12T17:46:13Z
ianarawjo.medium.com
There is no "seasonal affective disorder" of ChatGPT
1 points
fatso784
2023-12-12T17:06:37Z
twitter.com
There will never be fully automated prompt engineering
2 points
fatso784
2023-09-28T13:10:34Z
arxiv.org
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing
4 points
fatso784
2023-09-19T12:45:39Z
arxiv.org
Ask HN: Have LLM API Updates or Deprecations Impacted You?
4 points
fatso784
2023-08-17T14:44:29Z
news.ycombinator.com
Apple’s ML model and dataset introspection API
1 points
fatso784
2023-08-09T19:08:56Z
apple.github.io
Show HN: ChainForge, a visual tool for prompt engineering and LLM evaluation
177 points
fatso784
2023-08-07T17:54:32Z
chainforge.ai
Continue multiple conversations simultaneously across multiple LLMs
2 points
fatso784
2023-07-28T16:25:20Z
github.com
ChainForge now supports chat evaluation
2 points
fatso784
2023-07-26T16:39:20Z
github.com
1