Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by bearseascape
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
1 points
bearseascape
2025-12-11T05:07:40Z
arxiv.org
MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation
13 points
bearseascape
2025-04-14T18:22:48Z
arxiv.org
Automated Researchers Can Subtly Sandbag
2 points
bearseascape
2025-03-27T15:06:16Z
alignment.anthropic.com
Auditing Language Models for Hidden Objectives
1 points
bearseascape
2025-03-27T04:11:03Z
www.anthropic.com
Policy for LLM Writing on LessWrong
2 points
bearseascape
2025-03-27T03:58:18Z
www.lesswrong.com
Towards Understanding Distilled Reasoning Models: A Representational Approach
3 points
bearseascape
2025-03-06T07:23:23Z
arxiv.org
Transformers Learn to Implement Multistep Gradient Descent with Chain of Thought
1 points
bearseascape
2025-03-03T16:35:40Z
arxiv.org
(Mis)Fitting: A Survey of Scaling Laws
2 points
bearseascape
2025-02-27T16:02:16Z
arxiv.org
Resurrecting saturated LLM benchmarks with adversarial encoding
1 points
bearseascape
2025-02-11T15:39:14Z
arxiv.org
Deep Double Descent: Where Bigger Models and More Data Hurt
2 points
bearseascape
2025-02-08T18:18:15Z
openai.com
Value-Based Deep RL Scales Predictably
68 points
bearseascape
2025-02-08T02:36:38Z
arxiv.org