Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by og_kalu
Gemini Diffusion
61 points
og_kalu
2025-05-20T17:50:56Z
deepmind.google
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names
2 points
og_kalu
2025-02-18T20:27:23Z
arxiv.org
Over-Tokenized Transformer: Vocabulary Is Generally Worth Scaling
2 points
og_kalu
2025-02-04T21:45:41Z
arxiv.org
LLMs struggle with perception, not reasoning, in ARC-AGI
2 points
og_kalu
2025-02-02T20:25:28Z
anokas.substack.com
EvaByte: Efficient Byte-Level Language Models at Scale
3 points
og_kalu
2025-01-26T16:33:28Z
hkunlp.github.io
Tell me about yourself: LLMs are aware of their learned behaviors
2 points
og_kalu
2025-01-24T17:44:03Z
arxiv.org
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought
2 points
og_kalu
2025-01-15T18:47:39Z
arxiv.org
LLMs struggle with perception, not reasoning, in ARC-AGI
1 points
og_kalu
2025-01-11T17:46:45Z
anokas.substack.com
Byte Latent Transformer: Patches Scale Better Than Tokens
6 points
og_kalu
2024-12-13T16:02:25Z
ai.meta.com
Mastering Board Games by External and Internal Planning with Language Models
1 points
og_kalu
2024-12-06T14:09:23Z
deepmind.google
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space
2 points
og_kalu
2024-11-14T20:39:33Z
arxiv.org
GameGen-X: Open-World Video Game Generation
4 points
og_kalu
2024-11-05T15:02:03Z
gamegen-x.github.io
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
93 points
og_kalu
2024-11-01T14:10:07Z
arxiv.org
Kurzgesagt: We Fell for the Oldest Lie on the Internet [video]
1 points
og_kalu
2024-10-31T17:48:18Z
www.youtube.com
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-Wise LoRA
1 points
og_kalu
2024-10-29T15:53:37Z
arxiv.org
Solving Global Lyapunov functions: open problem in mathematics with transformers
2 points
og_kalu
2024-10-27T16:36:50Z
arxiv.org
ChatGPT Topped 3B Visits in September
2 points
og_kalu
2024-10-18T19:09:03Z
www.similarweb.com
Tx-LLM: Supporting therapeutic development with large language models
2 points
og_kalu
2024-10-14T17:30:45Z
research.google
Tx-LLM: Supporting therapeutic development with large language models
2 points
og_kalu
2024-10-09T23:53:37Z
research.google
Visual Autoregressive Modeling: Image Generation via Next-Resolution Prediction
1 points
og_kalu
2024-10-05T14:15:51Z
arxiv.org
1
2
3
4
5
6
7
8
9