Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by og_kalu
Multimodal Neurons in Pretrained Text-Only Transformers
66 points
og_kalu
2023-08-04T12:25:45Z
huggingface.co
From Sparse to Soft Mixtures of Experts. Outperforms Dense/Sparse models
2 points
og_kalu
2023-08-03T13:32:07Z
arxiv.org
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models
1 points
og_kalu
2023-08-02T15:33:27Z
arxiv.org
Communicative LLM Agents for Software Development
1 points
og_kalu
2023-07-31T14:56:51Z
arxiv.org
GPT-4 Vision
4 points
og_kalu
2023-07-25T14:23:54Z
imgur.com
Generating songs with coherent speech and sound effects
8 points
og_kalu
2023-07-20T17:09:37Z
suno-ai.notion.site
Does Visual Pretraining Help End-to-End Reasoning?
1 points
og_kalu
2023-07-18T13:49:15Z
arxiv.org
1 points
og_kalu
2023-07-13T12:10:00Z
news.ycombinator.com
One Embedder, Any Task: Instruction-Finetuned Text Embeddings
1 points
og_kalu
2023-07-12T21:19:54Z
arxiv.org
Model card and evaluations for Claude models [pdf]
60 points
og_kalu
2023-07-11T15:00:24Z
www-files.anthropic.com
Large Language Models can complete complex non linguistic patterns in context
2 points
og_kalu
2023-07-11T10:58:24Z
huggingface.co
Large Language Models as General Pattern Machines
1 points
og_kalu
2023-07-11T09:41:58Z
general-pattern-machines.github.io
Teaching Arithmetic to Small Transformers
3 points
og_kalu
2023-07-10T15:44:29Z
arxiv.org
GPT-4 solves Mystery-o-Matic's Mystery Puzzle of the day
5 points
og_kalu
2023-07-08T23:09:04Z
old.reddit.com
XTrimoPGLM: Unified 100B-Scale Transformer for Deciphering the Protein Language
3 points
og_kalu
2023-07-08T16:38:54Z
www.biorxiv.org
Instruct tuned Mixture of Experts LLMs significantly surpass dense counterparts
2 points
og_kalu
2023-07-07T12:46:00Z
arxiv.org
Building Cooperative Embodied Agents Modularly with Large Language Models
2 points
og_kalu
2023-07-07T00:41:09Z
vis-www.cs.umass.edu
KokoMind: Can LLMs Understand Social Interactions?
2 points
og_kalu
2023-07-06T16:03:57Z
github.com
KokoMind: Can LLMs Understand Social Interactions?
2 points
og_kalu
2023-07-06T02:55:35Z
chats-lab.github.io
LongNet: Scaling Transformers to 1B Tokens
15 points
og_kalu
2023-07-06T02:41:54Z
arxiv.org
2
3
4
5
6
7
8
9