Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by tosh
LLMs: Common HParam Settings
1 points
tosh
2023-12-28T14:51:55Z
docs.google.com
Iron Lung
2 points
tosh
2023-12-28T14:38:25Z
en.wikipedia.org
AirLLM
2 points
tosh
2023-12-28T14:25:08Z
github.com
Capybara dataset is now open-source and available
1 points
tosh
2023-12-28T14:14:53Z
old.reddit.com
Pressure-tested the most popular open-source LLMs
1 points
tosh
2023-12-28T14:11:30Z
old.reddit.com
Why do countries change their name?
2 points
tosh
2023-12-28T11:11:32Z
www.bbc.com
Yi-34B-200K-DARE-merge-v5
1 points
tosh
2023-12-27T19:52:49Z
huggingface.co
Extreme brainstorming questions to trigger new, better ideas
2 points
tosh
2023-12-27T18:02:04Z
longform.asmartbear.com
Kamal and Hetzner 50x cheaper than Heroku
5 points
tosh
2023-12-27T18:01:33Z
twitter.com
Ultra-Low Inference Latency with LLaMA 65B on PyTorch/XLA
2 points
tosh
2023-12-27T17:05:20Z
pytorch.org
MiniMA-2-3B
2 points
tosh
2023-12-27T16:56:35Z
huggingface.co
Kaggle: GPU Notebooks with 4 CPUs, 29 GBs of RAM
3 points
tosh
2023-12-27T16:52:33Z
www.kaggle.com
Solar 10.7B
3 points
tosh
2023-12-27T14:03:36Z
huggingface.co
Japan Preparing EU-Style Law to Force Apple to Allow App Sideloading
2 points
tosh
2023-12-27T12:27:31Z
www.macrumors.com
Nous-Hermes-2-Yi-34B
1 points
tosh
2023-12-26T19:35:54Z
old.reddit.com
Understanding and Using Supervised Fine-Tuning (SFT) for Language Models
1 points
tosh
2023-12-26T19:28:08Z
cameronrwolfe.substack.com
YAYI 2
1 points
tosh
2023-12-26T15:00:58Z
arxiv.org
Metal-flash-attention: Faster alternative to Metal Performance Shaders
1 points
tosh
2023-12-25T20:26:00Z
github.com
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
1 points
tosh
2023-12-25T19:41:00Z
arxiv.org
Mixtral_7Bx2_MoE
2 points
tosh
2023-12-24T14:19:28Z
huggingface.co
218
219
220
221
222
223
224
225
226
227