Stories by shreyansh26
Deriving the gradient for the backward pass of Layer Normalization
3 points
shreyansh26
2025-06-05T03:02:32Z
shreyansh26.github.io
1 point
shreyansh26
2025-03-23T17:57:41Z
news.ycombinator.com
GTC'25 Notes: CUDA Techniques to Maximize Memory Bandwidth – Part 1
1 point
shreyansh26
2025-03-23T16:50:36Z
shreyansh26.github.io
FlashAttention in PyTorch
2 points
shreyansh26
2023-06-14T04:38:11Z
github.com
Understanding FlashAttention
2 points
shreyansh26
2023-03-31T03:36:41Z
shreyansh26.github.io
Ask HN: What are some good resources on Recommender Systems?
14 points
shreyansh26
2022-11-25T01:24:28Z
news.ycombinator.com