larry-dial
Karma: 108
Finding the uncertainty vector in GPT2-scale transformers
larry-dial, 23 Nov 2025 23:34 UTC, 9 points, 0 comments, 10 min read
How the NanoGPT Speedrun WR dropped by 20% in 3 months
larry-dial, 5 Oct 2025 1:05 UTC, 56 points, 9 comments, 9 min read
The Curious Case of the bos_token
larry-dial, 17 Jun 2025 19:00 UTC, 26 points, 4 comments, 10 min read