atticusw

Karma: 17

atticusw 5 Mar 2026 19:09 UTC
1 point
0
on: Tools to generate realistic prompts help surprisingly little with Petri audit realism
In figure 3, given that 64-shot Haiku does a lot worse than 64-shot Llama 405B-base, should I conclude that base models (without the assistant persona) are way better at generating realistic user prompts?

[CS2881r] Optimizing Prompts with Reinforcement Learning

Anastasia Ahani and atticusw

1 Oct 2025 14:02 UTC

2 points

0 comments5 min readLW link

[CS 2881r AI Safety] [Week 1] Introduction

bira, nsiwek and atticusw

14 Sep 2025 19:52 UTC

17 points

0 comments13 min readLW link

atticusw 23 Mar 2025 19:05 UTC
2 points
0
on: Logits, log-odds, and loss for parallel circuits
What is the formal statement of the fact that logits add for perfectly calibrated and maximally independent predictions?