Nick Merrill

Karma: −1

Research at the Forecasting Research Institute. Previously U.C. Berkeley Center for Long-Term Cybersecurity. I’m interested in interpretability, particularly introspection and introspective access. https://else.how

Emergent introspection does not replicate on Llama-3.1-405B

Nick Merrill, 11 May 2026 4:05 UTC
0 points
0 comments, 6 min read