Anthropic

Tag. Last edit: 25 Dec 2021 4:12 UTC by Multicore

Anthropic is an AI organization.

Not to be confused with anthropics.

Toy Models of Superposition

evhub · 21 Sep 2022 23:48 UTC
63 points
2 comments · 5 min read · LW link
(transformer-circuits.pub)

Mechanistic Interpretability for the MLP Layers (rough early thoughts)

MadHatter · 24 Dec 2021 7:24 UTC
11 points
2 comments · 1 min read · LW link
(www.youtube.com)

A challenge for AGI organizations, and a challenge for readers

1 Dec 2022 23:11 UTC
253 points
25 comments · 2 min read · LW link

A Summary Of Anthropic’s First Paper

Sam Ringer · 30 Dec 2021 0:48 UTC
79 points
0 comments · 8 min read · LW link

How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?

Owain_Evans · 26 Feb 2022 12:46 UTC
42 points
3 comments · 11 min read · LW link

Transformer Circuits

evhub · 22 Dec 2021 21:09 UTC
142 points
4 comments · 3 min read · LW link
(transformer-circuits.pub)

Anthropic’s SoLU (Softmax Linear Unit)

Joel Burget · 4 Jul 2022 18:38 UTC
15 points
1 comment · 4 min read · LW link
(transformer-circuits.pub)

The limited upside of interpretability

Peter S. Park · 15 Nov 2022 18:46 UTC
13 points
10 comments · 1 min read · LW link