Gradient Hacking

Some real examples of gradient hacking

Oliver Sourbut, 22 Nov 2021 0:11 UTC
7 points
4 comments, 2 min read, LW link

Towards Deconfusing Gradient Hacking

leogao, 24 Oct 2021 0:43 UTC
25 points
1 comment, 12 min read, LW link

Gradient hacking

evhub, 16 Oct 2019 0:53 UTC
94 points
39 comments, 3 min read, LW link, 2 reviews

[Question] How does Gradient Descent Interact with Goodhart?

Scott Garrabrant, 2 Feb 2019 0:14 UTC
68 points
19 comments, 4 min read, LW link

Thoughts on gradient hacking

Richard_Ngo, 3 Sep 2021 13:02 UTC
32 points
12 comments, 4 min read, LW link

Approaches to gradient hacking

adamShimi, 14 Aug 2021 15:16 UTC
16 points
7 comments, 8 min read, LW link

Gradient hacking: definitions and examples

Richard_Ngo, 29 Jun 2022 21:35 UTC
19 points
0 comments, 5 min read, LW link

Meta learning to gradient hack

Quintin Pope, 1 Oct 2021 19:25 UTC
45 points
10 comments, 3 min read, LW link

Obstacles to gradient hacking

leogao, 5 Sep 2021 22:42 UTC
21 points
11 comments, 4 min read, LW link

Understanding Gradient Hacking

peterbarnett, 10 Dec 2021 15:58 UTC
30 points
5 comments, 30 min read, LW link

Some motivations to gradient hack

peterbarnett, 17 Dec 2021 3:06 UTC
7 points
0 comments, 6 min read, LW link

Gradient Hacking via Schelling Goals

Adam Scherlis, 28 Dec 2021 20:38 UTC
30 points
4 comments, 4 min read, LW link

Is Fisherian Runaway Gradient Hacking?

Ryan Kidd, 10 Apr 2022 13:47 UTC
15 points
7 comments, 4 min read, LW link

A Toy Model of Gradient Hacking

Oam Patel, 20 Jun 2022 22:01 UTC
22 points
7 comments, 4 min read, LW link

Crystalizing an agent’s objective: how inner-misalignment could work in our favor

Josh, 16 Jun 2022 3:30 UTC
10 points
9 comments, 4 min read, LW link