Job Board (28 March 2033)

dr_s28 Mar 2023 22:44 UTC
20 points
1 comment3 min readLW link

Four lenses on AI risks

jasoncrawford28 Mar 2023 21:52 UTC
23 points
5 comments3 min readLW link
(rootsofprogress.org)

Some com­mon con­fu­sion about in­duc­tion heads

Alexandre Variengien28 Mar 2023 21:51 UTC
46 points
4 comments5 min readLW link

Draft: The op­ti­miza­tion toolbox

Alex_Altair28 Mar 2023 20:40 UTC
13 points
1 comment7 min readLW link

Inch­ing “Kubla Khan” and GPT into the same in­tel­lec­tual frame­work @ 3 Quarks Daily

Bill Benzon28 Mar 2023 19:50 UTC
5 points
0 comments3 min readLW link

A rough and in­com­plete re­view of some of John Went­worth’s research

So8res28 Mar 2023 18:52 UTC
175 points
17 comments18 min readLW link

[Question] How do you man­age your in­puts?

Mateusz Bagiński28 Mar 2023 18:26 UTC
15 points
3 comments1 min readLW link

Chat­bot con­vinces Bel­gian to com­mit suicide

Jeroen De Ryck28 Mar 2023 18:14 UTC
60 points
18 comments3 min readLW link
(www.standaard.be)

A Primer On Chaos

johnswentworth28 Mar 2023 18:01 UTC
53 points
9 comments9 min readLW link

[Question] How likely are sce­nar­ios where AGI ends up overtly or de facto tor­tur­ing us? How likely are sce­nar­ios where AGI pre­vents us from com­mit­ting suicide or dy­ing?

JohnGreer28 Mar 2023 18:00 UTC
11 points
4 comments1 min readLW link

How do we al­ign hu­mans and what does it mean for the new Con­jec­ture’s strategy

Igor Ivanov28 Mar 2023 17:54 UTC
7 points
4 comments7 min readLW link

Govern­ing High-Im­pact AI Sys­tems: Un­der­stand­ing Canada’s Pro­posed AI Bill. April 15, Car­leton Univer­sity, Ottawa

Liav Koren28 Mar 2023 17:48 UTC
11 points
1 comment1 min readLW link
(forum.effectivealtruism.org)

I had a chat with GPT-4 on the fu­ture of AI and AI safety

Kristian Freed28 Mar 2023 17:47 UTC
1 point
0 comments8 min readLW link

LessWrong Hangout

Raymond Koopmanschap28 Mar 2023 17:47 UTC
0 points
0 comments1 min readLW link

Half-baked al­ign­ment idea

ozb28 Mar 2023 17:47 UTC
6 points
27 comments1 min readLW link

[Question] Solv­ing Mys­ter­ies -

Phib28 Mar 2023 17:46 UTC
1 point
0 comments1 min readLW link

Some of My Cur­rent Im­pres­sions En­ter­ing AI Safety

Phib28 Mar 2023 17:46 UTC
2 points
0 comments2 min readLW link

[Question] Why do the Se­quences say that “Löb’s The­o­rem shows that a math­e­mat­i­cal sys­tem can­not as­sert its own sound­ness with­out be­com­ing in­con­sis­tent.”?

Thoth Hermes28 Mar 2023 17:19 UTC
12 points
30 comments1 min readLW link

Cor­rigi­bil­ity, Self-Dele­tion, and Iden­ti­cal Strawberries

Robert_AIZI28 Mar 2023 16:54 UTC
8 points
2 comments6 min readLW link
(aizi.substack.com)

[Question] Why no ma­jor LLMs with mem­ory?

Kaj_Sotala28 Mar 2023 16:34 UTC
41 points
15 comments1 min readLW link

Re­sponse to Tyler Cowen’s Ex­is­ten­tial risk, AI, and the in­evitable turn in hu­man history

Zvi28 Mar 2023 16:00 UTC
72 points
27 comments20 min readLW link
(thezvi.wordpress.com)

Adapt­ing to Change: Over­com­ing Chronos­ta­sis in AI Lan­guage Models

RationalMindset28 Mar 2023 14:32 UTC
−1 points
0 comments6 min readLW link

Feel­ing Progress as Motivation

Sable28 Mar 2023 9:11 UTC
4 points
1 comment3 min readLW link
(affablyevil.substack.com)

Be Not Afraid

Alex Beyman28 Mar 2023 8:12 UTC
−12 points
0 comments6 min readLW link

Creat­ing a fam­ily with GPT-4

Kaj_Sotala28 Mar 2023 6:40 UTC
23 points
3 comments10 min readLW link
(kajsotala.fi)

Some 2-4-6 problems

abstractapplic28 Mar 2023 6:32 UTC
28 points
9 comments1 min readLW link
(h-b-p.github.io)

[Question] Deep fold­ing docs site?

mcint28 Mar 2023 6:01 UTC
−1 points
2 comments1 min readLW link

[Question] Why does ad­vanced AI want not to be shut down?

RedFishBlueFish28 Mar 2023 4:26 UTC
3 points
19 comments1 min readLW link

100 Din­ners And A Work­shop: In­for­ma­tion Preser­va­tion And Goals

Stephen Fowler28 Mar 2023 3:13 UTC
8 points
0 comments7 min readLW link

De­mons from the 5&10verse!

Slimepriestess28 Mar 2023 2:41 UTC
−3 points
15 comments4 min readLW link
(voidgoddess.org)

[Question] Can GPT-4 play 20 ques­tions against an­other in­stance of it­self?

Nathan Helm-Burger28 Mar 2023 1:11 UTC
15 points
1 comment1 min readLW link
(evanthebouncy.medium.com)

Ge­offrey Hin­ton—Full “not in­con­ceiv­able” quote

WilliamKiely28 Mar 2023 0:22 UTC
21 points
2 comments2 min readLW link

An A.I. Safety Pre­sen­ta­tion at RIT

NicholasKross27 Mar 2023 23:49 UTC
8 points
0 comments1 min readLW link
(www.youtube.com)

Which AI out­puts should hu­mans check for shenani­gans, to avoid AI takeover? A sim­ple model

Tom Davidson27 Mar 2023 23:36 UTC
16 points
3 comments8 min readLW link

The Prospect of an AI Winter

Erich_Grunewald27 Mar 2023 20:55 UTC
62 points
24 comments15 min readLW link
(www.erichgrunewald.com)

[Question] Best ar­gu­ments against the out­side view that AGI won’t be a huge deal, thus we sur­vive.

Noosphere8927 Mar 2023 20:49 UTC
4 points
7 comments1 min readLW link

EA & LW Fo­rum Weekly Sum­mary (20th − 26th March 2023)

Zoe Williams27 Mar 2023 20:46 UTC
4 points
0 comments1 min readLW link

Three of my be­liefs about up­com­ing AGI

Robert_AIZI27 Mar 2023 20:27 UTC
6 points
0 comments3 min readLW link
(aizi.substack.com)

No­body knows how to re­li­ably test for AI safety

marcusarvan27 Mar 2023 19:48 UTC
1 point
0 comments5 min readLW link

New blog: Planned Obsolescence

Ajeya Cotra27 Mar 2023 19:46 UTC
96 points
7 comments1 min readLW link
(www.planned-obsolescence.org)

South Bay ACX/​SSC Spring Mee­tups Everywhere

allisona27 Mar 2023 19:39 UTC
2 points
0 comments1 min readLW link

[Question] Re­sources to see how peo­ple think/​ap­proach math­e­mat­ics and prob­lem-solving

zef27 Mar 2023 19:12 UTC
7 points
2 comments1 min readLW link

Stag­ger­ing Hunters

Screwtape27 Mar 2023 19:11 UTC
12 points
2 comments5 min readLW link

Neu­rotech­nol­ogy is Crit­i­cal for AI Alignment

Milan Cvitkovic27 Mar 2023 18:27 UTC
10 points
3 comments1 min readLW link
(milan.cvitkovic.net)

[Question] Best re­sources to learn philos­o­phy of mind and AI?

Sky Moo27 Mar 2023 18:22 UTC
1 point
0 comments1 min readLW link

the ten­sor is a lonely place

jml627 Mar 2023 18:22 UTC
−11 points
0 comments4 min readLW link
(ekjsgrjelrbno.substack.com)

[Question] Ber­mudez In­ter­face Problem

Motor Vehicle27 Mar 2023 18:11 UTC
1 point
2 comments1 min readLW link

Would you be a bet­ter RLHF la­beler than GPT-4?

kache27 Mar 2023 18:10 UTC
1 point
1 comment1 min readLW link

LLM Pow­ered LW Search

odraode1727 Mar 2023 18:09 UTC
−1 points
0 comments1 min readLW link

An­nounc­ing the Swiss Ex­is­ten­tial Risk Ini­ti­a­tive (CHERI) 2023 Re­search Fellowship

Tobias H27 Mar 2023 16:36 UTC
3 points
0 comments1 min readLW link