12 ca­reer-re­lated ques­tions that may (or may not) be helpful for peo­ple in­ter­ested in al­ign­ment research

Akash12 Dec 2022 22:36 UTC
20 points
0 comments2 min readLW link

Con­cept ex­trap­o­la­tion for hy­poth­e­sis generation

12 Dec 2022 22:09 UTC
20 points
2 comments3 min readLW link

Let’s go meta: Gram­mat­i­cal knowl­edge and self-refer­en­tial sen­tences [ChatGPT]

Bill Benzon12 Dec 2022 21:50 UTC
5 points
0 comments9 min readLW link

D&D.Sci De­cem­ber 2022 Eval­u­a­tion and Ruleset

abstractapplic12 Dec 2022 21:21 UTC
14 points
7 comments2 min readLW link

Log-odds are bet­ter than Probabilities

Robert_AIZI12 Dec 2022 20:10 UTC
22 points
4 comments4 min readLW link
(aizi.substack.com)

Ben­galuru LW/​ACX So­cial Meetup—De­cem­ber 2022

faiz12 Dec 2022 19:30 UTC
4 points
0 comments1 min readLW link

Psy­cholog­i­cal Di­sor­ders and Problems

12 Dec 2022 18:15 UTC
39 points
6 comments1 min readLW link

Con­fus­ing the goal and the path

adamShimi12 Dec 2022 16:42 UTC
44 points
7 comments1 min readLW link
(epistemologicalvigilance.substack.com)

Mean­ingful things are those the uni­verse pos­sesses a se­man­tics for

Abhimanyu Pallavi Sudhir12 Dec 2022 16:03 UTC
16 points
14 comments14 min readLW link

Trade­offs in com­plex­ity, ab­strac­tion, and generality

12 Dec 2022 15:55 UTC
32 points
0 comments2 min readLW link

Green Line Ex­ten­sion Open­ing Dates

jefftk12 Dec 2022 14:40 UTC
12 points
0 comments1 min readLW link
(www.jefftk.com)

Join the AI Test­ing Hackathon this Friday

Esben Kran12 Dec 2022 14:24 UTC
10 points
0 comments1 min readLW link

Side-chan­nels: in­put ver­sus output

davidad12 Dec 2022 12:32 UTC
44 points
16 comments2 min readLW link

Take 9: No, RLHF/​IDA/​de­bate doesn’t solve outer al­ign­ment.

Charlie Steiner12 Dec 2022 11:51 UTC
33 points
14 comments2 min readLW link

Creat­ing a database for base rates

nikos12 Dec 2022 10:09 UTC
2 points
1 comment3 min readLW link
(forum.effectivealtruism.org)

Triv­ial GPT-3.5 limi­ta­tion workaround

Dave Lindbergh12 Dec 2022 8:42 UTC
5 points
4 comments1 min readLW link

Ponzi schemes can be highly prof­itable if your timing is good

GeneSmith12 Dec 2022 6:42 UTC
10 points
18 comments5 min readLW link

Prod­ding ChatGPT to solve a ba­sic alge­bra problem

shminux12 Dec 2022 4:09 UTC
14 points
6 comments1 min readLW link
(twitter.com)

Wider De­fault Au­dio Player in Chrome?

jefftk12 Dec 2022 3:30 UTC
11 points
2 comments1 min readLW link
(www.jefftk.com)

A brain­teaser for lan­guage models

Adam Scherlis12 Dec 2022 2:43 UTC
47 points
3 comments2 min readLW link

a rough sketch of for­mal al­igned AI us­ing QACI

Tamsin Leake11 Dec 2022 23:40 UTC
14 points
0 comments4 min readLW link
(carado.moe)

Bench­marks for Com­par­ing Hu­man and AI Intelligence

MrThink11 Dec 2022 22:06 UTC
8 points
4 comments2 min readLW link

Reflec­tions on the PIBBSS Fel­low­ship 2022

11 Dec 2022 21:53 UTC
32 points
0 comments18 min readLW link

A crisis for on­line com­mu­ni­ca­tion: bots and bot users will over­run the In­ter­net?

Mitchell_Porter11 Dec 2022 21:11 UTC
15 points
11 comments1 min readLW link

Finite Fac­tored Sets in Pictures

Magdalena Wache11 Dec 2022 18:49 UTC
174 points
35 comments12 min readLW link

For­mal­iza­tion as sus­pen­sion of intuition

adamShimi11 Dec 2022 15:16 UTC
54 points
18 comments1 min readLW link
(epistemologicalvigilance.substack.com)

An ar­gu­ment on an­i­mal con­scious­ness (so­lic­it­ing crit­i­cism)

SciHamster11 Dec 2022 15:12 UTC
−3 points
2 comments1 min readLW link

ChatGPT’s new novel ra­tio­nal­ity tech­nique of fact checking

ChristianKl11 Dec 2022 13:54 UTC
−14 points
7 comments1 min readLW link

Refram­ing in­ner alignment

davidad11 Dec 2022 13:53 UTC
53 points
13 comments4 min readLW link

A poem about ap­plied ra­tio­nal­ity by ChatGPT

ChristianKl11 Dec 2022 13:43 UTC
4 points
0 comments1 min readLW link

ChatGPT goes through a worm­hole hole in our Shandyesque uni­verse [vir­tual wacky weed]

Bill Benzon11 Dec 2022 11:59 UTC
−1 points
2 comments3 min readLW link

Us­ing Ob­sidian if you’re used to us­ing Roam

Solenoid_Entity11 Dec 2022 8:59 UTC
19 points
4 comments2 min readLW link

[fic­tion] Our Fi­nal Hour

Mati_Roy11 Dec 2022 5:49 UTC
17 points
5 comments3 min readLW link

Con­sider us­ing re­versible au­tomata for al­ign­ment research

Alex_Altair11 Dec 2022 1:00 UTC
88 points
30 comments2 min readLW link

High level dis­course struc­ture in ChatGPT: Part 2 [Quasi-sym­bolic?]

Bill Benzon10 Dec 2022 22:26 UTC
7 points
0 comments6 min readLW link

Poll Re­sults on AGI

Niclas Kupper10 Dec 2022 21:25 UTC
18 points
0 comments2 min readLW link

Reflect­ing on the 2022 Guild of the Rose Workshops

moridinamael10 Dec 2022 21:21 UTC
26 points
7 comments8 min readLW link

[Question] Rev­ers­ing a quan­tum simu­la­tion on the plane­tary scale

Mythopoeist10 Dec 2022 20:26 UTC
2 points
3 comments1 min readLW link

ACX Zurich De­cem­ber Meetup

MB10 Dec 2022 19:23 UTC
1 point
0 comments1 min readLW link

FMT: a great op­por­tu­nity for soon-to-be parents

Anton Rodenhauser10 Dec 2022 17:56 UTC
7 points
0 comments6 min readLW link

[ASoT] Nat­u­ral ab­strac­tions and AlphaZero

Ulisse Mini10 Dec 2022 17:53 UTC
33 points
1 comment1 min readLW link
(arxiv.org)

[Question] How promis­ing are le­gal av­enues to re­strict AI train­ing data?

thehalliard10 Dec 2022 16:31 UTC
9 points
2 comments1 min readLW link

In­spira­tion as a Scarce Resource

zenbu zenbu zenbu zenbu10 Dec 2022 15:23 UTC
7 points
0 comments4 min readLW link
(inflorescence.substack.com)

Will Man­i­fold Mar­kets/​Me­tac­u­lus have built-in sup­port for re­flec­tive la­tent vari­ables by 2025?

tailcalled10 Dec 2022 13:55 UTC
34 points
0 comments1 min readLW link

My thoughts on OpenAI’s Align­ment plan

Donald Hobson10 Dec 2022 10:35 UTC
25 points
1 comment6 min readLW link

[Question] How would you im­prove ChatGPT’s fil­ter­ing?

Noah Scales10 Dec 2022 8:05 UTC
9 points
6 comments1 min readLW link

[Question] A thought experiment

sisyphus10 Dec 2022 5:23 UTC
3 points
12 comments1 min readLW link

pa­tio11′s “Ob­ser­va­tions from an EA-ad­ja­cent (?) char­i­ta­ble effort”

RobertM10 Dec 2022 0:27 UTC
43 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

A dy­nam­i­cal sys­tems primer for en­tropy and optimization

Alex_Altair10 Dec 2022 0:13 UTC
47 points
3 comments7 min readLW link

[Linkpost] The Story Of VaccinateCA

hath9 Dec 2022 23:54 UTC
103 points
4 comments10 min readLW link
(www.worksinprogress.co)