a rough sketch of for­mal al­igned AI us­ing QACI

Tamsin Leake11 Dec 2022 23:40 UTC
14 points
0 comments4 min readLW link
(carado.moe)

Bench­marks for Com­par­ing Hu­man and AI Intelligence

MrThink11 Dec 2022 22:06 UTC
8 points
4 comments2 min readLW link

Reflec­tions on the PIBBSS Fel­low­ship 2022

11 Dec 2022 21:53 UTC
32 points
0 comments18 min readLW link

A crisis for on­line com­mu­ni­ca­tion: bots and bot users will over­run the In­ter­net?

Mitchell_Porter11 Dec 2022 21:11 UTC
15 points
11 comments1 min readLW link

Finite Fac­tored Sets in Pictures

Magdalena Wache11 Dec 2022 18:49 UTC
174 points
35 comments12 min readLW link

For­mal­iza­tion as sus­pen­sion of intuition

adamShimi11 Dec 2022 15:16 UTC
54 points
18 comments1 min readLW link
(epistemologicalvigilance.substack.com)

An ar­gu­ment on an­i­mal con­scious­ness (so­lic­it­ing crit­i­cism)

SciHamster11 Dec 2022 15:12 UTC
−3 points
2 comments1 min readLW link

ChatGPT’s new novel ra­tio­nal­ity tech­nique of fact checking

ChristianKl11 Dec 2022 13:54 UTC
−14 points
7 comments1 min readLW link

Refram­ing in­ner alignment

davidad11 Dec 2022 13:53 UTC
53 points
13 comments4 min readLW link

A poem about ap­plied ra­tio­nal­ity by ChatGPT

ChristianKl11 Dec 2022 13:43 UTC
4 points
0 comments1 min readLW link

ChatGPT goes through a worm­hole hole in our Shandyesque uni­verse [vir­tual wacky weed]

Bill Benzon11 Dec 2022 11:59 UTC
−1 points
2 comments3 min readLW link

Us­ing Ob­sidian if you’re used to us­ing Roam

Solenoid_Entity11 Dec 2022 8:59 UTC
19 points
4 comments2 min readLW link

[fic­tion] Our Fi­nal Hour

Mati_Roy11 Dec 2022 5:49 UTC
17 points
5 comments3 min readLW link

Con­sider us­ing re­versible au­tomata for al­ign­ment research

Alex_Altair11 Dec 2022 1:00 UTC
88 points
30 comments2 min readLW link

High level dis­course struc­ture in ChatGPT: Part 2 [Quasi-sym­bolic?]

Bill Benzon10 Dec 2022 22:26 UTC
7 points
0 comments6 min readLW link

Poll Re­sults on AGI

Niclas Kupper10 Dec 2022 21:25 UTC
18 points
0 comments2 min readLW link

Reflect­ing on the 2022 Guild of the Rose Workshops

moridinamael10 Dec 2022 21:21 UTC
26 points
7 comments8 min readLW link

[Question] Rev­ers­ing a quan­tum simu­la­tion on the plane­tary scale

Mythopoeist10 Dec 2022 20:26 UTC
2 points
3 comments1 min readLW link

ACX Zurich De­cem­ber Meetup

MB10 Dec 2022 19:23 UTC
1 point
0 comments1 min readLW link

FMT: a great op­por­tu­nity for soon-to-be parents

Anton Rodenhauser10 Dec 2022 17:56 UTC
7 points
0 comments6 min readLW link

[ASoT] Nat­u­ral ab­strac­tions and AlphaZero

Ulisse Mini10 Dec 2022 17:53 UTC
33 points
1 comment1 min readLW link
(arxiv.org)

[Question] How promis­ing are le­gal av­enues to re­strict AI train­ing data?

thehalliard10 Dec 2022 16:31 UTC
9 points
2 comments1 min readLW link

In­spira­tion as a Scarce Resource

zenbu zenbu zenbu zenbu10 Dec 2022 15:23 UTC
7 points
0 comments4 min readLW link
(inflorescence.substack.com)

Will Man­i­fold Mar­kets/​Me­tac­u­lus have built-in sup­port for re­flec­tive la­tent vari­ables by 2025?

tailcalled10 Dec 2022 13:55 UTC
34 points
0 comments1 min readLW link

My thoughts on OpenAI’s Align­ment plan

Donald Hobson10 Dec 2022 10:35 UTC
25 points
1 comment6 min readLW link

[Question] How would you im­prove ChatGPT’s fil­ter­ing?

Noah Scales10 Dec 2022 8:05 UTC
9 points
6 comments1 min readLW link

[Question] A thought experiment

sisyphus10 Dec 2022 5:23 UTC
3 points
12 comments1 min readLW link

pa­tio11′s “Ob­ser­va­tions from an EA-ad­ja­cent (?) char­i­ta­ble effort”

RobertM10 Dec 2022 0:27 UTC
43 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

A dy­nam­i­cal sys­tems primer for en­tropy and optimization

Alex_Altair10 Dec 2022 0:13 UTC
47 points
3 comments7 min readLW link

[Linkpost] The Story Of VaccinateCA

hath9 Dec 2022 23:54 UTC
103 points
4 comments10 min readLW link
(www.worksinprogress.co)

Pro­saic mis­al­ign­ment from the Solomonoff Predictor

Cleo Nardo9 Dec 2022 17:53 UTC
40 points
2 comments5 min readLW link

Take 8: Queer the in­ner/​outer al­ign­ment di­chotomy.

Charlie Steiner9 Dec 2022 17:46 UTC
28 points
2 comments2 min readLW link

[Question] Does a LLM have a util­ity func­tion?

Dagon9 Dec 2022 17:19 UTC
17 points
11 comments1 min readLW link

Monthly Roundup #1

Zvi9 Dec 2022 17:10 UTC
31 points
6 comments21 min readLW link
(thezvi.wordpress.com)

Work­ing to­wards AI al­ign­ment is better

Johannes C. Mayer9 Dec 2022 15:39 UTC
8 points
2 comments2 min readLW link

You can still fetch the coffee to­day if you’re dead tomorrow

davidad9 Dec 2022 14:06 UTC
84 points
19 comments5 min readLW link

ChatGPT’s Misal­ign­ment Isn’t What You Think

stavros9 Dec 2022 11:11 UTC
3 points
12 comments1 min readLW link

ML Safety at NeurIPS & Paradig­matic AI Safety? MLAISU W49

9 Dec 2022 10:38 UTC
19 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

[Question] What are your thoughts on the fu­ture of AI-as­sisted soft­ware de­vel­op­ment?

RomanHauksson9 Dec 2022 10:04 UTC
4 points
4 comments1 min readLW link

Fear miti­gated the nu­clear threat, can it do the same to AGI risks?

Igor Ivanov9 Dec 2022 10:04 UTC
6 points
8 comments5 min readLW link

Set­ting the Zero Point

[DEACTIVATED] Duncan Sabien9 Dec 2022 6:06 UTC
90 points
43 comments20 min readLW link1 review

Sys­tems of Survival

Vaniver9 Dec 2022 5:13 UTC
63 points
5 comments5 min readLW link

[Question] Do You Have an In­ter­nal Monologue?

belkarx9 Dec 2022 3:04 UTC
23 points
7 comments1 min readLW link

[Question] How is the “sharp left turn defined”?

Chris_Leong9 Dec 2022 0:04 UTC
14 points
4 comments1 min readLW link

Linkpost for a gen­er­al­ist al­gorith­mic learner: ca­pa­ble of car­ry­ing out sort­ing, short­est paths, string match­ing, con­vex hull find­ing in one network

lovetheusers9 Dec 2022 0:02 UTC
7 points
1 comment1 min readLW link
(twitter.com)

[Question] Where’s the eco­nomic in­cen­tive for wok­ism com­ing from?

Valentine8 Dec 2022 23:28 UTC
12 points
105 comments1 min readLW link

I Believe we are in a Hard­ware Overhang

nem8 Dec 2022 23:18 UTC
8 points
0 comments1 min readLW link

Of pump­kins, the Fal­con Heavy, and Grou­cho Marx: High-Level dis­course struc­ture in ChatGPT

Bill Benzon8 Dec 2022 22:25 UTC
2 points
0 comments8 min readLW link

How Many Lives Does X-Risk Work Save From Nonex­is­tence On Aver­age?

Jordan Arel8 Dec 2022 21:57 UTC
4 points
5 comments14 min readLW link

AI Safety Seems Hard to Measure

HoldenKarnofsky8 Dec 2022 19:50 UTC
71 points
6 comments14 min readLW link
(www.cold-takes.com)