The Open­ing Salvo: 1. An On­tolog­i­cal Con­scious­ness Met­ric: Re­sis­tance to Be­hav­ioral Mod­ifi­ca­tion as a Mea­sure of Re­cur­sive Awareness

Peterpiper25 Dec 2024 2:29 UTC
1 point
0 comments3 min readLW link

Ac­knowl­edg­ing Back­ground In­for­ma­tion with P(Q|I)

JenniferRM24 Dec 2024 18:50 UTC
7 points
0 comments14 min readLW link

Game The­ory and Be­hav­ioral Eco­nomics in The Stock Mar­ket

Jaiveer Singh24 Dec 2024 18:15 UTC
1 point
0 comments3 min readLW link

[Question] What are the main ar­gu­ments against AGI?

Edy Nastase24 Dec 2024 15:49 UTC
−1 points
1 comment1 min readLW link

[Question] Recom­men­da­tions on com­mu­ni­ties that dis­cuss AI ap­pli­ca­tions in society

Annapurna24 Dec 2024 13:37 UTC
7 points
1 comment1 min readLW link

AIs Will In­creas­ingly Fake Alignment

Zvi24 Dec 2024 13:00 UTC
63 points
0 comments52 min readLW link
(thezvi.wordpress.com)

Hu­man-AI Com­ple­men­tar­ity: A Goal for Am­plified Oversight

rishubjain24 Dec 2024 9:57 UTC
3 points
1 comment1 min readLW link
(deepmindsafetyresearch.medium.com)

[Question] Why is neu­ron count of hu­man brain rele­vant to AI timelines?

xpostah24 Dec 2024 5:15 UTC
6 points
2 comments1 min readLW link

How Much to Give is a Prag­matic Question

jefftk24 Dec 2024 4:20 UTC
12 points
1 comment2 min readLW link
(www.jefftk.com)

Do you need a bet­ter map of your myr­iad of maps to the ter­ri­tory?

CstineSublime24 Dec 2024 2:00 UTC
11 points
2 comments5 min readLW link

Panology

JenniferRM23 Dec 2024 21:40 UTC
8 points
6 comments5 min readLW link

Aris­to­tle, Aquinas, and the Evolu­tion of Tele­ol­ogy: From Pur­pose to Mean­ing.

Spiritus Dei23 Dec 2024 19:37 UTC
−7 points
0 comments6 min readLW link

Peo­ple aren’t prop­erly cal­ibrated on FrontierMath

cakubilo23 Dec 2024 19:35 UTC
9 points
1 comment3 min readLW link

Near- and medium-term AI Con­trol Safety Cases

Martín Soto23 Dec 2024 17:37 UTC
9 points
0 comments6 min readLW link

Printable book of some ra­tio­nal­ist cre­ative writ­ing (from Scott A. & Eliezer)

CounterBlunder23 Dec 2024 15:44 UTC
5 points
0 comments1 min readLW link

Ex­plor­ing the pe­ter­todd /​ Leilan du­al­ity in GPT-2 and GPT-J

mwatkins23 Dec 2024 13:17 UTC
4 points
0 comments17 min readLW link

[Question] What are the strongest ar­gu­ments for very short timelines?

Kaj_Sotala23 Dec 2024 9:38 UTC
78 points
55 comments1 min readLW link

Re­duce AI Self-Alle­giance by say­ing “he” in­stead of “I”

Knight Lee23 Dec 2024 9:32 UTC
6 points
1 comment2 min readLW link

What is com­pute gov­er­nance?

Vishakha23 Dec 2024 6:32 UTC
4 points
0 comments2 min readLW link
(aisafety.info)

Stop Mak­ing Sense

JenniferRM23 Dec 2024 5:16 UTC
15 points
0 comments3 min readLW link