In­cor­po­rat­ing Jus­tice The­ory into De­ci­sion Theory

StrivingForLegibility21 Jan 2024 19:17 UTC
13 points
20 comments5 min readLW link

De­liber­ate Dy­sen­tery: Q&A about Hu­man Challenge Trials

Niko_McCarty21 Jan 2024 19:05 UTC
16 points
1 comment18 min readLW link
(www.asimov.press)

When Does Altru­ism Strengthen Altru­ism?

jefftk21 Jan 2024 18:50 UTC
44 points
2 comments3 min readLW link
(www.jefftk.com)

A Shut­down Prob­lem Proposal

21 Jan 2024 18:12 UTC
122 points
61 comments6 min readLW link

Is prin­ci­pled mass-out­reach pos­si­ble, for AGI X-risk?

NicholasKross21 Jan 2024 17:45 UTC
9 points
5 comments3 min readLW link

Vacuum: The­ory and Technologies

ethanmorse21 Jan 2024 17:23 UTC
33 points
0 comments25 min readLW link
(210ethan.github.io)

Another Non-An­thropic Para­dox: The Un­sur­pris­ing Rare­ness of Rare Events

Ape in the coat21 Jan 2024 15:58 UTC
16 points
16 comments6 min readLW link

Book re­view: Cui­sine and Empire

eukaryote21 Jan 2024 6:15 UTC
40 points
2 comments12 min readLW link
(eukaryotewritesblog.com)

Cat­a­logue of POLITICO Re­ports and Other Cited Ar­ti­cles on Effec­tive Altru­ism and AI Safety Con­nec­tions in Wash­ing­ton, DC

Evan_Gaensbauer21 Jan 2024 2:15 UTC
4 points
0 comments1 min readLW link
(docs.google.com)

You can rack up mas­sive amounts of data quickly by ask­ing ques­tions to all your friends

Neil 21 Jan 2024 1:27 UTC
14 points
2 comments2 min readLW link

[Question] Party for biomed­i­cal re­ju­ve­na­tion re­search: Euro­pean par­li­a­ment elections

Iakov Dudinsky21 Jan 2024 0:35 UTC
1 point
0 comments1 min readLW link

[Question] Why have in­surance mar­kets suc­ceeded where pre­dic­tion mar­kets have not?

JNank21 Jan 2024 0:35 UTC
13 points
13 comments1 min readLW link

[linkpost] Self-Re­ward­ing Lan­guage Models

Jacob G-W21 Jan 2024 0:30 UTC
13 points
2 comments1 min readLW link
(arxiv.org)

Why Im­prov­ing Dialogue Feels So Hard

matto20 Jan 2024 21:26 UTC
21 points
8 comments3 min readLW link

Re­search Log, RLLMv2: Phi-1.5, GPT2XL and Fal­con-RW-1B as pa­per­clip maximizers

MiguelDev20 Jan 2024 15:30 UTC
6 points
0 comments10 min readLW link

Against the Bur­den of Knowledge

Maxwell Tabarrok20 Jan 2024 14:37 UTC
21 points
6 comments6 min readLW link
(maximumprogress.substack.com)

legged robot scal­ing laws

bhauth20 Jan 2024 5:45 UTC
34 points
8 comments7 min readLW link
(www.bhauth.com)

Leg­i­bil­ity Makes Log­i­cal Line-Of-Sight Transitive

StrivingForLegibility19 Jan 2024 23:39 UTC
12 points
0 comments5 min readLW link

De­cent plan prize win­ner & highlights

lukehmiles19 Jan 2024 23:30 UTC
25 points
2 comments4 min readLW link

A quick in­ves­ti­ga­tion of AI pro-AI bias

Fabien Roger19 Jan 2024 23:26 UTC
52 points
1 comment2 min readLW link

[Question] What Soft­ware Should Ex­ist?

Tomás B.19 Jan 2024 21:43 UTC
27 points
21 comments1 min readLW link

On “Geeks, MOPs, and So­ciopaths”

19 Jan 2024 21:04 UTC
31 points
35 comments8 min readLW link

There is way too much serendipity

Malmesbury19 Jan 2024 19:37 UTC
351 points
56 comments7 min readLW link

Es­ti­mat­ing effi­ciency im­prove­ments in LLM pre-training

Daan19 Jan 2024 19:32 UTC
42 points
3 comments21 min readLW link

Up­date: Ori­ent­ing Our­selves in 2024 | Guild of the ROSE

moridinamael19 Jan 2024 16:48 UTC
14 points
0 comments1 min readLW link
(guildoftherose.org)

I Want XMP But I Know Why I Can’t Have It

jefftk19 Jan 2024 15:30 UTC
23 points
0 comments3 min readLW link
(www.jefftk.com)

Ar­gu­ments for Ro­bust­ness in AI Alignment

Fabian Schimpf19 Jan 2024 10:24 UTC
2 points
1 comment1 min readLW link

[Question] What ra­tio­nal­ity failure modes are there?

Ulisse Mini19 Jan 2024 9:12 UTC
42 points
11 comments1 min readLW link

[Question] What’s up with on­line me­dia and our abil­ity to get sh*t done?

TeaTieAndHat19 Jan 2024 9:12 UTC
2 points
0 comments6 min readLW link

Log­i­cal Line-Of-Sight Makes Games Se­quen­tial or Loopy

StrivingForLegibility19 Jan 2024 4:05 UTC
38 points
0 comments7 min readLW link

[Question] Are there high-qual­ity sur­veys available de­tailing the rates of polyamory among Amer­i­cans age 18-45 in metropoli­tan ar­eas in the United States?

Evan_Gaensbauer18 Jan 2024 23:50 UTC
23 points
0 comments1 min readLW link

Man­i­fund: 2023 in Review

Austin Chen18 Jan 2024 23:50 UTC
32 points
0 comments1 min readLW link
(manifund.substack.com)

The Un­der­re­ac­tion to OpenAI

Sherrinford18 Jan 2024 22:08 UTC
19 points
0 comments6 min readLW link

Against Non­lin­ear (Thing Of Things)

tailcalled18 Jan 2024 21:40 UTC
58 points
18 comments1 min readLW link
(thingofthings.substack.com)

Toward A Math­e­mat­i­cal Frame­work for Com­pu­ta­tion in Superposition

18 Jan 2024 21:06 UTC
182 points
17 comments73 min readLW link

The True Story of How GPT-2 Be­came Max­i­mally Lewd

18 Jan 2024 21:03 UTC
70 points
7 comments6 min readLW link
(youtu.be)

Gaia Net­work: An Illus­trated Primer

18 Jan 2024 18:23 UTC
1 point
2 comments15 min readLW link

On the abo­li­tion of man

Joe Carlsmith18 Jan 2024 18:17 UTC
88 points
18 comments41 min readLW link

More Us­able Recipes

jefftk18 Jan 2024 17:40 UTC
14 points
1 comment1 min readLW link
(www.jefftk.com)

Good job op­por­tu­ni­ties for helping with the most im­por­tant century

HoldenKarnofsky18 Jan 2024 17:30 UTC
34 points
0 comments4 min readLW link
(www.cold-takes.com)

Flex­i­bil­ity and the Singularity

Jonathan Moregård18 Jan 2024 15:29 UTC
8 points
0 comments3 min readLW link
(honestliving.substack.com)

AI #48: Ex­po­nen­tials in Geometry

Zvi18 Jan 2024 14:20 UTC
59 points
9 comments54 min readLW link
(thezvi.wordpress.com)

Wor­ri­some mi­s­un­der­stand­ing of the core is­sues with AI transition

Roman Leventov18 Jan 2024 10:05 UTC
5 points
2 comments4 min readLW link

[Question] What ev­i­dence is there for (or against) the­o­ries about the ex­tent to which effec­tive al­tru­ist in­ter­ests mo­ti­vated the ouster of Sam Alt­man last year?

Evan_Gaensbauer18 Jan 2024 5:14 UTC
10 points
0 comments1 min readLW link

Does liter­acy re­move your abil­ity to be a bard as good as Homer?

Adrià Garriga-alonso18 Jan 2024 3:43 UTC
51 points
19 comments3 min readLW link

D&D.Sci Hyper­sphere Anal­y­sis Part 4: Fine-tun­ing and Wrapup

aphyer18 Jan 2024 3:06 UTC
23 points
5 comments7 min readLW link

Some heuris­tics I use for de­cid­ing how much I trust sci­en­tific results

NathanBarnard18 Jan 2024 2:48 UTC
13 points
2 comments5 min readLW link

New­port News VA Meetup—Liv­ing Museum

Daniel18 Jan 2024 2:05 UTC
1 point
0 comments1 min readLW link

[Question] Ex­per­i­ments to Test the Prob­a­bil­ity of Strate­gic De­cep­tive Misal­ign­ment?

Oliver Daniels-Koch18 Jan 2024 0:13 UTC
2 points
0 comments1 min readLW link

In Strate­gic Time, Open-Source Games Are Loopy

StrivingForLegibility18 Jan 2024 0:08 UTC
18 points
0 comments6 min readLW link