Univer­sal Love In­te­gra­tion Test: Hitler

RaemonJan 10, 2024, 11:55 PM
76 points
65 comments9 min readLW link

The Per­cep­tron Controversy

Yuxi_LiuJan 10, 2024, 11:07 PM
65 points
18 comments1 min readLW link
(yuxi-liu-wired.github.io)

The Aspiring Ra­tion­al­ist Congregation

maiaJan 10, 2024, 10:52 PM
86 points
23 comments10 min readLW link

An Ac­tu­ally In­tu­itive Ex­pla­na­tion of the Oberth Effect

Isaac KingJan 10, 2024, 8:23 PM
63 points
37 comments6 min readLW link

Be­ware the sub­op­ti­mal routine

jwfiredragonJan 10, 2024, 7:02 PM
13 points
3 comments3 min readLW link

The true cost of fences

pleiotrothJan 10, 2024, 7:01 PM
3 points
2 comments4 min readLW link

“Dark Con­sti­tu­tion” for con­strain­ing some superintelligences

ValentineJan 10, 2024, 4:02 PM
3 points
9 comments1 min readLW link
(www.anarchonomicon.com)

[Question] rab­bit (a new AI com­pany) and Large Ac­tion Model (LAM)

MiguelDevJan 10, 2024, 1:57 PM
17 points
3 comments1 min readLW link

Sav­ing the world sucks

Defective AltruismJan 10, 2024, 5:55 AM
50 points
29 comments3 min readLW link

[Question] Ques­tions about Solomonoff induction

mukashiJan 10, 2024, 1:16 AM
7 points
11 comments1 min readLW link

AI as a nat­u­ral disaster

Neil Jan 10, 2024, 12:42 AM
11 points
1 comment7 min readLW link

Stop be­ing sur­prised by the pas­sage of time

Jan 10, 2024, 12:36 AM
−2 points
1 comment3 min readLW link

A dis­cus­sion of nor­ma­tive ethics

Jan 9, 2024, 11:29 PM
10 points
6 comments25 min readLW link

On the Con­trary, Steel­man­ning Is Nor­mal; ITT-Pass­ing Is Niche

Zack_M_DavisJan 9, 2024, 11:12 PM
45 points
31 comments4 min readLW link

[Question] What’s the pro­to­col for if a novice has ML ideas that are un­likely to work, but might im­prove ca­pa­bil­ities if they do work?

droctaJan 9, 2024, 10:51 PM
6 points
2 comments2 min readLW link

Good­bye, Shog­goth: The Stage, its An­i­ma­tron­ics, & the Pup­peteer – a New Metaphor

RogerDearnaleyJan 9, 2024, 8:42 PM
47 points
8 comments36 min readLW link

Bent or Blunt Hoods?

jefftkJan 9, 2024, 8:10 PM
23 points
0 comments1 min readLW link
(www.jefftk.com)

2024 ACX Pre­dic­tions: Blind/​Buy/​Sell/​Hold

ZviJan 9, 2024, 7:30 PM
33 points
2 comments31 min readLW link
(thezvi.wordpress.com)

An­nounc­ing the Dou­ble Crux Bot

Jan 9, 2024, 6:54 PM
53 points
10 comments3 min readLW link

Does AI risk “other” the AIs?

Joe CarlsmithJan 9, 2024, 5:51 PM
60 points
3 comments8 min readLW link

AI de­mands un­prece­dented reliability

JonoJan 9, 2024, 4:30 PM
22 points
5 comments2 min readLW link

Uncer­tainty in all its flavours

Cleo NardoJan 9, 2024, 4:21 PM
34 points
6 comments35 min readLW link

Com­pen­sat­ing for Life Biases

Jonathan MoregårdJan 9, 2024, 2:39 PM
24 points
6 comments3 min readLW link
(honestliving.substack.com)

Can Mo­ral­ity Be Quan­tified?

JuliusJan 9, 2024, 6:35 AM
3 points
0 comments5 min readLW link

Learn­ing Math in Time for Alignment

Nicholas / Heather KrossJan 9, 2024, 1:02 AM
32 points
5 comments3 min readLW link

Brief Thoughts on Jus­tifi­ca­tions for Paternalism

Srdjan MileticJan 9, 2024, 12:36 AM
4 points
0 comments4 min readLW link
(dissent.blog)

Hiring de­ci­sions are not suit­able for pre­dic­tion markets

SimonMJan 8, 2024, 9:11 PM
12 points
6 comments1 min readLW link

Bet­ter Anomia

jefftkJan 8, 2024, 6:40 PM
8 points
0 comments1 min readLW link
(www.jefftk.com)

A starter guide for evals

Jan 8, 2024, 6:24 PM
54 points
2 comments12 min readLW link
(www.apolloresearch.ai)

Is it jus­tifi­able for non-ex­perts to have strong opinions about Gaza?

Jan 8, 2024, 5:31 PM
23 points
12 comments30 min readLW link

Pro­ject ideas: Backup plans & Co­op­er­a­tive AI

Lukas FinnvedenJan 8, 2024, 5:19 PM
18 points
0 commentsLW link
(www.forethought.org)

Hackathon and Stay­ing Up-to-Date in AI

jacobhaimesJan 8, 2024, 5:10 PM
11 points
0 comments1 min readLW link
(into-ai-safety.github.io)

When “yang” goes wrong

Joe CarlsmithJan 8, 2024, 4:35 PM
73 points
6 comments13 min readLW link

Task vec­tors & anal­ogy mak­ing in LLMs

SergiiJan 8, 2024, 3:17 PM
9 points
1 comment4 min readLW link
(grgv.xyz)

[Question] How to find trans­la­tions of a book?

ViliamJan 8, 2024, 2:57 PM
9 points
8 comments1 min readLW link

[Question] Why aren’t Yud­kowsky & Bostrom get­ting more at­ten­tion now?

JoshuaFoxJan 8, 2024, 2:42 PM
14 points
8 comments1 min readLW link

2023 Pre­dic­tion Evaluations

ZviJan 8, 2024, 2:40 PM
47 points
0 comments28 min readLW link
(thezvi.wordpress.com)

There is no sharp bound­ary be­tween de­on­tol­ogy and consequentialism

quetzal_rainbowJan 8, 2024, 11:01 AM
8 points
2 comments1 min readLW link

Reflec­tions on my first year of AI safety research

Jay BaileyJan 8, 2024, 7:49 AM
53 points
3 commentsLW link

Why There Is Hope For An Align­ment Solution

DarklightJan 8, 2024, 6:58 AM
10 points
0 comments12 min readLW link

Sled­ding Among Hazards

jefftkJan 8, 2024, 3:30 AM
19 points
5 comments1 min readLW link
(www.jefftk.com)

Utility is relative

CrimsonChinJan 8, 2024, 2:31 AM
2 points
4 comments2 min readLW link

A model of re­search skill

L Rudolf LJan 8, 2024, 12:13 AM
60 points
6 comments12 min readLW link
(www.strataoftheworld.com)

We shouldn’t fear su­per­in­tel­li­gence be­cause it already exists

Spencer ChubbJan 7, 2024, 5:59 PM
−22 points
14 comments1 min readLW link

(Par­tial) failure in repli­cat­ing de­cep­tive al­ign­ment experiment

claudia.biancottiJan 7, 2024, 5:56 PM
1 point
0 comments1 min readLW link

Pro­ject ideas: Sen­tience and rights of digi­tal minds

Lukas FinnvedenJan 7, 2024, 5:34 PM
20 points
0 commentsLW link
(www.forethought.org)

De­cep­tive AI ≠ De­cep­tively-al­igned AI

Steven ByrnesJan 7, 2024, 4:55 PM
96 points
19 comments6 min readLW link

Bayesi­ans Com­mit the Gam­bler’s Fallacy

Kevin DorstJan 7, 2024, 12:54 PM
49 points
30 comments8 min readLW link
(kevindorst.substack.com)

Towards AI Safety In­fras­truc­ture: Talk & Outline

Paul BricmanJan 7, 2024, 9:31 AM
11 points
0 comments2 min readLW link
(www.youtube.com)

Defend­ing against hy­po­thet­i­cal moon life dur­ing Apollo 11

eukaryoteJan 7, 2024, 4:49 AM
57 points
9 comments32 min readLW link
(eukaryotewritesblog.com)