Judging types of consequentialism by influence and normativity

Cole Wyeth
29 Apr 2025 23:25 UTC
19 points
0 comments
2 min read
LW link

Bandwidth Rules Everything Around Me: Oliver Habryka on OpenPhil and GoodVentures

Elizabeth
29 Apr 2025 20:40 UTC
79 points
15 comments
1 min read
LW link
(acesounderglass.com)

The Grand Encyclopedia of Eponymous Laws

rogersbacon
29 Apr 2025 19:30 UTC
27 points
6 comments
16 min read
LW link
(www.secretorum.life)

Misrepresentation as a Barrier for Interp (Part I)

29 Apr 2025 17:07 UTC
113 points
12 comments
7 min read
LW link

AISN #53: An Open Letter Attempts to Block OpenAI Restructuring

29 Apr 2025 16:13 UTC
7 points
0 comments
4 min read
LW link

What could Alphafold 4 look like?

Abhishaike Mahajan
29 Apr 2025 15:45 UTC
8 points
0 comments
1 min read
LW link

Sealed Computation: Towards Low-Friction Proof of Locality

Paul Bricman
29 Apr 2025 15:26 UTC
4 points
0 comments
10 min read
LW link
(noemaresearch.com)

Dating Roundup #4: An App for That

Zvi
29 Apr 2025 13:10 UTC
18 points
5 comments
16 min read
LW link
(thezvi.wordpress.com)

Talk on letters to AI (London)

ukc10014
29 Apr 2025 9:50 UTC
3 points
0 comments
1 min read
LW link

Memory Decoding Journal Club: “Motor learning selectively strengthens cortical and striatal synapses of motor engram neurons”

Devin Ward
29 Apr 2025 2:26 UTC
1 point
0 comments
1 min read
LW link

D&D.Sci Tax Day: Adventurers and Assessments Evaluation & Ruleset

aphyer
29 Apr 2025 2:00 UTC
28 points
10 comments
5 min read
LW link

How to Build a Third Place on Focusmate

Parker Conley
28 Apr 2025 23:46 UTC
96 points
10 comments
5 min read
LW link
(parconley.com)

Methods of defense against AGI manipulation

MarkelKori
28 Apr 2025 21:03 UTC
3 points
0 comments
2 min read
LW link

China’s Petition System: It Looks Like Democracy — But It Isn’t

Hu Yichao
28 Apr 2025 20:56 UTC
0 points
4 comments
2 min read
LW link

Fundamentals of Safe AI (Phase 1) – Applications Open for the Global Cohort

rajsecrets
28 Apr 2025 20:52 UTC
9 points
0 comments
2 min read
LW link

Proceedings of ILIAD: Lessons and Progress

28 Apr 2025 19:04 UTC
78 points
5 comments
8 min read
LW link

GPT-4o Is An Absurd Sycophant

Zvi
28 Apr 2025 19:00 UTC
81 points
7 comments
19 min read
LW link
(thezvi.wordpress.com)

[Question] What are the best standardised, repeatable bets?

kave
28 Apr 2025 18:45 UTC
31 points
10 comments
1 min read
LW link

7+ tractable directions in AI control

28 Apr 2025 17:12 UTC
93 points
1 comment
13 min read
LW link

“A victory for the natural order”

Mati_Roy
28 Apr 2025 15:33 UTC
11 points
3 comments
1 min read
LW link
(preservinghope.substack.com)

Why giving workers stocks isn’t enough — and what co-ops get right

B Jacobs
28 Apr 2025 14:19 UTC
7 points
9 comments
2 min read
LW link
(bobjacobs.substack.com)

Keltham on Becoming more Truth-Oriented

Towards_Keeperhood
28 Apr 2025 12:58 UTC
22 points
2 comments
19 min read
LW link

Therapist in the Weights: Risks of Hyper-Introspection in Future AI Systems

Davidmanheim
28 Apr 2025 6:42 UTC
15 points
1 comment
5 min read
LW link

In Darkness They Assembled

Charlie Sanders
28 Apr 2025 3:44 UTC
2 points
0 comments
3 min read
LW link

Seeking advice on careers in AI Safety

nem
27 Apr 2025 23:59 UTC
8 points
2 comments
1 min read
LW link

Thin Alignment Can’t Solve Thick Problems

Daan Henselmans
27 Apr 2025 22:42 UTC
11 points
2 comments
9 min read
LW link

The Way You Go Depends A Good Deal On Where You Want To Get: FEP minimizes surprise about actions using preferences about the future as *evidence*

Christopher King
27 Apr 2025 21:55 UTC
10 points
5 comments
5 min read
LW link

How people use LLMs

Elizabeth
27 Apr 2025 21:48 UTC
83 points
6 comments
1 min read
LW link
(www.gleech.org)

Luna Lovegood and the Chamber of Secrets, Part 6

27 Apr 2025 20:26 UTC
3 points
0 comments
2 min read
LW link

Our Reality: A Simulation Run by a Paperclip Maximizer

27 Apr 2025 16:17 UTC
21 points
65 comments
5 min read
LW link

Questions for old LW members: how have discussions about AI changed compared to 10+ years ago?

Expertium
27 Apr 2025 16:11 UTC
11 points
12 comments
1 min read
LW link

The case for multi-decade AI timelines [Linkpost]

Noosphere89
27 Apr 2025 15:31 UTC
57 points
22 comments
1 min read
LW link
(epoch.ai)

My Research Process: Key Mindsets—Truth-Seeking, Prioritisation, Moving Fast

Neel Nanda
27 Apr 2025 14:38 UTC
44 points
0 comments
11 min read
LW link

I doubt model collapse will happen

Hruss
27 Apr 2025 14:08 UTC
5 points
0 comments
1 min read
LW link

Propaganda-Bot: A Sketch of a Possible RSI

TristanTrim
27 Apr 2025 12:15 UTC
6 points
0 comments
3 min read
LW link

After Internet Dependency

Vorak
27 Apr 2025 8:18 UTC
14 points
2 comments
1 min read
LW link

Emergence of superintelligence from AI hiveminds: how to make it human-friendly?

Mitchell_Porter
27 Apr 2025 4:51 UTC
12 points
0 comments
2 min read
LW link

“The Urgency of Interpretability” (Dario Amodei)

RobertM
27 Apr 2025 4:31 UTC
31 points
23 comments
3 min read
LW link
(www.darioamodei.com)

AI Self Portraits Aren’t Accurate

JustisMills
27 Apr 2025 3:27 UTC
58 points
10 comments
5 min read
LW link

MiCARwave

jefftk
27 Apr 2025 2:30 UTC
13 points
0 comments
1 min read
LW link
(www.jefftk.com)

Open Source LLM Pokémon Scaffold

Julian Bradshaw
27 Apr 2025 0:57 UTC
24 points
0 comments
1 min read
LW link
(github.com)

What are important UI-shaped problems that Lightcone could tackle?

Raemon
27 Apr 2025 0:02 UTC
59 points
22 comments
2 min read
LW link

Kodo and Din

Screwtape
26 Apr 2025 18:54 UTC
7 points
10 comments
4 min read
LW link

We should try to automate AI safety work asap

Marius Hobbhahn
26 Apr 2025 16:35 UTC
113 points
10 comments
15 min read
LW link

AI Safety & Entrepreneurship v1.0

Chris_Leong
26 Apr 2025 14:37 UTC
16 points
0 comments
2 min read
LW link

Reconsidering Money: The Case for Freigeld in the Digital Age and a Networked Future

henophilia
26 Apr 2025 12:54 UTC
−22 points
0 comments
5 min read
LW link
(blog.hermesloom.org)

How I Think About My Research Process: Explore, Understand, Distill

Neel Nanda
26 Apr 2025 10:31 UTC
56 points
4 comments
8 min read
LW link

Don’t you mean “the most *conditionally* forbidden technique?”

Knight Lee
26 Apr 2025 3:45 UTC
14 points
0 comments
3 min read
LW link

Land with no aunties

thellimist
26 Apr 2025 1:20 UTC
6 points
0 comments
1 min read
LW link
(kanyilmaz.me)

AI 2027 Thoughts

PeterMcCluskey
26 Apr 2025 0:00 UTC
29 points
2 comments
6 min read
LW link
(bayesianinvestor.com)