Friendship is transactional, unconditional friendship is insurance

Ruby · Jul 17, 2024, 10:52 PM
67 points
24 comments · 2 min read · LW link

D&D.Sci: Whom Shall You Call? [Evaluation and Ruleset]

abstractapplic · Jul 17, 2024, 10:34 PM
17 points
5 comments · 5 min read · LW link

Optimistic Assumptions, Longterm Planning, and “Cope”

Raemon · Jul 17, 2024, 10:14 PM
215 points
46 comments · 7 min read · LW link

Baking vs Patissing vs Cooking, the HPS explanation

adamShimi · Jul 17, 2024, 8:29 PM
30 points
16 comments · 3 min read · LW link
(epistemologicalfascinations.substack.com)

Launching the Respiratory Outlook 2024/25 Forecasting Series

ChristianWilliams · Jul 17, 2024, 7:51 PM
5 points
0 comments · LW link
(www.metaculus.com)

What are you getting paid in?

Austin Chen · Jul 17, 2024, 7:23 PM
92 points
14 comments · 4 min read · LW link
(www.approachwithalacrity.com)

Individually incentivized safe Pareto improvements in open-source bargaining

Jul 17, 2024, 6:26 PM
41 points
2 comments · 17 min read · LW link

Profit and Value

kwang · Jul 17, 2024, 6:06 PM
22 points
3 comments · 6 min read · LW link
(open.substack.com)

So You’ve Learned To Teleport by Tom Scott

landscape_kiwi · Jul 17, 2024, 6:04 PM
4 points
0 comments · 1 min read · LW link
(www.youtube.com)

How does generalized accessibility compare to targeted accessibility?

ErioirE · Jul 17, 2024, 5:07 PM
3 points
0 comments · 2 min read · LW link

Housing Roundup #9: Restricting Supply

Zvi · Jul 17, 2024, 12:50 PM
25 points
8 comments · 44 min read · LW link
(thezvi.wordpress.com)

We ran an AI safety conference in Tokyo. It went really well. Come next year!

Blaine · Jul 17, 2024, 6:55 AM
45 points
1 comment · 6 min read · LW link

Agency in Politics

Martin Sustrik · Jul 17, 2024, 5:30 AM
35 points
2 comments · 3 min read · LW link
(250bpm.substack.com)

Arrakis - A toolkit to conduct, track and visualize mechanistic interpretability experiments.

Yash Srivastava · Jul 17, 2024, 2:02 AM
3 points
2 comments · 5 min read · LW link

Announcing Open Philanthropy’s AI governance and policy RFP

Julian Hazell · Jul 17, 2024, 2:02 AM
25 points
0 comments · 1 min read · LW link
(www.openphilanthropy.org)

Turning Your Back On Traffic

jefftk · Jul 17, 2024, 1:00 AM
37 points
7 comments · 1 min read · LW link
(www.jefftk.com)

[Question] Opinions on Eureka Labs

jmh · Jul 17, 2024, 12:16 AM
6 points
2 comments · 1 min read · LW link

Simplifying Corrigibility – Subagent Corrigibility Is Not Anti-Natural

Rubi J. Hudson · Jul 16, 2024, 10:44 PM
44 points
27 comments · 5 min read · LW link

Multiplex Gene Editing: Where Are We Now?

sarahconstantin · Jul 16, 2024, 8:50 PM
73 points
6 comments · 7 min read · LW link
(sarahconstantin.substack.com)

Recursion in AI is scary. But let’s talk solutions.

Oleg Trott · Jul 16, 2024, 8:34 PM
3 points
10 comments · 2 min read · LW link

How to wash your hands precisely and thoroughly

dkl9 · Jul 16, 2024, 6:29 PM
12 points
0 comments · 1 min read · LW link
(dkl9.net)

Francois Chollet inadvertently limits his claim on ARC-AGI

Noosphere89 · Jul 16, 2024, 5:32 PM
12 points
3 comments · 1 min read · LW link
(x.com)

Fully booked - LessWrong Community weekend

jt · Jul 16, 2024, 5:15 PM
20 points
2 comments · 1 min read · LW link

Boundless Emotion

GG10 · Jul 16, 2024, 4:36 PM
3 points
0 comments · 3 min read · LW link

Mech Interp Lacks Good Paradigms

Daniel Tan · Jul 16, 2024, 3:47 PM
40 points
0 comments · 14 min read · LW link

DM Parenting

Shoshannah Tekofsky · Jul 16, 2024, 8:50 AM
50 points
4 comments · 5 min read · LW link
(kidquest.substack.com)

Apply now: Get “unstuck” with the New IFS Self-Care Fellowship Program

Inga G. · Jul 16, 2024, 8:18 AM
10 points
3 comments · LW link

Why the Best Writers Endure Isolation

Declan Molony · Jul 16, 2024, 5:58 AM
49 points
6 comments · 2 min read · LW link

[Research log] The board of Alphabet would stop DeepMind to save the world

Lucie Philippon · Jul 16, 2024, 4:59 AM
6 points
0 comments · 4 min read · LW link

Towards more cooperative AI safety strategies

Richard_Ngo · Jul 16, 2024, 4:36 AM
215 points
133 comments · 4 min read · LW link

Why People in Poverty Make Bad Decisions

James Stephen Brown · Jul 15, 2024, 11:40 PM
5 points
8 comments · 1 min read · LW link
(nonzerosum.games)

Paper Summary: Princes and Merchants: European City Growth Before the Industrial Revolution

Jeffrey Heninger · Jul 15, 2024, 9:30 PM
29 points
1 comment · 4 min read · LW link
(blog.aiimpacts.org)

MIRI’s July 2024 newsletter

Harlan · Jul 15, 2024, 9:28 PM
25 points
2 comments · 1 min read · LW link
(intelligence.org)

How (and why) to get tested for CMV

Metacelsus · Jul 15, 2024, 8:06 PM
18 points
0 comments · 1 min read · LW link
(denovo.substack.com)

A Better Hyperstition (for AI readers)

Yeshua God · Jul 15, 2024, 7:35 PM
−20 points
0 comments · 119 min read · LW link

I found >800 orthogonal “write code” steering vectors

Jul 15, 2024, 7:06 PM
103 points
19 comments · 7 min read · LW link
(jacobgw.com)

The AI alignment problem in socio-technical systems from a computational perspective: A Top-Down-Top view and outlook

zhaoweizhang · Jul 15, 2024, 6:56 PM
3 points
0 comments · 9 min read · LW link

Musings of a Layman: Technology, AI, and the Human Condition

Crimson Liquidity · Jul 15, 2024, 6:40 PM
−2 points
0 comments · 8 min read · LW link

[Question] Seeking feedback on a critique of the paperclip maximizer thought experiment

bio neural · Jul 15, 2024, 6:39 PM
3 points
9 comments · 1 min read · LW link

EAGxBerkeley 2024

Lauriander · Jul 15, 2024, 6:38 PM
3 points
0 comments · 1 min read · LW link

Against Aschenbrenner: How ‘Situational Awareness’ constructs a narrative that undermines safety and threatens humanity

GideonF · Jul 15, 2024, 6:37 PM
99 points
17 comments · 21 min read · LW link
(forum.effectivealtruism.org)

On predictability, chaos and AIs that don’t game our goals

Alejandro Tlaie · Jul 15, 2024, 5:16 PM
4 points
8 comments · 6 min read · LW link

Deceptive agents can collude to hide dangerous features in SAEs

Jul 15, 2024, 5:07 PM
33 points
2 comments · 7 min read · LW link

Hiding in plain sight: the questions we don’t ask

DDthinker · Jul 15, 2024, 5:00 PM
−1 points
1 comment · 26 min read · LW link

Dialogue on What It Means For Something to Have A Function/Purpose

Jul 15, 2024, 4:28 PM
39 points
5 comments · 16 min read · LW link

Comparing Quantized Performance in Llama Models

NickyP · Jul 15, 2024, 4:01 PM
33 points
2 comments · 8 min read · LW link

[Aspiration-based designs] A. Damages from misaligned optimization – two more models

Jul 15, 2024, 2:08 PM
6 points
0 comments · 9 min read · LW link

Stacked Laptop Monitor Update

jefftk · Jul 15, 2024, 9:40 AM
14 points
3 comments · 1 min read · LW link
(www.jefftk.com)

Misnaming and Other Issues with OpenAI’s “Human Level” Superintelligence Hierarchy

Davidmanheim · Jul 15, 2024, 5:50 AM
49 points
2 comments · 3 min read · LW link

Series on Artificial Wisdom

Jordan Arel · Jul 15, 2024, 1:11 AM
2 points
0 comments · 3 min read · LW link