[Question] Is CDT with precommitment enough?

martinkunev · 25 May 2024 21:40 UTC
10 points
17 comments · 1 min read · LW link

Complex systems theory in human performance. New model for conceptualizing training, adaptation and long-term development

Matěj Nekoranec · 25 May 2024 20:17 UTC
1 point
0 comments · 7 min read · LW link

Blindspot in Sport’s Data-Driven Age

Matěj Nekoranec · 25 May 2024 20:17 UTC
2 points
0 comments · 7 min read · LW link

LMSR subsidy parameter is the price of information

Abhimanyu Pallavi Sudhir · 25 May 2024 18:05 UTC
5 points
0 comments · 1 min read · LW link

Low Fertility is a Degrowth Paradise

Maxwell Tabarrok · 25 May 2024 17:35 UTC
7 points
2 comments · 3 min read · LW link
(www.maximum-progress.com)

Episode: Austin vs Linch on OpenAI

Austin Chen · 25 May 2024 16:15 UTC
20 points
25 comments · 44 min read · LW link
(manifund.substack.com)

Training-time domain authorization could be helpful for safety

25 May 2024 15:10 UTC
15 points
4 comments · 7 min read · LW link

Level up your spreadsheeting

angelinahli · 25 May 2024 14:57 UTC
44 points
11 comments · 3 min read · LW link
(docs.google.com)

“Successful language model evals” by Jason Wei

Arjun Panickssery · 25 May 2024 9:34 UTC
7 points
0 comments · 1 min read · LW link
(www.jasonwei.net)

[Question] What should the norms around AI voices be?

ChristianKl · 25 May 2024 6:29 UTC
17 points
6 comments · 1 min read · LW link

Secret US natsec project with intel revealed

Nathan Helm-Burger · 25 May 2024 4:22 UTC
27 points
1 comment · 1 min read · LW link
(www.politico.com)

Launch & Grow Your University Group: Apply now to OSP & FSP!

agucova · 25 May 2024 1:03 UTC
3 points
0 comments · 2 min read · LW link

Computational Mechanics Hackathon (June 1 & 2)

Adam Shai · 24 May 2024 22:18 UTC
34 points
5 comments · 1 min read · LW link

[Question] Request for comments/opinions/ideas on safety/ethics for use of tool AI in a large healthcare system.

bokov · 24 May 2024 20:53 UTC
5 points
2 comments · 1 min read · LW link

NYU Code Debates Update/Postmortem

David Rein · 24 May 2024 16:08 UTC
27 points
4 comments · 10 min read · LW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman · 24 May 2024 16:01 UTC
242 points
15 comments · 4 min read · LW link

The Schumer Report on AI (RTFB)

Zvi · 24 May 2024 15:10 UTC
34 points
3 comments · 36 min read · LW link
(thezvi.wordpress.com)

minutes from a human-alignment meeting

bhauth · 24 May 2024 5:01 UTC
67 points
4 comments · 2 min read · LW link

Talent Needs of Technical AI Safety Teams

24 May 2024 0:36 UTC
121 points
65 comments · 14 min read · LW link

How to Give Coming AGI’s the Best Chance of Figuring Out Ethics for Us

sweenesm · 23 May 2024 19:44 UTC
1 point
2 comments · 10 min read · LW link

Mentorship in AGI Safety (MAGIS) call for mentors

23 May 2024 18:28 UTC
32 points
3 comments · 2 min read · LW link

Quick Thoughts on Scaling Monosemanticity

Joel Burget · 23 May 2024 16:22 UTC
28 points
1 comment · 4 min read · LW link
(transformer-circuits.pub)

The case for stopping AI safety research

catubc · 23 May 2024 15:55 UTC
53 points
38 comments · 1 min read · LW link

[Question] SAE sparse feature graph using only residual layers

Jaehyuk Lim · 23 May 2024 13:32 UTC
0 points
3 comments · 1 min read · LW link

[Question] Are most people deeply confused about “love”, or am I missing a human universal?

SpectrumDT · 23 May 2024 13:22 UTC
13 points
28 comments · 3 min read · LW link

Executive Dysfunction 101

DaystarEld · 23 May 2024 12:43 UTC
33 points
1 comment · 3 min read · LW link
(daystareld.com)

AI #65: I Spy With My AI

Zvi · 23 May 2024 12:40 UTC
28 points
7 comments · 43 min read · LW link
(thezvi.wordpress.com)

What mistakes has the AI safety movement made?

EuanMcLean · 23 May 2024 11:19 UTC
64 points
29 comments · 12 min read · LW link

What should AI safety be trying to achieve?

EuanMcLean · 23 May 2024 11:17 UTC
17 points
1 comment · 13 min read · LW link

What will the first human-level AI look like, and how might things go wrong?

EuanMcLean · 23 May 2024 11:17 UTC
20 points
2 comments · 15 min read · LW link

Big Picture AI Safety: Introduction

EuanMcLean · 23 May 2024 11:15 UTC
46 points
7 comments · 5 min read · LW link

Paper in Science: Managing extreme AI risks amid rapid progress

JanB · 23 May 2024 8:40 UTC
50 points
2 comments · 1 min read · LW link

Power Law Policy

Ben Turtel · 23 May 2024 5:28 UTC
4 points
7 comments · 6 min read · LW link
(bturtel.substack.com)

Why entropy means you might not have to worry as much about superintelligent AI

Ron J · 23 May 2024 3:52 UTC
−26 points
1 comment · 2 min read · LW link

Quick Thoughts on Our First Sampling Run

jefftk · 23 May 2024 0:20 UTC
29 points
3 comments · 2 min read · LW link
(www.jefftk.com)

AI Safety proposal - Influencing the superintelligence explosion

Morgan · 22 May 2024 23:31 UTC
0 points
2 comments · 7 min read · LW link

Implementing Asimov’s Laws of Robotics - How I imagine alignment working.

Joshua Clancy · 22 May 2024 23:15 UTC
2 points
0 comments · 11 min read · LW link

Higher-Order Forecasts

ozziegooen · 22 May 2024 21:49 UTC
45 points
1 comment · 3 min read · LW link

A Positive Double Standard - Self-Help Principles Work For Individuals Not Populations

James Stephen Brown · 22 May 2024 21:37 UTC
8 points
3 comments · 5 min read · LW link

A Bi-Modal Brain Model

Johannes C. Mayer · 22 May 2024 20:10 UTC
12 points
3 comments · 2 min read · LW link

Offering service as a sensayer for simulationist-adjacent beliefs.

mako yass · 22 May 2024 18:52 UTC
22 points
0 comments · 1 min read · LW link

Do Not Mess With Scarlett Johansson

Zvi · 22 May 2024 15:10 UTC
65 points
7 comments · 16 min read · LW link
(thezvi.wordpress.com)

How Multiverse Theory dissolves Quantum inexplicability

mrdlm · 22 May 2024 14:55 UTC
0 points
0 comments · 1 min read · LW link

[Question] Should we be concerned about eating too much soy?

ChristianKl · 22 May 2024 12:53 UTC
18 points
3 comments · 1 min read · LW link

Procedural Executive Function, Part 3

DaystarEld · 22 May 2024 11:58 UTC
21 points
4 comments · 23 min read · LW link

Cicadas, Anthropic, and the bilateral alignment problem

kromem · 22 May 2024 11:09 UTC
28 points
6 comments · 5 min read · LW link

Announcing Human-aligned AI Summer School

22 May 2024 8:55 UTC
51 points
0 comments · 1 min read · LW link
(humanaligned.ai)

“Which chains-of-thought was that faster than?”

Emrik · 22 May 2024 8:21 UTC
37 points
4 comments · 4 min read · LW link

Each Llama3-8b text uses a different “random” subspace of the activation space

tailcalled · 22 May 2024 7:31 UTC
3 points
4 comments · 7 min read · LW link

ARIA’s Safeguarded AI grant program is accepting applications for Technical Area 1.1 until May 28th

Brendon_Wong · 22 May 2024 6:54 UTC
11 points
0 comments · 1 min read · LW link
(www.aria.org.uk)