RSS

Why a Mars colony would lead to a first strike situation

Remmelt4 Oct 2023 11:29 UTC
−17 points
1 comment1 min readLW link
(mflb.com)

[Question] What are some ex­am­ples of AIs in­stan­ti­at­ing the ‘near­est un­blocked strat­egy prob­lem’?

EJT4 Oct 2023 11:05 UTC
5 points
2 comments1 min readLW link

Graph­i­cal ten­sor no­ta­tion for interpretability

Jordan Taylor4 Oct 2023 8:04 UTC
22 points
0 comments19 min readLW link

[Link] Bay Area Win­ter Sols­tice 2023

4 Oct 2023 2:19 UTC
11 points
0 comments1 min readLW link
(fb.me)

[Question] Who de­ter­mines whether an al­ign­ment pro­posal is the defini­tive al­ign­ment solu­tion?

MiguelDev3 Oct 2023 22:39 UTC
0 points
3 comments1 min readLW link

“Go­ing In­finite”—New book on FTX/​SBF re­leased to­day + my TL;DR

NickyP3 Oct 2023 22:19 UTC
30 points
2 comments2 min readLW link

AXRP Epi­sode 25 - Co­op­er­a­tive AI with Cas­par Oesterheld

DanielFilan3 Oct 2023 21:50 UTC
27 points
0 comments92 min readLW link

When to Get the Booster?

jefftk3 Oct 2023 21:00 UTC
32 points
2 comments2 min readLW link
(www.jefftk.com)

OpenAI-Microsoft partnership

Zach Stein-Perlman3 Oct 2023 20:01 UTC
43 points
9 comments1 min readLW link

[Question] Cur­rent AI safety tech­niques?

Zach Stein-Perlman3 Oct 2023 19:30 UTC
13 points
1 comment2 min readLW link

Test­ing and Au­toma­tion for In­tel­li­gent Sys­tems.

Sai Kiran Kammari3 Oct 2023 17:51 UTC
−5 points
0 comments1 min readLW link
(resource-cms.springernature.com)

Me­tac­u­lus An­nounces Fore­cast­ing Tour­na­ment to Eval­u­ate Fo­cused Re­search Or­ga­ni­za­tions, in Part­ner­ship With the Fed­er­a­tion of Amer­i­can Scien­tists

ChristianWilliams3 Oct 2023 16:44 UTC
11 points
0 comments1 min readLW link
(www.metaculus.com)

What would it mean to un­der­stand how a large lan­guage model (LLM) works? Some quick notes.

Bill Benzon3 Oct 2023 15:11 UTC
18 points
2 comments8 min readLW link

[Question] Po­ten­tial al­ign­ment tar­gets for a sovereign su­per­in­tel­li­gent AI

Paul Colognese3 Oct 2023 15:09 UTC
26 points
4 comments1 min readLW link

Monthly Roundup #11: Oc­to­ber 2023

Zvi3 Oct 2023 14:10 UTC
32 points
6 comments35 min readLW link
(thezvi.wordpress.com)

Why We Use Money? - A Walrasian View

Savio Coelho3 Oct 2023 12:02 UTC
4 points
3 comments8 min readLW link

Mech In­terp Challenge: Oc­to­ber—De­ci­pher­ing the Sorted List Model

TheMcDouglas3 Oct 2023 10:57 UTC
10 points
0 comments3 min readLW link

Early Ex­per­i­ments in Re­ward Model In­ter­pre­ta­tion Us­ing Sparse Autoencoders

3 Oct 2023 7:45 UTC
11 points
0 comments5 min readLW link

Some Quick Fol­low-Up Ex­per­i­ments to “Taken out of con­text: On mea­sur­ing situ­a­tional aware­ness in LLMs”

miles3 Oct 2023 2:22 UTC
24 points
0 comments9 min readLW link

Life In a Day Screen­ing and Dis­cus­sion With EA Waterloo

jenn2 Oct 2023 21:39 UTC
5 points
0 comments2 min readLW link