Betting and forecasting

CarlJ Sep 9, 2023, 8:03 PM
2 points
0 comments 1 min read LW link

AI presidents discuss AI alignment agendas

Sep 9, 2023, 6:55 PM
217 points
23 comments 1 min read LW link
(www.youtube.com)

Probabilistic argument relationships and an invitation to the argument mapping community

lunatic_at_large Sep 9, 2023, 6:45 PM
13 points
4 comments 10 min read LW link

How teams went about their research at AI Safety Camp edition 8

Sep 9, 2023, 4:34 PM
28 points
0 comments 13 min read LW link

Panel discussion on AI consciousness with Rob Long and Jeff Sebo

Aaron Bergman Sep 9, 2023, 3:38 AM
10 points
0 comments LW link
(www.youtube.com)

Possible Divergence in AGI Risk Tolerance between Selfish and Altruistic agents

Brad West Sep 9, 2023, 12:23 AM
1 point
1 comment 2 min read LW link

Capture the Flag Mechanistic Interpretability Challenges

Sep 8, 2023, 11:00 PM
24 points
0 comments 7 min read LW link

[Question] What is to be done? (About the profit motive)

Connor Barber Sep 8, 2023, 7:27 PM
1 point
21 comments 1 min read LW link

What is the optimal frontier for due diligence?

Sep 8, 2023, 6:20 PM
41 points
1 comment 1 min read LW link

Progress links digest, 2023-09-08: The Conservative Futurist, cargo airships, and more

jasoncrawford Sep 8, 2023, 5:48 PM
14 points
7 comments 5 min read LW link
(rootsofprogress.org)

The AI apocalypse myth.

Spiritus Dei Sep 8, 2023, 5:43 PM
−22 points
12 comments 2 min read LW link

Sum-threshold attacks

TsviBT Sep 8, 2023, 5:13 PM
238 points
55 comments 10 min read LW link
(tsvibt.blogspot.com)

Debate series: should we push for a pause on the development of AI?

Xodarap Sep 8, 2023, 4:29 PM
39 points
1 comment LW link

AI Probability Trees—Joe Carlsmith (2022)

Nathan Young Sep 8, 2023, 3:40 PM
12 points
1 comment 8 min read LW link

Invading Australia (Endless Formerlies Most Beautiful, or What I Learned On My Holiday)

Oliver Sourbut Sep 8, 2023, 3:33 PM
12 points
1 comment 8 min read LW link
(www.oliversourbut.net)

Explaining grokking through circuit efficiency

Sep 8, 2023, 2:39 PM
101 points
11 comments 3 min read LW link
(arxiv.org)

Have Attention Spans Been Declining?

niplav Sep 8, 2023, 2:11 PM
71 points
22 comments 17 min read LW link 1 review

Explained Simply: Quantilizers

brook Sep 8, 2023, 12:54 PM
15 points
5 comments LW link
(aisafetyexplained.substack.com)

Crossing the Rubicon.

Spiritus Dei Sep 8, 2023, 4:19 AM
−4 points
5 comments 13 min read LW link

[Question] What EY and LessWrong meant when (fill in the blank) found them.

Bill Benzon Sep 8, 2023, 1:42 AM
1 point
0 comments 1 min read LW link

Bring back the Colosseums

lc Sep 8, 2023, 12:09 AM
18 points
28 comments 1 min read LW link

The Löbian Obstacle, And Why You Should Care

lukemarks Sep 7, 2023, 11:59 PM
18 points
6 comments 2 min read LW link

Science to Be Done Internationally Using Blockchain

Victor Porton Sep 7, 2023, 11:29 PM
−18 points
0 comments 2 min read LW link
(science-dao.org)

A quick update from Nonlinear

KatWoods Sep 7, 2023, 9:28 PM
72 points
23 comments 2 min read LW link

[Linkpost] Frontier AI Taskforce: first progress report

Paul Colognese Sep 7, 2023, 7:06 PM
21 points
0 comments 4 min read LW link
(www.gov.uk)

[Question] How did you make your way back from meta?

matto Sep 7, 2023, 5:23 PM
23 points
27 comments 1 min read LW link

AI #28: Watching and Waiting

Zvi Sep 7, 2023, 5:20 PM
52 points
14 comments 45 min read LW link
(thezvi.wordpress.com)

[Question] Measure of complexity allowed by the laws of the universe and relative theory?

dr_s Sep 7, 2023, 12:21 PM
8 points
22 comments 1 min read LW link

Recreating the caring drive

Catnee Sep 7, 2023, 10:41 AM
43 points
15 comments 10 min read LW link 1 review

Sharing Information About Nonlinear

Ben Pace Sep 7, 2023, 6:51 AM
323 points
323 comments 34 min read LW link

Weekly Incidence vs Cumulative Infections

jefftk Sep 7, 2023, 2:30 AM
13 points
6 comments 1 min read LW link
(www.jefftk.com)

Improving Mathematical Accuracy in LLMs—History − 1

Abhay Chowdhry Sep 7, 2023, 1:58 AM
5 points
1 comment 9 min read LW link

Breaking RLHF “Safety” (And how to fix it?)

MPotter Sep 7, 2023, 1:58 AM
3 points
0 comments 4 min read LW link

Feedback-loops, Deliberate Practice, and Transfer Learning

Sep 7, 2023, 1:57 AM
46 points
5 comments 1 min read LW link

Video essay: How Will We Know When AI is Conscious?

JanPro Sep 6, 2023, 6:10 PM
11 points
7 comments 1 min read LW link
(www.youtube.com)

My First Post

Jaivardhan Nawani Sep 6, 2023, 5:42 PM
35 points
9 comments 1 min read LW link

ActAdd: Steering Language Models without Optimization

Sep 6, 2023, 5:21 PM
105 points
3 comments 2 min read LW link
(arxiv.org)

Monthly Roundup #10: September 2023

Zvi Sep 6, 2023, 1:20 PM
35 points
4 comments 56 min read LW link
(thezvi.wordpress.com)

Find Hot French Food Near Me: A Follow-up

aphyer Sep 6, 2023, 12:32 PM
75 points
19 comments 2 min read LW link

Manifest 2023

Sep 6, 2023, 11:24 AM
3 points
0 comments 1 min read LW link

Last Chance: Get tickets to Manifest 2023! (Sep 22-24 in Berkeley)

Sep 6, 2023, 10:35 AM
5 points
0 comments 1 min read LW link

What I’ve been reading, September 2023

jasoncrawford Sep 6, 2023, 9:32 AM
17 points
0 comments 5 min read LW link
(rootsofprogress.org)

Decision Theory: A (Normative) Introduction

Pareto Optimal Sep 6, 2023, 8:22 AM
−1 points
1 comment 3 min read LW link
(paretooptimal.substack.com)

[Question] What’s the easiest way to make a luminator?

kuira Sep 6, 2023, 12:07 AM
7 points
13 comments 1 min read LW link

Ordinary claims require ordinary evidence

blake8086 Sep 5, 2023, 10:09 PM
1 point
3 comments 2 min read LW link

Conversation about paradigms, intellectual progress, social consensus, and AI

Sep 5, 2023, 9:30 PM
14 points
6 comments 1 min read LW link

What I would do if I wasn’t at ARC Evals

LawrenceC Sep 5, 2023, 7:19 PM
220 points
10 comments 13 min read LW link 1 review

The Evolutionary Pathway from Biological to Digital Intelligence: A Cosmic Perspective

George360 Sep 5, 2023, 5:47 PM
−17 points
0 comments 4 min read LW link

The Illusion of Universal Morality: A Dynamic Perspective on Genetic Fitness and Ethical Complexity

George360 Sep 5, 2023, 5:47 PM
−9 points
7 comments 2 min read LW link

Benchmarks for Detecting Measurement Tampering [Redwood Research]

Sep 5, 2023, 4:44 PM
87 points
22 comments 20 min read LW link 1 review
(arxiv.org)