Why Not Subagents?

Jun 22, 2023, 10:16 PM
130 points
52 comments · 14 min read · LW link · 1 review

Catastrophic Risks from AI #2: Malicious Use

Jun 22, 2023, 5:10 PM
38 points
1 comment · 17 min read · LW link
(arxiv.org)

Catastrophic Risks from AI #1: Introduction

Jun 22, 2023, 5:09 PM
40 points
1 comment · 5 min read · LW link
(arxiv.org)

AI #17: The Litany

Zvi · Jun 22, 2023, 2:30 PM
95 points
34 comments · 56 min read · LW link
(thezvi.wordpress.com)

[Research Update] Sparse Autoencoder features are bimodal

Robert_AIZI · Jun 22, 2023, 1:15 PM
24 points
1 comment · 5 min read · LW link
(aizi.substack.com)

The Hubinger lectures on AGI safety: an introductory lecture series

evhub · Jun 22, 2023, 12:59 AM
126 points
0 comments · 1 min read · LW link
(www.youtube.com)

How to Search Multiple Websites Quickly

Nicholas / Heather Kross · Jun 22, 2023, 12:42 AM
16 points
1 comment · 1 min read · LW link

[Question] Newbie questions about information theory and transformers

Misaligned-Semi-intelligence · Jun 21, 2023, 10:45 PM
10 points
1 comment · 1 min read · LW link

Progress links and tweets, 2023-06-21: Stewart Brand wants your comments

jasoncrawford · Jun 21, 2023, 8:52 PM
11 points
1 comment · 1 min read · LW link
(rootsofprogress.org)

What—ideally—should young and intelligent people do?

veterxiph · Jun 21, 2023, 8:21 PM
1 point
4 comments · 3 min read · LW link

Using Claude to convert dialog transcripts into great posts?

mako yass · Jun 21, 2023, 8:19 PM
6 points
4 comments · 4 min read · LW link

Which personality traits are real? Stress-testing the lexical hypothesis

tailcalled · Jun 21, 2023, 7:46 PM
65 points
5 comments · 9 min read · LW link · 1 review

“textbooks are all you need”

bhauth · Jun 21, 2023, 5:06 PM
66 points
18 comments · 2 min read · LW link
(arxiv.org)

Philosophical Cyborg (Part 2)...or, The Good Successor

ukc10014 · Jun 21, 2023, 3:43 PM
21 points
1 comment · 31 min read · LW link

Relational Speaking

jefftk · Jun 21, 2023, 2:40 PM
11 points
0 comments · 2 min read · LW link
(www.jefftk.com)

My side of an argument with Jacob Cannell about chip interconnect losses

Steven Byrnes · Jun 21, 2023, 1:33 PM
144 points
11 comments · 11 min read · LW link

Short timelines and slow, continuous takeoff as the safest path to AGI

Jun 21, 2023, 8:56 AM
65 points
15 comments · 7 min read · LW link

A way to make solving alignment 10.000 times easier. The shorter case for a massive open source simbox project.

AlexFromSafeTransition · Jun 21, 2023, 8:08 AM
2 points
16 comments · 14 min read · LW link

My tentative best guess on how EAs and Rationalists sometimes turn crazy

habryka · Jun 21, 2023, 4:11 AM
199 points
110 comments · 8 min read · LW link

The Importance of Judging: A Reflection on Rational Thought

CrimsonChin · Jun 20, 2023, 10:49 PM
2 points
0 comments · 4 min read · LW link

“Natural is better” is a valuable heuristic

Neil · Jun 20, 2023, 10:25 PM
35 points
16 comments · 4 min read · LW link

№.6 For Those About To Dress...

party girl · Jun 20, 2023, 9:14 PM
5 points
0 comments · 4 min read · LW link
(affale.substack.com)

Frame Bridging v0.8 - an inquiry and a technique

Unreal · Jun 20, 2023, 7:46 PM
11 points
9 comments · 6 min read · LW link

Public Transit is not Infinitely Safe

jefftk · Jun 20, 2023, 6:40 PM
97 points
34 comments · 1 min read · LW link
(www.jefftk.com)

why I’m here now

bhauth · Jun 20, 2023, 5:13 PM
8 points
3 comments · 1 min read · LW link

Causality: A Brief Introduction

Jun 20, 2023, 3:01 PM
49 points
18 comments · 6 min read · LW link

Lightning Post: Things people in AI Safety should stop talking about

Prometheus · Jun 20, 2023, 3:00 PM
23 points
6 comments · 2 min read · LW link

Having a headache and not having a headache

Jim Pivarski · Jun 20, 2023, 2:59 PM
7 points
9 comments · 3 min read · LW link

Never Fight The Last War

ChristianKl · Jun 20, 2023, 12:35 PM
32 points
4 comments · 1 min read · LW link

[Question] Why didn’t virologists run the studies necessary to determine which viruses are airborne?

ChristianKl · Jun 20, 2023, 11:58 AM
28 points
19 comments · 1 min read · LW link

A Friendly Face (Another Failure Story)

Jun 20, 2023, 10:31 AM
65 points
21 comments · 16 min read · LW link

[Question] Are the majority of your ancestors farmers or non-farmers?

Linch · Jun 20, 2023, 8:55 AM
19 points
47 comments · 1 min read · LW link

DSLT 3. Neural Networks are Singular

Liam Carroll · Jun 20, 2023, 8:20 AM
29 points
5 comments · 19 min read · LW link

10 quick takes about AGI

Max H · Jun 20, 2023, 2:22 AM
35 points
17 comments · 7 min read · LW link

OpenAI introduces function calling for GPT-4

Jun 20, 2023, 1:58 AM
24 points
3 comments · 4 min read · LW link
(openai.com)

Approaches to Thump

jefftk · Jun 20, 2023, 1:50 AM
8 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Ban development of unpredictable powerful models?

TurnTrout · Jun 20, 2023, 1:43 AM
46 points
25 comments · 4 min read · LW link

Capture today’s market, capture tomorrow’s game board

SimonBiggs · Jun 20, 2023, 12:45 AM
9 points
0 comments · 5 min read · LW link

Lessons On How To Get Things Right On The First Try

Jun 19, 2023, 11:58 PM
252 points
57 comments · 10 min read · LW link · 1 review

Mode collapse in RL may be fueled by the update equation

Jun 19, 2023, 9:51 PM
53 points
10 comments · 8 min read · LW link

New reference standard on LLM Application security started by OWASP

QuantumForest · Jun 19, 2023, 8:54 PM
2 points
0 comments · 1 min read · LW link

Experiments in Evaluating Steering Vectors

Gytis Daujotas · Jun 19, 2023, 3:11 PM
34 points
4 comments · 4 min read · LW link

Provisionality

TsviBT · Jun 19, 2023, 11:49 AM
7 points
2 comments · 7 min read · LW link

[Question] When did you orient?

lemonhope · Jun 19, 2023, 7:22 AM
12 points
7 comments · 1 min read · LW link

Guide to rationalist interior decorating

mingyuan · Jun 19, 2023, 6:47 AM
327 points
53 comments · 12 min read · LW link · 4 reviews

A Multidisciplinary Approach to Alignment (MATA) and Archetypal Transfer Learning (ATL)

MiguelDev · Jun 19, 2023, 2:32 AM
4 points
2 comments · 7 min read · LW link

resolving some neural network mysteries

bhauth · Jun 19, 2023, 12:09 AM
44 points
6 comments · 2 min read · LW link
(www.bhauth.com)

Why I am not an AI extinction cautionista

Shmi · Jun 18, 2023, 9:28 PM
22 points
40 comments · 2 min read · LW link

My impression of singular learning theory

Ege Erdil · Jun 18, 2023, 3:34 PM
47 points
30 comments · 2 min read · LW link

Berlin AI Alignment Open Meetup July 2023

GuyP · Jun 18, 2023, 2:13 PM
1 point
0 comments · 1 min read · LW link