Sur­vey on the ac­cel­er­a­tion risks of our new RFPs to study LLM capabilities

Ajeya Cotra10 Nov 2023 23:59 UTC
27 points
1 comment1 min readLW link

Rat Fest 2024

LoganChipkin10 Nov 2023 23:25 UTC
1 point
0 comments1 min readLW link

How I Think, Part Three: Weigh­ing Cryonics

Richard Henage10 Nov 2023 22:21 UTC
4 points
1 comment2 min readLW link

Lin­ear en­cod­ing of char­ac­ter-level in­for­ma­tion in GPT-J to­ken embeddings

10 Nov 2023 22:19 UTC
34 points
4 comments28 min readLW link

Fol­low-up sur­vey: inositol

Elizabeth10 Nov 2023 19:30 UTC
13 points
1 comment1 min readLW link
(acesounderglass.com)

We have promis­ing al­ign­ment plans with low taxes

Seth Herd10 Nov 2023 18:51 UTC
31 points
9 comments5 min readLW link

[Question] Vec­tor search on a large dataset?

camsdixon10 Nov 2023 18:43 UTC
−1 points
2 comments1 min readLW link

About Me

Abe Dillon10 Nov 2023 18:32 UTC
3 points
0 comments1 min readLW link

Me­tac­u­lus In­tro­duces AI-Pow­ered Com­mu­nity In­sights to Re­veal Fac­tors Driv­ing User Forecasts

ChristianWilliams10 Nov 2023 17:57 UTC
6 points
0 comments1 min readLW link
(www.metaculus.com)

Joy in the Here and Real

Screwtape10 Nov 2023 17:22 UTC
18 points
0 comments2 min readLW link

Arte­facts gen­er­ated by mode col­lapse in GPT-4 Turbo serve as ad­ver­sar­ial at­tacks.

Sohaib Imran10 Nov 2023 15:23 UTC
11 points
0 comments2 min readLW link

Wastew­a­ter RNA Read Lengths

jefftk10 Nov 2023 15:20 UTC
13 points
0 comments4 min readLW link
(www.jefftk.com)

Up­date on the UK AI Sum­mit and the UK’s Plans

Elliot_Mckernon10 Nov 2023 14:47 UTC
10 points
0 comments8 min readLW link

Liv Bo­eree Ted Talk Moloch & AI

Neil 10 Nov 2023 14:04 UTC
10 points
2 comments1 min readLW link
(m.youtube.com)

Pick­ing Men­tors For Re­search Programmes

Raymond D10 Nov 2023 13:01 UTC
106 points
8 comments4 min readLW link

GPT-2030 and Catas­trophic Drives: Four Vignettes

jsteinhardt10 Nov 2023 7:30 UTC
50 points
5 comments10 min readLW link
(bounded-regret.ghost.io)

Crock, Crocker, Crockiest

Screwtape10 Nov 2023 6:14 UTC
21 points
4 comments6 min readLW link

AI Timelines

10 Nov 2023 5:28 UTC
256 points
74 comments51 min readLW link

ACI#6: A Non-Dual­is­tic ACI Model

Akira Pyinya9 Nov 2023 23:01 UTC
10 points
2 comments6 min readLW link

How I got so ex­cited about HowTruthful

Bruce Lewis9 Nov 2023 18:49 UTC
17 points
2 comments5 min readLW link

The case for “Gen­er­ous Tit for Tat” as the ul­ti­mate game the­ory strategy

positivesum9 Nov 2023 18:41 UTC
2 points
3 comments8 min readLW link
(tryingtruly.substack.com)

In­ter­na­tional treaty for global com­pute caps

9 Nov 2023 18:17 UTC
22 points
2 comments8 min readLW link

Text Posts from the Kids Group: 2021

jefftk9 Nov 2023 17:50 UTC
38 points
1 comment8 min readLW link
(www.jefftk.com)

AI #37: Mov­ing Too Fast

Zvi9 Nov 2023 17:50 UTC
53 points
5 comments76 min readLW link
(thezvi.wordpress.com)

Learn­ing-the­o­retic agenda read­ing list

Vanessa Kosoy9 Nov 2023 17:25 UTC
91 points
0 comments2 min readLW link

​​ Open-ended/​Phenom­e­nal ​Ethics ​(TLDR)

Ryo 9 Nov 2023 16:58 UTC
3 points
0 comments1 min readLW link

Poly­se­man­tic At­ten­tion Head in a 4-Layer Transformer

9 Nov 2023 16:16 UTC
46 points
0 comments6 min readLW link

On OpenAI Dev Day

Zvi9 Nov 2023 16:10 UTC
60 points
0 comments15 min readLW link
(thezvi.wordpress.com)

An­trop­i­cal Prob­a­bil­ities Are Fully Ex­plained by Differ­ence in Pos­si­ble Outcomes

Ape in the coat9 Nov 2023 15:34 UTC
17 points
2 comments5 min readLW link

A free to en­ter, 240 char­ac­ter, open-source iter­ated pris­oner’s dilemma tournament

Isaac King9 Nov 2023 8:24 UTC
64 points
19 comments1 min readLW link
(manifold.markets)

Into AI Safety Epi­sodes 1 & 2

jacobhaimes9 Nov 2023 4:36 UTC
2 points
0 comments1 min readLW link
(into-ai-safety.github.io)

Mak­ing Bad De­ci­sions On Purpose

Screwtape9 Nov 2023 3:36 UTC
48 points
8 comments5 min readLW link

Me­tac­u­lus’s New Side­bar Helps You Find Fore­casts Faster

ChristianWilliams8 Nov 2023 20:56 UTC
15 points
0 comments1 min readLW link
(www.metaculus.com)

Open-ended ethics of phe­nom­ena (a desider­ata with uni­ver­sal moral­ity)

Ryo 8 Nov 2023 20:10 UTC
1 point
0 comments8 min readLW link

De­con­fus­ing “on­tol­ogy” in AI alignment

Dylan Bowman8 Nov 2023 20:03 UTC
28 points
3 comments7 min readLW link

Open Agency model can solve the AI reg­u­la­tion dilemma

Roman Leventov8 Nov 2023 20:00 UTC
22 points
1 comment2 min readLW link

Gothen­burg LW /​ ACX meetup

Stefan8 Nov 2023 19:52 UTC
1 point
0 comments1 min readLW link

[Question] Why is less­wrong block­ing wget and curl (scrape)?

Nicolas Lacombe8 Nov 2023 19:42 UTC
21 points
12 comments1 min readLW link

[Question] Is there a less­wrong archive of all pub­lic posts?

Nicolas Lacombe8 Nov 2023 19:26 UTC
10 points
7 comments1 min readLW link

Five pro­jects from AI Safety Hub Labs 2023

charlie_griffin8 Nov 2023 19:19 UTC
47 points
1 comment6 min readLW link
(www.aisafetyhub.org)

[Question] Can a stupid per­son be­come in­tel­li­gent?

A. T.8 Nov 2023 19:01 UTC
13 points
24 comments2 min readLW link

Pros­thetic Intelligence

Krantz8 Nov 2023 19:01 UTC
4 points
9 comments2 min readLW link

[Question] Do you have a satis­fac­tory work­flow for learn­ing about a line of re­search us­ing GPT4, Claude, etc?

ryan_b8 Nov 2023 18:05 UTC
9 points
3 comments1 min readLW link

What’s go­ing on? LLMs and IS-A sen­tences

Bill Benzon8 Nov 2023 16:58 UTC
6 points
15 comments4 min readLW link

[Question] What will hap­pen with real es­tate prices dur­ing a slow take­off?

Ricardo Meneghin8 Nov 2023 11:58 UTC
8 points
1 comment1 min readLW link

Tall Tales at Differ­ent Scales: Eval­u­at­ing Scal­ing Trends For De­cep­tion In Lan­guage Models

8 Nov 2023 11:37 UTC
49 points
0 comments18 min readLW link

How well does your re­search adress the the­ory-prac­tice gap?

Jonas Hallgren8 Nov 2023 11:27 UTC
18 points
0 comments10 min readLW link

Growth and Form in a Toy Model of Superposition

8 Nov 2023 11:08 UTC
87 points
5 comments14 min readLW link

Run­ning your own work­shop on han­dling hos­tile disagreements

Camille Berger 8 Nov 2023 10:28 UTC
12 points
1 comment7 min readLW link

Think­ing By The Clock

Screwtape8 Nov 2023 7:40 UTC
185 points
27 comments8 min readLW link