Briefly Ex­tend­ing Differ­en­tial Op­ti­miza­tion to Distributions

J BostockMar 10, 2024, 8:41 PM
4 points
0 comments2 min readLW link

Evolu­tion did a sur­pris­ing good job at al­ign­ing hu­mans...to so­cial status

Eli TyreMar 10, 2024, 7:34 PM
24 points
37 comments1 min readLW link

Paus­ing AI is Pos­i­tive Ex­pected Value

LironMar 10, 2024, 5:10 PM
9 points
2 comments3 min readLW link
(twitter.com)

W2SG: Introduction

Maria KaprosMar 10, 2024, 4:25 PM
2 points
2 comments10 min readLW link

An Op­ti­mistic Solu­tion to the Fermi Paradox

Glenn ClaytonMar 10, 2024, 2:39 PM
4 points
6 comments13 min readLW link

Coun­ter­fac­tual Civ­i­liza­tion Si­mu­la­tion Ver­sion −1.0 aka my ap­pli­ca­tion to Jo­hannes Mayer’s SPAR project

MorphismMar 10, 2024, 10:10 AM
1 point
0 comments14 min readLW link

Notes from a Prompt Factory

Richard_NgoMar 10, 2024, 5:13 AM
104 points
19 comments9 min readLW link
(www.narrativeark.xyz)

In­ves­ti­gat­ing Basin Vol­ume with XOR Networks

CatGoddessMar 10, 2024, 1:35 AM
10 points
0 comments5 min readLW link

[Linkpost] MindEye2: Shared-Sub­ject Models En­able fMRI-To-Image With 1 Hour of Data

Bogdan Ionut CirsteaMar 10, 2024, 1:30 AM
10 points
0 comments1 min readLW link
(openreview.net)

0th Per­son and 1st Per­son Logic

Adele LopezMar 10, 2024, 12:56 AM
60 points
28 comments6 min readLW link

Com­ple­tion Estimates

scarcegreengrassMar 9, 2024, 10:56 PM
7 points
2 comments3 min readLW link

Semi-Sim­pli­cial Types, Part I: Mo­ti­va­tion and History

astradiolMar 9, 2024, 10:07 PM
20 points
3 comments10 min readLW link

Distinc­tions when Dis­cussing Utility Functions

ozziegooenMar 9, 2024, 8:14 PM
24 points
7 commentsLW link

What is progress?

jasoncrawfordMar 9, 2024, 4:28 PM
10 points
4 comments6 min readLW link
(rootsofprogress.org)

Fif­teen Law­suits against OpenAI

RemmeltMar 9, 2024, 12:22 PM
27 points
4 comments1 min readLW link

Cam­bridge ACX/​SSC monthly meetup (lo­ca­tion changed to Fort St Ge­orge!)

hamishtodd1Mar 9, 2024, 11:10 AM
2 points
0 comments1 min readLW link

MA E-ZPass Without a Car?

jefftkMar 9, 2024, 2:40 AM
15 points
2 comments1 min readLW link
(www.jefftk.com)

Close­ness To the Is­sue (Part 5 of “The Sense Of Phys­i­cal Ne­ces­sity”)

LoganStrohlMar 9, 2024, 12:36 AM
36 points
0 comments15 min readLW link

Ex­plor­ing the Evolu­tion and Mi­gra­tion of Differ­ent Layer Embed­ding in LLMs

Ruixuan HuangMar 8, 2024, 3:01 PM
6 points
0 comments8 min readLW link

[Question] When and why did ‘train­ing’ be­come ‘pre­train­ing’?

berenMar 8, 2024, 2:29 PM
16 points
6 comments1 min readLW link

A T-o-M test: ‘pop­corn’ or ‘choco­late’

MiguelDevMar 8, 2024, 4:24 AM
20 points
13 comments1 min readLW link

Sce­nario Fore­cast­ing Work­shop: Ma­te­ri­als and Learnings

Mar 8, 2024, 2:30 AM
50 points
3 comments2 min readLW link

Fore­cast­ing fu­ture gains due to post-train­ing enhancements

Mar 8, 2024, 2:11 AM
31 points
2 comments1 min readLW link
(docs.google.com)

Do LLMs some­time simu­late some­thing akin to a dream?

NezekMar 8, 2024, 1:25 AM
8 points
4 comments1 min readLW link

Com­mu­nity norms poll (2 mins)

Nathan YoungMar 7, 2024, 9:45 PM
11 points
1 comment1 min readLW link

An­nounc­ing Con­ver­gence Anal­y­sis: An In­sti­tute for AI Sce­nario & Gover­nance Research

Mar 7, 2024, 9:37 PM
23 points
1 comment4 min readLW link

Woods’ new preprint on ob­ject permanence

Steven ByrnesMar 7, 2024, 9:29 PM
58 points
1 comment6 min readLW link

MATS AI Safety Strat­egy Curriculum

Mar 7, 2024, 7:59 PM
74 points
2 comments16 min readLW link

Poli­ti­cal Bi­ases in LLMs: Liter­a­ture Re­view & Cur­rent Uses of AI in Elections

Mar 7, 2024, 7:17 PM
6 points
0 comments6 min readLW link

Ev­i­den­tial Cor­re­la­tions are Sub­jec­tive, and it might be a problem

Martín SotoMar 7, 2024, 6:37 PM
26 points
6 comments14 min readLW link

AI Safety 101 : Ca­pa­bil­ities—Hu­man Level AI, What? How? and When?

Mar 7, 2024, 5:29 PM
46 points
8 comments54 min readLW link

A Re­view of Weak to Strong Gen­er­al­iza­tion [AI Safety Camp]

sevdeawesomeMar 7, 2024, 5:16 PM
14 points
0 comments9 min readLW link

AISN #32: Mea­sur­ing and Re­duc­ing Hazardous Knowl­edge in LLMs Plus, Fore­cast­ing the Fu­ture with LLMs, and Reg­u­la­tory Markets

Mar 7, 2024, 4:39 PM
8 points
0 comments8 min readLW link
(newsletter.safe.ai)

AI #54: Claud­ing Along

ZviMar 7, 2024, 4:00 PM
45 points
11 comments51 min readLW link
(thezvi.wordpress.com)

Be­ing In­ter­ested in Other People

Jonathan MoregårdMar 7, 2024, 10:13 AM
14 points
1 comment3 min readLW link
(youbutbetter.substack.com)

Talk­ing to Congress: Can con­stituents con­tact­ing their leg­is­la­tor in­fluence policy?

Tristan WilliamsMar 7, 2024, 9:24 AM
14 points
0 commentsLW link

Ex­plain­ing the AI Align­ment Prob­lem to Ti­be­tan Bud­dhist Monks

Paul CologneseMar 7, 2024, 9:00 AM
20 points
3 comments6 min readLW link

What if Align­ment is Not Enough?

WillPetilloMar 7, 2024, 8:10 AM
15 points
46 comments9 min readLW link

Sparks of AGI prompts on GPT2XL and its var­i­ant, RLLMv3

MiguelDevMar 7, 2024, 6:33 AM
4 points
0 comments4 min readLW link

An AI, a box, and a threat

jwfiredragonMar 7, 2024, 6:15 AM
9 points
0 comments6 min readLW link

Mud and De­s­pair (Part 4 of “The Sense Of Phys­i­cal Ne­ces­sity”)

LoganStrohlMar 7, 2024, 12:14 AM
38 points
0 comments2 min readLW link

in­tro­duc­tion to ther­mal con­duc­tivity and noise management

bhauthMar 6, 2024, 11:14 PM
31 points
1 comment4 min readLW link
(www.bhauth.com)

Es­say­ing Other Plans

Screwtape6 Mar 2024 22:59 UTC
29 points
4 comments7 min readLW link

In­vest in ACX Grants pro­jects!

Saul Munn6 Mar 2024 20:27 UTC
23 points
1 commentLW link

Vote on An­thropic Topics to Discuss

Ben Pace6 Mar 2024 19:43 UTC
75 points
55 comments1 min readLW link

Sim­ple Kelly bet­ting in pre­dic­tion markets

jessicata6 Mar 2024 18:59 UTC
38 points
3 comments3 min readLW link
(unstablerontology.substack.com)

On Claude 3.0

Zvi6 Mar 2024 18:50 UTC
76 points
5 comments31 min readLW link
(thezvi.wordpress.com)

[Question] Why cor­re­la­tion, though?

numpyNaN6 Mar 2024 16:53 UTC
22 points
7 comments1 min readLW link

Us­ing axis lines for good or evil

dynomight6 Mar 2024 14:47 UTC
151 points
39 comments4 min readLW link
(dynomight.net)

Let’s build definitely-not-con­scious AI

lemonhope6 Mar 2024 7:50 UTC
4 points
18 comments1 min readLW link