To an LLM, everything looks like a logic puzzle

Jesse Richardson · 16 May 2024 22:21 UTC
14 points
2 comments · 2 min read · LW link

AI Safety Institute’s Inspect hello world example for AI evals

TheManxLoiner · 16 May 2024 20:47 UTC
3 points
0 comments · 1 min read · LW link
(lovkush.medium.com)

Feeling (instrumentally) Rational

Morphism · 16 May 2024 18:56 UTC
14 points
5 comments · 1 min read · LW link

Advice for Activists from the History of Environmentalism

Jeffrey Heninger · 16 May 2024 18:40 UTC
100 points
10 comments · 6 min read · LW link
(blog.aiimpacts.org)

Ninety-five theses on AI

hamandcheese · 16 May 2024 17:51 UTC
21 points
0 comments · 7 min read · LW link

GPT-4o My and Google I/O Day

Zvi · 16 May 2024 17:50 UTC
41 points
2 comments · 37 min read · LW link
(thezvi.wordpress.com)

AI #64: Feel the Mundane Utility

Zvi · 16 May 2024 15:20 UTC
28 points
11 comments · 47 min read · LW link
(thezvi.wordpress.com)

AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data

16 May 2024 14:29 UTC
2 points
3 comments · 6 min read · LW link
(newsletter.safe.ai)

FMT: a great opportunity for (soon-to-be) parents

EternallyBlissful · 16 May 2024 13:24 UTC
13 points
1 comment · 18 min read · LW link

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Gunnar_Zarncke · 16 May 2024 13:09 UTC
51 points
20 comments · 1 min read · LW link
(arxiv.org)

The Dunning-Kruger of disproving Dunning-Kruger

kromem · 16 May 2024 10:11 UTC
57 points
0 comments · 5 min read · LW link

A case for fairness-enforcing irrational behavior

cousin_it · 16 May 2024 9:41 UTC
16 points
3 comments · 2 min read · LW link

Podcast: Eye4AI on 2023 Survey

KatjaGrace · 16 May 2024 7:40 UTC
8 points
0 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

Against “argument from overhang risk”

RobertM · 16 May 2024 4:44 UTC
31 points
11 comments · 5 min read · LW link

[Question] How can I make the most of Less Online/Camp/Manifest?

ErioirE · 16 May 2024 2:19 UTC
13 points
2 comments · 1 min read · LW link

Do you believe in hundred dollar bills lying on the ground? Consider humming

Elizabeth · 16 May 2024 0:00 UTC
122 points
22 comments · 6 min read · LW link
(acesounderglass.com)

Tackling Moloch: How YouCongress Offers a Novel Coordination Mechanism

Hector Perez Arenas · 15 May 2024 23:13 UTC
29 points
9 comments · 6 min read · LW link

Introducing Statistical Utility Mechanics: A Framework for Utility Maximizers

J Bostock · 15 May 2024 21:56 UTC
12 points
0 comments · 7 min read · LW link

Taming Infinity (Stat Mech Part 3)

J Bostock · 15 May 2024 21:43 UTC
11 points
0 comments · 7 min read · LW link

Let’s Design A School, Part 2.3 School as Education—The Curriculum (Phase 2, Specific)

Sable · 15 May 2024 20:58 UTC
24 points
0 comments · 14 min read · LW link
(affablyevil.substack.com)

“A Paradigm for AI Consciousness”—Seeds of Science call for reviewers

rogersbacon · 15 May 2024 20:55 UTC
7 points
0 comments · 1 min read · LW link

Why you should learn a musical instrument

cata · 15 May 2024 20:36 UTC
51 points
23 comments · 3 min read · LW link

Contra Caller Gender III

jefftk · 15 May 2024 19:40 UTC
8 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Instruction-following AGI is easier and more likely than value aligned AGI

Seth Herd · 15 May 2024 19:38 UTC
80 points
28 comments · 12 min read · LW link

Why I’ll Keep My Crummy Drawings—How Generative AI Art Won’t Supplant… Art.

James Stephen Brown · 15 May 2024 19:30 UTC
18 points
7 comments · 4 min read · LW link

[Question] How is GPT-4o Related to GPT-4?

Joel Burget · 15 May 2024 18:33 UTC
10 points
2 comments · 1 min read · LW link

[Linkpost] Please don’t take Lumina’s anticavity probiotic

ROM · 15 May 2024 18:03 UTC
13 points
5 comments · 4 min read · LW link
(trevorklee.substack.com)

Was Partisanship Good for the Environmental Movement?

Jeffrey Heninger · 15 May 2024 17:30 UTC
24 points
0 comments · 5 min read · LW link
(blog.aiimpacts.org)

Calling all experts

sleno · 15 May 2024 15:22 UTC
1 point
0 comments · 1 min read · LW link

[Question] Quantized vs. continuous nature of qualia

Terence Coelho · 15 May 2024 12:52 UTC
6 points
18 comments · 1 min read · LW link

How to be a messy thinker

invertedpassion · 15 May 2024 11:57 UTC
6 points
0 comments · 7 min read · LW link

Embedded Whistle Synth

jefftk · 15 May 2024 2:50 UTC
9 points
0 comments · 3 min read · LW link
(www.jefftk.com)

Catastrophic Goodhart in RL with KL penalty

15 May 2024 0:58 UTC
62 points
10 comments · 7 min read · LW link

Ilya Sutskever and Jan Leike resign from OpenAI [updated]

Zach Stein-Perlman · 15 May 2024 0:45 UTC
246 points
95 comments · 2 min read · LW link

my note system

bhauth · 15 May 2024 0:20 UTC
22 points
5 comments · 2 min read · LW link
(www.bhauth.com)

MIRI’s May 2024 Newsletter

Harlan · 15 May 2024 0:13 UTC
79 points
1 comment · 3 min read · LW link
(intelligence.org)

The Greater Goal: Sharing Knowledge with the Cosmos

pda.everyday · 14 May 2024 22:46 UTC
0 points
1 comment · 2 min read · LW link

Teaching CS During Take-Off

andrew carle · 14 May 2024 22:45 UTC
92 points
13 comments · 2 min read · LW link

A Narrative History of Environmentalism’s Partisanship

Jeffrey Heninger · 14 May 2024 16:51 UTC
31 points
3 comments · 10 min read · LW link
(blog.aiimpacts.org)

How much AI inference can we do?

Benjamin_Todd · 14 May 2024 15:10 UTC
20 points
7 comments · 5 min read · LW link
(benjamintodd.substack.com)

How to do conceptual research: Case study interview with Caspar Oesterheld

Chi Nguyen · 14 May 2024 15:09 UTC
48 points
5 comments · 9 min read · LW link

[Question] In the context of AI interp. What is a feature exactly?

f3mi · 14 May 2024 13:46 UTC
9 points
1 comment · 1 min read · LW link

Announcing the AI Safety Summit Talks with Yoshua Bengio

otto.barten · 14 May 2024 12:52 UTC
9 points
1 comment · 1 min read · LW link

Can AI partners make human ones pale in comparison?

false · 14 May 2024 10:57 UTC
−1 points
1 comment · 5 min read · LW link
(arminbagrat.com)

[Question] Why do we enjoy music?

metachirality · 14 May 2024 8:29 UTC
5 points
3 comments · 1 min read · LW link

Emergence Is a Universal Non-Zero-Sum Phenomenon.

James Stephen Brown · 14 May 2024 8:06 UTC
6 points
0 comments · 1 min read · LW link
(nonzerosum.games)

D&D.Sci Long War: Defender of Data-mocracy Evaluation & Ruleset

aphyer · 14 May 2024 3:35 UTC
44 points
3 comments · 6 min read · LW link

Environmentalism in the United States Is Unusually Partisan

Jeffrey Heninger · 13 May 2024 21:23 UTC
85 points
26 comments · 4 min read · LW link
(blog.aiimpacts.org)

OpenAI releases GPT-4o, natively interfacing with text, voice and vision

Martín Soto · 13 May 2024 18:50 UTC
54 points
23 comments · 1 min read · LW link
(openai.com)

GPT-4o is out

WitheringWeights · 13 May 2024 18:33 UTC
21 points
1 comment · 1 min read · LW link