An open re­sponse to Wit­tkot­ter and Yampolskiy

Donald HobsonSep 24, 2024, 10:27 PM
8 points
0 comments4 min readLW link

A Path out of In­suffi­cient Views

UnrealSep 24, 2024, 8:00 PM
44 points
65 comments9 min readLW link

How to give effec­tively to US Dems

Hauke HillebrandtSep 24, 2024, 2:38 PM
2 points
0 commentsLW link
(www.slowboring.com)

[Question] How do you fol­low AI (safety) news?

PeterHSep 24, 2024, 1:58 PM
4 points
2 comments1 min readLW link

In­struc­tion Fol­low­ing with­out In­struc­tion Tuning

Bogdan Ionut CirsteaSep 24, 2024, 1:49 PM
17 points
0 comments1 min readLW link
(arxiv.org)

Book Re­view: On the Edge: The Gamblers

ZviSep 24, 2024, 11:50 AM
35 points
1 comment89 min readLW link
(thezvi.wordpress.com)

Edit­ing at the Take Level

jefftkSep 24, 2024, 11:30 AM
12 points
1 comment1 min readLW link
(www.jefftk.com)

Us­ing LLM’s for AI Foun­da­tion re­search and the Sim­ple Solu­tion assumption

Donald HobsonSep 24, 2024, 11:00 AM
5 points
0 comments2 min readLW link

When to join a re­spectabil­ity cascade

B JacobsSep 24, 2024, 7:54 AM
10 points
1 comment2 min readLW link
(bobjacobs.substack.com)

Sam­pling Effects on Strate­gic Be­hav­ior in Su­per­vised Learn­ing Models

Phil BlandSep 24, 2024, 7:44 AM
1 point
0 comments6 min readLW link

In Praise of the Beatitudes

robotelvisSep 24, 2024, 5:08 AM
9 points
7 comments3 min readLW link
(messyprogress.substack.com)

[Question] What are the best ar­gu­ments for/​against AIs be­ing “slightly ‘nice’”?

RaemonSep 24, 2024, 2:00 AM
99 points
61 comments31 min readLW link

Strug­gling like a Shadowmoth

RaemonSep 24, 2024, 12:47 AM
184 points
38 comments7 min readLW link

Bounty for Ev­i­dence on Some of Pal­isade Re­search’s Beliefs

Sep 23, 2024, 8:01 PM
46 points
4 comments2 min readLW link

Pre­dict­ing In­fluenza Abun­dance in Wastew­a­ter Me­tage­nomic Se­quenc­ing Data

jefftkSep 23, 2024, 5:25 PM
27 points
0 comments4 min readLW link
(naobservatory.org)

A primer on ML in an­ti­body engineering

Abhishaike MahajanSep 23, 2024, 5:03 PM
11 points
0 comments25 min readLW link
(www.owlposting.com)

[Question] On the sub­ject of in-house large lan­guage mod­els ver­sus im­ple­ment­ing fron­tier models

AnnapurnaSep 23, 2024, 3:00 PM
7 points
1 comment1 min readLW link

A ba­sic sys­tems ar­chi­tec­ture for AI agents that do au­tonomous research

BuckSep 23, 2024, 1:58 PM
189 points
16 comments8 min readLW link

Book Re­view: On the Edge: The Fundamentals

ZviSep 23, 2024, 1:40 PM
64 points
3 comments31 min readLW link
(thezvi.wordpress.com)

Switch­ing to a 4GB SD

jefftkSep 23, 2024, 11:20 AM
11 points
1 comment1 min readLW link
(www.jefftk.com)

Model evals for dan­ger­ous capabilities

Zach Stein-PerlmanSep 23, 2024, 11:00 AM
51 points
11 comments3 min readLW link

Foun­da­tions—Why Bri­tain has stag­nated [cross­post]

Nathan YoungSep 23, 2024, 10:43 AM
23 points
1 comment57 min readLW link
(ukfoundations.co)

Boons and banes

dkl9Sep 23, 2024, 6:18 AM
7 points
0 comments2 min readLW link
(dkl9.net)

The Sun is big, but su­per­in­tel­li­gences will not spare Earth a lit­tle sunlight

Eliezer YudkowskySep 23, 2024, 3:39 AM
207 points
143 comments13 min readLW link

GPT4o is still sen­si­tive to user-in­duced bias when writ­ing code

Sep 22, 2024, 9:04 PM
6 points
0 comments4 min readLW link

My 10-year ret­ro­spec­tive on try­ing SSRIs

Kaj_SotalaSep 22, 2024, 8:30 PM
80 points
9 comments2 min readLW link
(kajsotala.fi)

Mak­ing Eggs Without Ovaries

Sep 22, 2024, 5:44 PM
58 points
3 comments16 min readLW link
(www.asimov.press)

Becket First

jefftkSep 22, 2024, 5:10 PM
9 points
0 comments2 min readLW link
(www.jefftk.com)

On the Role of Proto-Languages

adamShimiSep 22, 2024, 4:50 PM
54 points
1 comment4 min readLW link
(epistemologicalfascinations.substack.com)

I’m cre­at­ing a deep dive pod­cast epi­sode about the origi­nal Lev­er­age Re­search—would you like to take part?

spencergSep 22, 2024, 2:03 PM
37 points
2 comments1 min readLW link

Who Feels More Alone?

marvinscheffoldSep 22, 2024, 11:54 AM
−8 points
2 comments39 min readLW link

Another ar­gu­ment against util­ity-cen­tric al­ign­ment paradigms

Fiora SunshineSep 22, 2024, 7:28 AM
67 points
39 comments8 min readLW link

My hopes for YouCongress.com

Nathan Helm-BurgerSep 22, 2024, 3:20 AM
14 points
3 comments4 min readLW link

How Often Does Tak­ing Away Op­tions Help?

niplavSep 21, 2024, 9:52 PM
21 points
7 comments2 min readLW link

A Ra­tional Com­pany—Seek­ing Advisors

AlignmentOptimizerSep 21, 2024, 7:51 PM
0 points
1 comment1 min readLW link

Seek­ing mentorship

Kevin AfachaoSep 21, 2024, 4:54 PM
5 points
0 comments1 min readLW link

Ap­pli­ca­tions of Chaos: Say­ing No (with Hast­ings Greer)

ElizabethSep 21, 2024, 4:30 PM
50 points
16 comments2 min readLW link
(acesounderglass.com)

In­ves­ti­gat­ing an in­surance-for-AI startup

Sep 21, 2024, 3:29 PM
70 points
0 comments16 min readLW link
(www.strataoftheworld.com)

An Un­mea­sured Song of Measurement

jan SijanSep 21, 2024, 3:08 PM
−3 points
0 comments4 min readLW link

Should Sports Bet­ting Be Banned?

Maxwell TabarrokSep 21, 2024, 2:13 PM
18 points
2 comments4 min readLW link
(www.maximum-progress.com)

Work with me on agent foun­da­tions: in­de­pen­dent fellowship

Alex_AltairSep 21, 2024, 1:59 PM
59 points
5 comments4 min readLW link

Elec­tric Mandola

jefftkSep 21, 2024, 1:40 PM
9 points
0 comments1 min readLW link
(www.jefftk.com)

Glitch To­ken Cat­a­log - (Al­most) a Full Clear

Lao MeinSep 21, 2024, 12:22 PM
38 points
3 comments37 min readLW link

The Other Ex­is­ten­tial Crisis

James Stephen BrownSep 21, 2024, 1:16 AM
9 points
24 comments2 min readLW link

Ap­ply to MATS 7.0!

Sep 21, 2024, 12:23 AM
32 points
0 comments5 min readLW link

Moscow – ACX Mee­tups Every­where Fall 2024

red-haraSep 20, 2024, 11:03 PM
−1 points
0 comments1 min readLW link

Val­i­dat­ing /​ find­ing al­ign­ment-rele­vant con­cepts us­ing neu­ral data

Bogdan Ionut CirsteaSep 20, 2024, 9:12 PM
7 points
0 comments1 min readLW link
(docs.google.com)

Aug­ment­ing Statis­ti­cal Models with Nat­u­ral Lan­guage Parameters

jsteinhardtSep 20, 2024, 6:30 PM
34 points
0 comments8 min readLW link
(bounded-regret.ghost.io)

Fun With The Tab­ula Muris (Se­nis)

sarahconstantinSep 20, 2024, 6:20 PM
25 points
0 comments8 min readLW link
(sarahconstantin.substack.com)

My Cri­tique of Effec­tive Altruism

Dylan PriceSep 20, 2024, 5:41 PM
−10 points
8 comments4 min readLW link