What You Can Give Instead of Advice

Karl Faulks · Oct 24, 2024, 11:10 PM
13 points
2 comments · 1 min read · LW link

[Question] is it possible to comment anonymously on a post?

KvmanThinking · Oct 24, 2024, 10:24 PM
2 points
2 comments · 1 min read · LW link

Logical Proof for the Emergence and Substrate Independence of Sentience

rife · Oct 24, 2024, 9:08 PM
4 points
31 comments · 1 min read · LW link
(awakenmoon.ai)

Against Job Boards: Human Capital and the Legibility Trap

vaishnav92 · Oct 24, 2024, 8:50 PM
6 points
1 comment · 5 min read · LW link

IAPS: Mapping Technical Safety Research at AI Companies

Zach Stein-Perlman · Oct 24, 2024, 8:30 PM
42 points
13 comments · LW link
(www.iaps.ai)

Our Digital and Biological Children

Eneasz · Oct 24, 2024, 6:36 PM
28 points
0 comments · 3 min read · LW link
(deathisbad.substack.com)

Reflections on the Metastrategies Workshop

gw · Oct 24, 2024, 6:30 PM
41 points
5 comments · 11 min read · LW link

How Should We Measure Intelligence Models: Why Use Frequency of Elemental Information Operations

hwj20 · Oct 24, 2024, 4:54 PM
1 point
0 comments · 5 min read · LW link

Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models.

happy friday · Oct 24, 2024, 4:54 PM
8 points
0 comments · 1 min read · LW link

Balancing Label Quantity and Quality for Scalable Elicitation

Alex Mallen · Oct 24, 2024, 4:49 PM
31 points
1 comment · 2 min read · LW link

Claude Sonnet 3.5.1 and Haiku 3.5

Zvi · Oct 24, 2024, 2:50 PM
51 points
9 comments · 16 min read · LW link
(thezvi.wordpress.com)

Big tech transitions are slow (with implications for AI)

jasoncrawford · Oct 24, 2024, 2:25 PM
36 points
16 comments · 4 min read · LW link
(blog.rootsofprogress.org)

Derivative AT a discontinuity

Alok Singh · Oct 24, 2024, 2:48 AM
9 points
5 comments · 10 min read · LW link

how to rapidly assimilate new information

dhruvmethi · Oct 24, 2024, 2:18 AM
9 points
3 comments · 8 min read · LW link

Ex-OpenAI researcher says OpenAI mass-violated copyright law

Remmelt · Oct 24, 2024, 1:00 AM
0 points
0 comments · LW link
(suchir.net)

Miles Brundage resigned from OpenAI, and his AGI readiness team was disbanded

garrison · Oct 23, 2024, 11:40 PM
118 points
1 comment · 7 min read · LW link
(garrisonlovely.substack.com)

A metaphor: what “green lights” for AGI would look like

Lorec · Oct 23, 2024, 11:24 PM
−1 points
6 comments · 2 min read · LW link

Motte-and-Bailey: a Short Explanation

Lorec · Oct 23, 2024, 10:29 PM
12 points
0 comments · 1 min read · LW link

Self-prediction acts as an emergent regularizer

Oct 23, 2024, 10:27 PM
91 points
9 comments · 4 min read · LW link

Technical Risks of (Lethal) Autonomous Weapons Systems

Heramb · Oct 23, 2024, 8:41 PM
2 points
0 comments · 1 min read · LW link
(encodejustice.org)

Appealing to the Public

jefftk · Oct 23, 2024, 7:00 PM
16 points
0 comments · 5 min read · LW link
(www.jefftk.com)

Introducing Transluce — A Letter from the Founders

jsteinhardt · Oct 23, 2024, 6:10 PM
74 points
3 comments · 3 min read · LW link
(bounded-regret.ghost.io)

Are we dropping the ball on Recommendation AIs?

Charbel-Raphaël · Oct 23, 2024, 5:48 PM
41 points
17 comments · 6 min read · LW link

A bird’s eye view of ARC’s research

Jacob_Hilton · Oct 23, 2024, 3:50 PM
121 points
12 comments · 7 min read · LW link
(www.alignment.org)

[Question] Artificial V/S Organoid Intelligence

10xyz · Oct 23, 2024, 2:31 PM
9 points
0 comments · 1 min read · LW link

AI safety tax dynamics

owencb · Oct 23, 2024, 12:18 PM
22 points
0 comments · 6 min read · LW link
(strangecities.substack.com)

What is malevolence? On the nature, measurement, and distribution of dark traits

Oct 23, 2024, 8:41 AM
93 points
23 comments · LW link

Join a LessWrong Team for the Unaging System Challenge

Crissman · Oct 23, 2024, 6:01 AM
15 points
5 comments · 1 min read · LW link

Word Spaghetti

Gordon Seidoh Worley · Oct 23, 2024, 5:39 AM
19 points
9 comments · 3 min read · LW link

Monosemanticity & Quantization

Rahul Chand · Oct 22, 2024, 10:57 PM
1 point
0 comments · 9 min read · LW link

[Question] What is the alpha in one bit of evidence?

J Bostock · Oct 22, 2024, 9:57 PM
20 points
13 comments · 1 min read · LW link

Catastrophic sabotage as a major threat model for human-level AI systems

evhub · Oct 22, 2024, 8:57 PM
92 points
13 comments · 15 min read · LW link

Why I quit effective altruism, and why Timothy Telleen-Lawton is staying (for now)

Elizabeth · Oct 22, 2024, 6:20 PM
76 points
82 comments · 1 min read · LW link
(acesounderglass.com)

Decision-Making Under Uncertainty: Lessons From AI

Jonasb · Oct 22, 2024, 5:54 PM
−1 points
0 comments · 5 min read · LW link
(www.denominations.io)

Testing Genetic Engineering Detection with Spike-Ins

jefftk · Oct 22, 2024, 5:20 PM
9 points
0 comments · LW link
(naobservatory.org)

Predictions as Public Works Project — What Metaculus Is Building Next

ChristianWilliams · Oct 22, 2024, 4:35 PM
5 points
0 comments · LW link
(www.metaculus.com)

Gorges of gender on a terrain of traits

dkl9 · Oct 22, 2024, 4:18 PM
−7 points
1 comment · 3 min read · LW link
(dkl9.net)

A Defense of Peer Review

Oct 22, 2024, 4:16 PM
23 points
1 comment · 22 min read · LW link
(www.asimov.press)

BIG-Bench Canary Contamination in GPT-4

Jozdien · Oct 22, 2024, 3:40 PM
125 points
14 comments · 4 min read · LW link

[Paper Blogpost] When Your AIs Deceive You: Challenges with Partial Observability in RLHF

Leon Lang · Oct 22, 2024, 1:57 PM
51 points
2 comments · 18 min read · LW link
(arxiv.org)

[Intuitive self-models] 6. Awakening / Enlightenment / PNSE

Steven Byrnes · Oct 22, 2024, 1:23 PM
64 points
8 comments · 21 min read · LW link

Resolving von Neumann-Morgenstern Inconsistent Preferences

niplav · Oct 22, 2024, 11:45 AM
38 points
5 comments · 58 min read · LW link

Lenses of Control

WillPetillo · Oct 22, 2024, 7:51 AM
14 points
0 comments · 9 min read · LW link

A Brief Explanation of AI Control

Aaron_Scher · Oct 22, 2024, 7:00 AM
8 points
1 comment · 6 min read · LW link

Longevity, AI, and Cognitive Research Hackathon @ MIT

ekkolápto · Oct 22, 2024, 6:19 AM
1 point
0 comments · 1 min read · LW link

Conversational Signposts—How to stop having boring social interactions

Declan Molony · Oct 22, 2024, 5:37 AM
11 points
6 comments · 2 min read · LW link

I got dysentery so you don’t have to

eukaryote · Oct 22, 2024, 4:55 AM
321 points
6 comments · 17 min read · LW link
(eukaryotewritesblog.com)

Transformers Explained (Again)

RohanS · Oct 22, 2024, 4:06 AM
4 points
0 comments · 18 min read · LW link

Sleeping on Stage

jefftk · Oct 22, 2024, 12:50 AM
26 points
3 comments · 1 min read · LW link
(www.jefftk.com)

The Mask Comes Off: At What Price?

Zvi · Oct 21, 2024, 11:50 PM
72 points
16 comments · 8 min read · LW link
(thezvi.wordpress.com)