Emergence and Amplification of Survival

jgraves01 · Dec 28, 2024, 11:52 PM
−1 points
0 comments · 3 min read · LW link

[Question] Has Someone Checked The Cold-Water-In-Left-Ear Thing?

Maloew · Dec 28, 2024, 8:15 PM
11 points
0 comments · 1 min read · LW link

By default, capital will matter more than ever after AGI

L Rudolf L · Dec 28, 2024, 5:52 PM
289 points
100 comments · 16 min read · LW link
(nosetgauge.substack.com)

AI Assistants Should Have a Direct Line to Their Developers

Jan_Kulveit · Dec 28, 2024, 5:01 PM
57 points
6 comments · 2 min read · LW link

No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate!

Christopher King · Dec 28, 2024, 4:05 PM
7 points
11 comments · 1 min read · LW link

The average rationalist IQ is about 122

Rockenots · Dec 28, 2024, 3:42 PM
20 points
23 comments · 1 min read · LW link

Why OpenAI’s Structure Must Evolve To Advance Our Mission

stuhlmueller · Dec 28, 2024, 4:24 AM
19 points
1 comment · 1 min read · LW link
(openai.com)

The Engineering Argument Fallacy: Why Technological Success Doesn’t Validate Physics

Wenitte Apiou · Dec 28, 2024, 12:49 AM
−16 points
5 comments · 2 min read · LW link

The Robot, the Puppet-master, and the Psychohistorian

WillPetillo · Dec 28, 2024, 12:12 AM
8 points
2 comments · 3 min read · LW link

Progress links and short notes, 2024-12-27: Clinical trial abundance, grid-scale fusion, permitting vs. compliance, crossword mania, and more

jasoncrawford · Dec 27, 2024, 11:34 PM
11 points
0 comments · 2 min read · LW link
(newsletter.rootsofprogress.org)

Greedy-Advantage-Aware RLHF

sej2020 · Dec 27, 2024, 7:47 PM
48 points
15 comments · 13 min read · LW link

Deconstructing arguments against AI art

DMMF · Dec 27, 2024, 7:40 PM
7 points
5 comments · 5 min read · LW link
(danfrank.ca)

From the Archives: a story

Richard_Ngo · Dec 27, 2024, 4:36 PM
20 points
1 comment · 16 min read · LW link
(www.narrativeark.xyz)

[Question] What’s the best metric for measuring quality of life?

ChristianKl · Dec 27, 2024, 2:29 PM
10 points
5 comments · 1 min read · LW link

Review: Planecrash

L Rudolf L · Dec 27, 2024, 2:18 PM
360 points
45 comments · 22 min read · LW link
(nosetgauge.substack.com)

Good Fortune and Many Worlds

Jonah Wilberg · Dec 27, 2024, 1:21 PM
4 points
0 comments · 5 min read · LW link

Letter from an Alien Mind

Shoshannah Tekofsky · Dec 27, 2024, 1:20 PM
23 points
7 comments · 3 min read · LW link
(open.substack.com)

Coin Flip

XelaP · Dec 27, 2024, 11:53 AM
17 points
0 comments · 1 min read · LW link

If all trade is voluntary, then what is “exploitation?”

Darmani · Dec 27, 2024, 11:21 AM
34 points
61 comments · 6 min read · LW link

Duplicate token neurons in the first layer of GPT-2

Alex Gibson · Dec 27, 2024, 4:21 AM
4 points
0 comments · 5 min read · LW link

[Question] What are the most interesting / challenging evals (for humans) available?

Raemon · Dec 27, 2024, 3:05 AM
40 points
13 comments · 2 min read · LW link

Algorithmic Asubjective Anthropics, Cartesian Subjective Anthropics

Lorec · Dec 27, 2024, 1:58 AM
2 points
0 comments · 4 min read · LW link

Corrigibility’s Desirability is Timing-Sensitive

RobertM · Dec 26, 2024, 10:24 PM
29 points
4 comments · 3 min read · LW link

PCR retrospective

bhauth · Dec 26, 2024, 9:20 PM
24 points
0 comments · 8 min read · LW link
(bhauth.com)

AI #96: o3 But Not Yet For Thee

Zvi · Dec 26, 2024, 8:30 PM
58 points
8 comments · 36 min read · LW link
(thezvi.wordpress.com)

Super human AI is a very low hanging fruit!

Hzn · Dec 26, 2024, 7:00 PM
−4 points
0 comments · 7 min read · LW link

The Field of AI Alignment: A Postmortem, and What To Do About It

johnswentworth · Dec 26, 2024, 6:48 PM
302 points
160 comments · 8 min read · LW link

ReSolsticed vol I: “We’re Not Going Quietly”

Raemon · Dec 26, 2024, 5:52 PM
61 points
4 comments · 19 min read · LW link

[Question] Are Sparse Autoencoders a good idea for AI control?

Gerard Boxo · Dec 26, 2024, 5:34 PM
3 points
4 comments · 1 min read · LW link

A Three-Layer Model of LLM Psychology

Jan_Kulveit · Dec 26, 2024, 4:49 PM
218 points
13 comments · 8 min read · LW link

Human, All Too Human - Superintelligence requires learning things we can’t teach

Ben Turtel · Dec 26, 2024, 4:26 PM
−13 points
4 comments · 1 min read · LW link
(bturtel.substack.com)

[Question] Why don’t we currently have AI agents?

ChristianKl · Dec 26, 2024, 3:26 PM
8 points
10 comments · 1 min read · LW link

[Question] What would be the IQ and other benchmarks of o3 that uses $1 million worth of compute resources to answer one question?

avturchin · Dec 26, 2024, 11:08 AM
16 points
2 comments · 1 min read · LW link

The Economics & Practicality of Starting Mars Colonization

Zero Contradictions · Dec 26, 2024, 10:56 AM
2 points
1 comment · 1 min read · LW link
(zerocontradictions.net)

Terminal goal vs Intelligence

Donatas Lučiūnas · Dec 26, 2024, 8:10 AM
−12 points
24 comments · 1 min read · LW link

Streamlining my voice note process

Vlad Sitalo · Dec 26, 2024, 6:04 AM
6 points
1 comment · 7 min read · LW link
(vlad.roam.garden)

Whistleblowing Twitter Bot

Mckiev · Dec 26, 2024, 4:09 AM
19 points
5 comments · 2 min read · LW link

Open Thread Winter 2024/2025

habryka · Dec 25, 2024, 9:02 PM
23 points
59 comments · 1 min read · LW link

Exploring Cooperation: The Path to Utopia

Davidmanheim · Dec 25, 2024, 6:31 PM
11 points
0 comments · LW link
(exploringcooperation.substack.com)

Living with Rats in College

lsusr · Dec 25, 2024, 10:44 AM
28 points
0 comments · 1 min read · LW link

[Question] What Have Been Your Most Valuable Casual Conversations At Conferences?

johnswentworth · Dec 25, 2024, 5:49 AM
54 points
21 comments · 1 min read · LW link

The Opening Salvo: 1. An Ontological Consciousness Metric: Resistance to Behavioral Modification as a Measure of Recursive Awareness

Peterpiper · Dec 25, 2024, 2:29 AM
−3 points
0 comments · 5 min read · LW link

The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228)

Dec 24, 2024, 10:45 PM
45 points
4 comments · 91 min read · LW link
(thebayesianconspiracy.substack.com)

Acknowledging Background Information with P(Q|I)

JenniferRM · Dec 24, 2024, 6:50 PM
29 points
8 comments · 14 min read · LW link

Game Theory and Behavioral Economics in The Stock Market

Jaiveer Singh · Dec 24, 2024, 6:15 PM
1 point
0 comments · 3 min read · LW link

[Question] What are the main arguments against AGI?

Edy Nastase · Dec 24, 2024, 3:49 PM
1 point
6 comments · 1 min read · LW link

[Question] Recommendations on communities that discuss AI applications in society

Annapurna · Dec 24, 2024, 1:37 PM
7 points
2 comments · 1 min read · LW link

AIs Will Increasingly Fake Alignment

Zvi · Dec 24, 2024, 1:00 PM
89 points
0 comments · 52 min read · LW link
(thezvi.wordpress.com)

Apply to the 2025 PIBBSS Summer Research Fellowship

Dec 24, 2024, 10:25 AM
15 points
0 comments · 2 min read · LW link

Human-AI Complementarity: A Goal for Amplified Oversight

Dec 24, 2024, 9:57 AM
27 points
4 comments · 1 min read · LW link
(deepmindsafetyresearch.medium.com)