[Question] Karma votes: blind to or ac­count­ing for score?

cataJun 22, 2024, 9:40 PM
19 points
4 comments1 min readLW link

[Question] Should effec­tive al­tru­ism be more “cool”?

jaredmantellJun 22, 2024, 8:42 PM
3 points
3 comments1 min readLW link

Meta Align­ment: Com­mu­ni­ca­tion Wack-a-Mole

Bridgett KayJun 22, 2024, 8:12 PM
16 points
2 comments5 min readLW link
(dxmrevealed.wordpress.com)

AI as a com­put­ing plat­form: what to expect

JonasbJun 22, 2024, 7:55 PM
−3 points
0 comments7 min readLW link
(www.denominations.io)

Ex­pected num­ber of tries

adiosJun 22, 2024, 7:22 PM
6 points
0 comments2 min readLW link

Ap­ply­ing Force to the Wrong End of a Causal Chain

silentbobJun 22, 2024, 6:06 PM
41 points
0 comments9 min readLW link

Bed Time Quests & Din­ner Games for 3-5 year olds

Jun 22, 2024, 7:53 AM
51 points
0 comments1 min readLW link
(kidquest.substack.com)

Ap­prais­ing ag­grega­tivism and utilitarianism

Cleo NardoJun 21, 2024, 11:10 PM
27 points
10 comments19 min readLW link

Best-of-n with mis­al­igned re­ward mod­els for Math reasoning

Fabien RogerJun 21, 2024, 10:53 PM
25 points
0 comments3 min readLW link

No re­ally, the Sticker Short­cut fal­lacy is in­deed a fallacy

ymeskhoutJun 21, 2024, 10:27 PM
11 points
2 comments5 min readLW link
(www.ymeskhout.com)

Sara­jevo 1914: Black Swan Questions

SebastianG Jun 21, 2024, 9:27 PM
8 points
0 comments2 min readLW link

Yud­kowsky is too op­ti­mistic about how AI will treat hu­mans.

ProfessorFalkenJun 21, 2024, 7:01 PM
0 points
1 comment1 min readLW link

Juneberry Puffs

jefftkJun 21, 2024, 6:50 PM
15 points
0 comments1 min readLW link
(www.jefftk.com)

Let’s De­sign a School, Part 3.2 Costs

SableJun 21, 2024, 5:58 PM
8 points
0 comments5 min readLW link
(affablyevil.substack.com)

2022 AI Align­ment Course: 5→37% work­ing on AI safety

DewiJun 21, 2024, 5:45 PM
7 points
3 comments3 min readLW link

Some Thoughts on AI Align­ment: Us­ing AI to Con­trol AI

eigenvalueJun 21, 2024, 5:44 PM
1 point
1 comment1 min readLW link
(github.com)

What dis­t­in­guishes “early”, “mid” and “end” games?

RaemonJun 21, 2024, 5:41 PM
48 points
22 comments1 min readLW link

Nu­clear War, Map and Ter­ri­tory, Values | Guild of the Rose Newslet­ter, May 2024

moridinamaelJun 21, 2024, 5:39 PM
18 points
0 comments4 min readLW link
(guildoftherose.org)

AI gov­er­nance needs a the­ory of victory

Jun 21, 2024, 4:15 PM
45 points
8 commentsLW link
(www.convergenceanalysis.org)

Con­nect­ing the Dots: LLMs can In­fer & Ver­bal­ize La­tent Struc­ture from Train­ing Data

Jun 21, 2024, 3:54 PM
163 points
13 comments8 min readLW link
(arxiv.org)

On OpenAI’s Model Spec

ZviJun 21, 2024, 1:00 PM
47 points
4 comments30 min readLW link
(thezvi.wordpress.com)

At­ten­tion Out­put SAEs Im­prove Cir­cuit Analysis

Jun 21, 2024, 12:56 PM
33 points
3 comments19 min readLW link

“New­ton’s laws” of finance

pchvykovJun 21, 2024, 9:41 AM
9 points
3 comments10 min readLW link

Cap­i­tal­is­ing On Trust—A Simulation

James Stephen BrownJun 21, 2024, 4:43 AM
2 points
0 comments1 min readLW link
(nonzerosum.games)

″… than av­er­age” is (al­most) meaningless

jwfiredragonJun 21, 2024, 4:42 AM
16 points
6 comments3 min readLW link

The Ker­nel of Mean­ing in Prop­erty Rights

Abhimanyu Pallavi SudhirJun 21, 2024, 1:12 AM
7 points
6 comments2 min readLW link

En­riched tab is now the de­fault LW Front­page ex­pe­rience for logged-in users

Jun 21, 2024, 12:09 AM
46 points
27 comments3 min readLW link

De­bate, Or­a­cles, and Obfus­cated Arguments

Jun 20, 2024, 11:14 PM
44 points
4 comments21 min readLW link

Eva­po­ra­tion of improvements

ViliamJun 20, 2024, 6:34 PM
29 points
27 comments2 min readLW link

In­ter­pret­ing and Steer­ing Fea­tures in Images

Gytis DaujotasJun 20, 2024, 6:33 PM
66 points
6 comments5 min readLW link

Claude 3.5 Sonnet

Zach Stein-PerlmanJun 20, 2024, 6:00 PM
75 points
41 comments1 min readLW link
(www.anthropic.com)

[Question] What is go­ing to hap­pen in a case of an AGI era where hu­mans are out of the game?

CipollaJun 20, 2024, 5:44 PM
−2 points
1 comment1 min readLW link

Jailbreak steer­ing generalization

Jun 20, 2024, 5:25 PM
41 points
4 comments2 min readLW link
(arxiv.org)

Case stud­ies on so­cial-welfare-based stan­dards in var­i­ous industries

HoldenKarnofskyJun 20, 2024, 1:33 PM
42 points
0 commentsLW link

AI #69: Nice

ZviJun 20, 2024, 12:40 PM
65 points
9 comments51 min readLW link
(thezvi.wordpress.com)

Niche product design

Itay DreyfusJun 20, 2024, 6:34 AM
2 points
1 comment3 min readLW link
(productidentity.co)

Data on AI

Jun 20, 2024, 6:31 AM
1 point
0 comments1 min readLW link
(epochai.org)

Ac­tu­ally, Power Plants May Be an AI Train­ing Bot­tle­neck.

Lao MeinJun 20, 2024, 4:41 AM
83 points
13 comments2 min readLW link

Propos­ing the Post-Sin­gu­lar­ity Sym­biotic Researches

Hiroshi YamakawaJun 20, 2024, 4:05 AM
6 points
1 comment12 min readLW link

Week One of Study­ing Trans­form­ers Architecture

JustisMillsJun 20, 2024, 3:47 AM
3 points
0 comments15 min readLW link
(justismills.substack.com)

[Question] What are things you’re al­lowed to do as a startup?

ElizabethJun 20, 2024, 12:01 AM
30 points
9 comments1 min readLW link

LessWrong/​ACX meetup Tran­sil­vanya tour—Alba Iulia

Marius Adrian NicoarăJun 19, 2024, 7:56 PM
1 point
1 comment1 min readLW link

Chronic perfec­tion­ism through the eyes of school reports

Stuart JohnsonJun 19, 2024, 5:46 PM
13 points
3 comments1 min readLW link

Ilya Sutskever cre­ated a new AGI startup

harfeJun 19, 2024, 5:17 PM
95 points
35 comments1 min readLW link
(ssi.inc)

Beyond the Board: Ex­plor­ing AI Ro­bust­ness Through Go

AdamGleaveJun 19, 2024, 4:40 PM
41 points
2 comments1 min readLW link
(far.ai)

A study on cults and non-cults—an­swer ques­tions about a group and get a cult score

spencergJun 19, 2024, 2:30 PM
1 point
8 comments1 min readLW link
(www.guidedtrack.com)

Work­shop: data anal­y­sis for soft­ware engineers

Derek M. JonesJun 19, 2024, 2:20 PM
2 points
0 comments1 min readLW link

FLEXIBLE AND ADAPTABLE LLM’s WITH CONTINUOUS SELF TRAINING

Escaque 66Jun 19, 2024, 2:17 PM
−11 points
0 comments3 min readLW link

Sur­viv­ing Seveneves

Yair HalberstadtJun 19, 2024, 1:11 PM
41 points
4 comments11 min readLW link

Self re­spon­si­bil­ity

EloJun 19, 2024, 10:17 AM
17 points
3 comments2 min readLW link