Approval-Seeking ⇒ Playful Evaluation

Jonathan Moregård · Aug 28, 2024, 9:03 PM
8 points
0 comments · 2 min read · LW link
(honestliving.substack.com)

Why do firms choose to be inefficient?

Nicholas D. · Aug 28, 2024, 6:39 PM
9 points
4 comments · 2 min read · LW link
(nicholasdecker.substack.com)

Benefits of Psyllium Dietary Fiber in Particular

Brendan Long · Aug 28, 2024, 6:13 PM
14 points
9 comments · 2 min read · LW link
(www.brendanlong.com)

[Question] “Deception Genre” What Books are like Project Lawful?

Double · Aug 28, 2024, 5:19 PM
45 points
20 comments · 1 min read · LW link

Looking for Goal Representations in an RL Agent—Update Post

CatGoddess · Aug 28, 2024, 4:42 PM
19 points
0 comments · 7 min read · LW link

[Question] things that confuse me about the current AI market.

DMMF · Aug 28, 2024, 1:46 PM
156 points
27 comments · 2 min read · LW link

Universal dimensions of visual representation

Bogdan Ionut Cirstea · Aug 28, 2024, 10:38 AM
11 points
0 comments · 1 min read · LW link
(arxiv.org)

Leverage points for a pause

Remmelt · Aug 28, 2024, 9:21 AM
3 points
0 comments · 1 min read · LW link

Deception and Jailbreak Sequence: 2. Iterative Refinement Stages of Jailbreaks in LLM

Winnie Yang · Aug 28, 2024, 8:41 AM
7 points
2 comments · 31 min read · LW link

How to hire somebody better than yourself

lemonhope · Aug 28, 2024, 8:12 AM
46 points
5 comments · 5 min read · LW link

On agentic generalist models: we’re essentially using existing technology the weakest and worst way you can use it

Yuli_Ban · Aug 28, 2024, 1:57 AM
10 points
2 comments · 9 min read · LW link

[Question] Am I confused about the “malign universal prior” argument?

nostalgebraist · Aug 27, 2024, 11:17 PM
95 points
35 comments · 8 min read · LW link

The Information: OpenAI shows ‘Strawberry’ to feds, races to launch it

Martín Soto · Aug 27, 2024, 11:10 PM
145 points
15 comments · 3 min read · LW link

SB 1047: Final Takes and Also AB 3211

Zvi · Aug 27, 2024, 10:10 PM
92 points
11 comments · 21 min read · LW link
(thezvi.wordpress.com)

LessWrong email subscriptions?

Raemon · Aug 27, 2024, 9:59 PM
26 points
6 comments · 1 min read · LW link

GPT-3.5 judges can supervise GPT-4o debaters in capability asymmetric debates

Aug 27, 2024, 8:44 PM
23 points
7 comments · 4 min read · LW link

Why Large Bureaucratic Organizations?

johnswentworth · Aug 27, 2024, 6:30 PM
68 points
52 comments · 12 min read · LW link

In defense of technological unemployment as the main AI concern

tailcalled · Aug 27, 2024, 5:58 PM
44 points
36 comments · 1 min read · LW link

[Question] I’m doing Yolov8 model training but the accuracy rate is 70%

Sezer Karataş · Aug 27, 2024, 5:53 PM
−14 points
0 comments · 1 min read · LW link

What Depression Is Like

Sable · Aug 27, 2024, 5:43 PM
88 points
24 comments · 4 min read · LW link
(affablyevil.substack.com)

Unit economics of LLM APIs

Aug 27, 2024, 4:51 PM
43 points
0 comments · 2 min read · LW link

Soft Nationalization: how the USG will control AI labs

Aug 27, 2024, 3:11 PM
76 points
7 comments · 21 min read · LW link
(www.convergenceanalysis.org)

[Question] On Nothing

Hudjefa · Aug 27, 2024, 6:50 AM
−14 points
12 comments · 1 min read · LW link

“Real summer”?

duck_master · Aug 26, 2024, 10:11 PM
2 points
0 comments · 1 min read · LW link

Metaculus’s ‘Minitaculus’ Experiments — Collaborate With Us

ChristianWilliams · Aug 26, 2024, 8:44 PM
7 points
0 comments · LW link
(www.metaculus.com)

My Apartment Art Commission Process

jenn · Aug 26, 2024, 6:36 PM
34 points
4 comments · 7 min read · LW link
(jenn.site)

My (current) model of what an AI governance researcher does

Johan de Kock · Aug 26, 2024, 5:58 PM
1 point
2 comments · 5 min read · LW link

Would catching your AIs trying to escape convince AI developers to slow down or undeploy?

Buck · Aug 26, 2024, 4:46 PM
316 points
77 comments · 4 min read · LW link

… Wait, our models of semantics should inform fluid mechanics?!?

Aug 26, 2024, 4:38 PM
59 points
18 comments · 4 min read · LW link

Day Zero Antivirals for Future Pandemics

Niko_McCarty · Aug 26, 2024, 3:18 PM
22 points
2 comments · 10 min read · LW link
(www.asimov.press)

Molecular dynamics data will be essential for the next generation of ML protein models

Abhishaike Mahajan · Aug 26, 2024, 2:50 PM
9 points
0 comments · 11 min read · LW link
(www.owlposting.com)

My lukewarm take on GLP-1 agonists

George3d6 · Aug 26, 2024, 12:34 PM
16 points
0 comments · 1 min read · LW link
(cerebralab.com)

Interview with Robert Kralisch on Simulators

WillPetillo · Aug 26, 2024, 5:49 AM
17 points
0 comments · 75 min read · LW link

One person’s worth of mental energy for AI doom aversion jobs. What should I do?

Lorec · Aug 26, 2024, 1:29 AM
9 points
17 comments · 1 min read · LW link

Secular interpretations of core perennialist claims

zhukeepa · Aug 25, 2024, 11:41 PM
83 points
32 comments · 14 min read · LW link

Darwinian Traps and Existential Risks

KristianRonn · Aug 25, 2024, 10:37 PM
85 points
14 comments · 10 min read · LW link

DIY LessWrong Jewelry

Fluffnutt · Aug 25, 2024, 9:33 PM
33 points
0 comments · 1 min read · LW link

Meta: On viewing the latest LW posts

quiet_NaN · Aug 25, 2024, 7:31 PM
5 points
2 comments · 1 min read · LW link

you should probably eat oatmeal sometimes

bhauth · Aug 25, 2024, 2:50 PM
41 points
32 comments · 3 min read · LW link
(bhauth.com)

Referendum Mechanics in a Marketplace of Ideas

Martin Sustrik · Aug 25, 2024, 8:30 AM
57 points
2 comments · 5 min read · LW link
(250bpm.substack.com)

Please stop using mediocre AI art in your posts

Raemon · Aug 25, 2024, 12:13 AM
115 points
24 comments · 2 min read · LW link

AXRP Episode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization

DanielFilan · Aug 24, 2024, 10:30 PM
21 points
0 comments · 74 min read · LW link

The top 30 books to expand the capabilities of AI: a biased reading list

Jonathan Mugan · Aug 24, 2024, 9:48 PM
−6 points
0 comments · 16 min read · LW link

The Ap Distribution

criticalpoints · Aug 24, 2024, 9:45 PM
22 points
8 comments · 3 min read · LW link
(eregis.github.io)

What is it to solve the alignment problem? (Notes)

Joe Carlsmith · Aug 24, 2024, 9:19 PM
69 points
18 comments · 53 min read · LW link

Examine self modification as an intuition provider for the concept of consciousness

Canaletto · Aug 24, 2024, 8:48 PM
−4 points
2 comments · 10 min read · LW link

[Question] Looking to interview AI Safety researchers for a book

jeffreycaruso · Aug 24, 2024, 7:57 PM
14 points
0 comments · 1 min read · LW link

Perplexity wins my AI race

Elizabeth · Aug 24, 2024, 7:20 PM
107 points
12 comments · 10 min read · LW link
(acesounderglass.com)

Why should anyone boot *you* up?

onur · Aug 24, 2024, 5:51 PM
−1 points
5 comments · 3 min read · LW link
(solmaz.io)

Understanding Hidden Computations in Chain-of-Thought Reasoning

rokosbasilisk · Aug 24, 2024, 4:35 PM
6 points
1 comment · 1 min read · LW link