Twin Cities ACX Meetup—February 2024

Timothy M.
Feb 8, 2024, 11:26 PM
1 point
2 comments
1 min read
LW link

A review of “Don’t forget the boundary problem...”

jessicata
Feb 8, 2024, 11:19 PM
12 points
1 comment
12 min read
LW link
(unstablerontology.substack.com)

aintelope project update

Gunnar_Zarncke
Feb 8, 2024, 6:32 PM
24 points
2 comments
3 min read
LW link

Updatelessness doesn’t solve most problems

Martín Soto
Feb 8, 2024, 5:30 PM
135 points
45 comments
12 min read
LW link

Predicting Alignment Award Winners Using ChatGPT 4

Shoshannah Tekofsky
Feb 8, 2024, 2:38 PM
16 points
2 comments
11 min read
LW link

AI #50: The Most Dangerous Thing

Zvi
Feb 8, 2024, 2:30 PM
53 points
4 comments
24 min read
LW link
(thezvi.wordpress.com)

How to develop a photographic memory 3/3

PhilosophicalSoul
Feb 8, 2024, 9:22 AM
6 points
2 comments
18 min read
LW link

Believing In

AnnaSalamon
Feb 8, 2024, 7:06 AM
241 points
51 comments
13 min read
LW link

Measuring pre-peer-review epistemic status

Jakub Smékal
Feb 8, 2024, 5:09 AM
1 point
0 comments
2 min read
LW link

A Chess-GPT Linear Emergent World Representation

Adam Karvonen
Feb 8, 2024, 4:25 AM
105 points
14 comments
7 min read
LW link
(adamkarvonen.github.io)

Domestic Production vs International Wealth Creation

100YearPants
Feb 8, 2024, 4:25 AM
1 point
0 comments
1 min read
LW link

Conditional prediction markets are evidential, not causal

philh
Feb 7, 2024, 9:52 PM
55 points
10 comments
2 min read
LW link

A Back-Of-The-Envelope Calculation On How Unlikely The Circumstantial Evidence Around Covid-19 Is

Roko
Feb 7, 2024, 9:49 PM
−1 points
36 comments
5 min read
LW link

Nitric oxide for covid and other viral infections

Elizabeth
Feb 7, 2024, 9:30 PM
39 points
6 comments
6 min read
LW link
(acesounderglass.com)

Debating with More Persuasive LLMs Leads to More Truthful Answers

Feb 7, 2024, 9:28 PM
89 points
14 comments
9 min read
LW link
(arxiv.org)

[Question] Choosing a book on causality

martinkunev
Feb 7, 2024, 9:16 PM
4 points
3 comments
1 min read
LW link

More Hyphenation

Arjun Panickssery
Feb 7, 2024, 7:43 PM
88 points
19 comments
1 min read
LW link
(arjunpanickssery.substack.com)

Reading writing advice doesn’t make writing easier

Henry Sleight
Feb 7, 2024, 7:14 PM
17 points
0 comments
5 min read
LW link
(open.substack.com)

[Question] What’s this 3rd secret directive of evolution called? (survive & spread & ___)

lemonhope
Feb 7, 2024, 2:11 PM
10 points
11 comments
1 min read
LW link

Training of superintelligence is secretly adversarial

quetzal_rainbow
Feb 7, 2024, 1:38 PM
15 points
2 comments
5 min read
LW link

The Math of Suspicious Coincidences

Roko
Feb 7, 2024, 1:32 PM
25 points
3 comments
4 min read
LW link

[Question] How to deal with the sense of demotivation that comes from thinking about determinism?

SpectrumDT
Feb 7, 2024, 10:53 AM
13 points
71 comments
1 min read
LW link

Quantum Darwinism, social constructs, and the scientific method

pchvykov
Feb 7, 2024, 7:04 AM
6 points
12 comments
9 min read
LW link

Why I think it’s net harmful to do technical safety research at AGI labs

Remmelt
Feb 7, 2024, 4:17 AM
26 points
24 comments
1 min read
LW link

story-based decision-making

bhauth
Feb 7, 2024, 2:35 AM
90 points
11 comments
4 min read
LW link

Full Driving Engagement Optional

jefftk
Feb 7, 2024, 2:30 AM
14 points
0 comments
1 min read
LW link
(www.jefftk.com)

How to train your own “Sleeper Agents”

evhub
Feb 7, 2024, 12:31 AM
92 points
11 comments
2 min read
LW link

My guess at Conjecture’s vision: triggering a narrative bifurcation

Alexandre Variengien
Feb 6, 2024, 7:10 PM
75 points
12 comments
16 min read
LW link

Arrogance and People Pleasing

Jonathan Moregård
Feb 6, 2024, 6:43 PM
26 points
7 comments
4 min read
LW link
(honestliving.substack.com)

What does davidad want from «boundaries»?

Feb 6, 2024, 5:45 PM
47 points
1 comment
5 min read
LW link

[Question] How can I efficiently read all the Dath Ilan worldbuilding?

mike_hawke
Feb 6, 2024, 4:52 PM
10 points
1 comment
1 min read
LW link

Preventing model exfiltration with upload limits

ryan_greenblatt
Feb 6, 2024, 4:29 PM
71 points
22 comments
14 min read
LW link

Evolution is an observation, not a process

Neil
Feb 6, 2024, 2:49 PM
8 points
11 comments
5 min read
LW link

[Question] Why do we need an understanding of the real world to predict the next tokens in a body of text?

Valentin Baltadzhiev
Feb 6, 2024, 2:43 PM
2 points
12 comments
1 min read
LW link

On the Debate Between Jezos and Leahy

Zvi
Feb 6, 2024, 2:40 PM
64 points
6 comments
63 min read
LW link
(thezvi.wordpress.com)

Why Two Valid Answers Approach is not Enough for Sleeping Beauty

Ape in the coat
Feb 6, 2024, 2:21 PM
6 points
12 comments
6 min read
LW link

Are most personality disorders really trust disorders?

chaosmage
Feb 6, 2024, 12:37 PM
20 points
4 comments
1 min read
LW link

From Conceptual Spaces to Quantum Concepts: Formalising and Learning Structured Conceptual Models

Roman Leventov
Feb 6, 2024, 10:18 AM
8 points
1 comment
4 min read
LW link
(arxiv.org)

Fluent dreaming for language models (AI interpretability method)

Feb 6, 2024, 6:02 AM
46 points
5 comments
1 min read
LW link
(arxiv.org)

Selfish AI Inevitable

Davey Morse
Feb 6, 2024, 4:29 AM
1 point
0 comments
1 min read
LW link

Toy models of AI control for concentrated catastrophe prevention

Feb 6, 2024, 1:38 AM
51 points
2 comments
7 min read
LW link

Things You’re Allowed to Do: University Edition

Saul Munn
Feb 6, 2024, 12:36 AM
97 points
13 comments
5 min read
LW link
(www.brasstacks.blog)

Value learning in the absence of ground truth

Joel_Saarinen
Feb 5, 2024, 6:56 PM
47 points
8 comments
45 min read
LW link

Implementing activation steering

Annah
Feb 5, 2024, 5:51 PM
75 points
8 comments
7 min read
LW link

AI alignment as a translation problem

Roman Leventov
Feb 5, 2024, 2:14 PM
22 points
2 comments
3 min read
LW link

Safe Stasis Fallacy

Davidmanheim
Feb 5, 2024, 10:54 AM
54 points
2 comments
LW link

[Question] How has internalising a post-AGI world affected your current choices?

yanni kyriacos
Feb 5, 2024, 5:43 AM
10 points
8 comments
1 min read
LW link

A thought experiment for comparing “biological” vs “digital” intelligence increase/explosion

Super AGI
Feb 5, 2024, 4:57 AM
6 points
3 comments
1 min read
LW link

Noticing Panic

Cole Wyeth
Feb 5, 2024, 3:45 AM
59 points
8 comments
3 min read
LW link

EA/ACX/LW February Santa Cruz Meetup

madmail
Feb 4, 2024, 11:26 PM
1 point
0 comments
1 min read
LW link