The goal of physics

Jim Pivarski 2 Sep 2023 23:08 UTC
46 points
4 comments · 5 min read · LW link

Will value of paid sex drop right before the end of the world?

azamatvaliev 2 Sep 2023 19:03 UTC
−13 points
0 comments · 4 min read · LW link

PIBBSS Summer Symposium 2023

2 Sep 2023 17:22 UTC
25 points
2 comments · 3 min read · LW link

The smallest possible button (or: moth traps!)

Neil 2 Sep 2023 15:24 UTC
114 points
17 comments · 3 min read · LW link
(neilwarren.substack.com)

Steven Harnad: Symbol grounding and the structure of dictionaries

Bill Benzon 2 Sep 2023 12:28 UTC
5 points
2 comments · 2 min read · LW link

Is Metaethics Unnecessary Given Intent-Aligned AI?

CBiddulph 2 Sep 2023 9:48 UTC
10 points
0 comments · 7 min read · LW link

Rational Agents Cooperate in the Prisoner’s Dilemma

Isaac King 2 Sep 2023 6:15 UTC
17 points
66 comments · 12 min read · LW link

[Linkpost] Large language models converge toward human-like concept organization

Bogdan Ionut Cirstea 2 Sep 2023 6:00 UTC
22 points
1 comment · 1 min read · LW link

Plum Cooking Temperature

jefftk 2 Sep 2023 1:30 UTC
11 points
0 comments · 1 min read · LW link
(www.jefftk.com)

[Question] What did you learn from leaked documents?

wassname 2 Sep 2023 1:28 UTC
10 points
7 comments · 1 min read · LW link

One Minute Every Moment

abramdemski 1 Sep 2023 20:23 UTC
125 points
23 comments · 3 min read · LW link

Tensor Trust: An online game to uncover prompt injection vulnerabilities

1 Sep 2023 19:31 UTC
30 points
0 comments · 5 min read · LW link
(tensortrust.ai)

Reproducing ARC Evals’ recent report on language model agents

Thomas Broadley 1 Sep 2023 16:52 UTC
102 points
17 comments · 3 min read · LW link
(thomasbroadley.com)

[Question] Why aren’t more people in AIS familiar with PDP?

Prometheus 1 Sep 2023 15:27 UTC
4 points
9 comments · 1 min read · LW link

AGI isn’t just a technology

Seth Herd 1 Sep 2023 14:35 UTC
18 points
12 comments · 2 min read · LW link

Can an LLM identify ring-composition in a literary text? [ChatGPT]

Bill Benzon 1 Sep 2023 14:18 UTC
4 points
2 comments · 11 min read · LW link

What is OpenAI’s plan for making AI Safer?

brook 1 Sep 2023 11:15 UTC
6 points
0 comments · 4 min read · LW link
(aisafetyexplained.substack.com)

Progress links digest, 2023-09-01: How ancient people manipulated water, and more

jasoncrawford 1 Sep 2023 4:33 UTC
12 points
4 comments · 6 min read · LW link
(rootsofprogress.org)

A Golden Age of Building? Excerpts and lessons from Empire State, Pentagon, Skunk Works and SpaceX

jacobjacob 1 Sep 2023 4:03 UTC
181 points
23 comments · 24 min read · LW link

[Question] Would AI experts ever agree that AGI systems have attained “consciousness”?

Super AGI 1 Sep 2023 3:57 UTC
−16 points
6 comments · 1 min read · LW link

Meta Questions about Metaphilosophy

Wei Dai 1 Sep 2023 1:17 UTC
148 points
78 comments · 3 min read · LW link

[Linkpost] Michael Nielsen remarks on ‘Oppenheimer’

22tom 31 Aug 2023 15:46 UTC
78 points
7 comments · 2 min read · LW link
(michaelnotebook.com)

My thoughts on AI and personal future plan after learning about AI Safety for 4 months

Ziyue Wang 31 Aug 2023 15:32 UTC
7 points
0 comments · 4 min read · LW link

Which Questions Are Anthropic Questions?

dadadarren 31 Aug 2023 15:15 UTC
16 points
13 comments · 3 min read · LW link

The Tree of Life, and a Note on Job

Bill Benzon 31 Aug 2023 14:03 UTC
13 points
7 comments · 4 min read · LW link

Cleaning a SoundCraft Mixer

jefftk 31 Aug 2023 13:20 UTC
11 points
0 comments · 1 min read · LW link
(www.jefftk.com)

AI #27: Portents of Gemini

Zvi 31 Aug 2023 12:40 UTC
54 points
37 comments · 47 min read · LW link
(thezvi.wordpress.com)

[CANCELLED DUE TO ILLNESS] San Francisco ACX Meetup “First Saturday”

guenael 31 Aug 2023 12:34 UTC
1 point
0 comments · 1 min read · LW link

Long-Term Future Fund Ask Us Anything (September 2023)

31 Aug 2023 0:28 UTC
33 points
6 comments · 1 min read · LW link
(forum.effectivealtruism.org)

Responses to apparent rationalist confusions about game / decision theory

Anthony DiGiovanni 30 Aug 2023 22:02 UTC
140 points
14 comments · 12 min read · LW link

Invulnerable Incomplete Preferences: A Formal Statement

Sami Petersen 30 Aug 2023 21:59 UTC
124 points
32 comments · 35 min read · LW link

Report on Frontier Model Training

YafahEdelman 30 Aug 2023 20:02 UTC
122 points
21 comments · 21 min read · LW link
(docs.google.com)

An adversarial example for Direct Logit Attribution: memory management in gelu-4l

30 Aug 2023 17:36 UTC
17 points
0 comments · 8 min read · LW link
(arxiv.org)

A Letter to the Editor of MIT Technology Review

Jeffs 30 Aug 2023 16:59 UTC
0 points
0 comments · 2 min read · LW link

Biosecurity Culture, Computer Security Culture

jefftk 30 Aug 2023 16:40 UTC
103 points
10 comments · 2 min read · LW link
(www.jefftk.com)

Why I hang out at LessWrong and why you should check-in there every now and then

Bill Benzon 30 Aug 2023 15:20 UTC
16 points
5 comments · 5 min read · LW link

“Wanting” and “liking”

Mateusz Bagiński 30 Aug 2023 14:52 UTC
22 points
2 comments · 29 min read · LW link

Open Call for Research Assistants in Developmental Interpretability

30 Aug 2023 9:02 UTC
54 points
11 comments · 4 min read · LW link

LTFF and EAIF are unusually funding-constrained right now

30 Aug 2023 1:03 UTC
90 points
24 comments · 15 min read · LW link
(forum.effectivealtruism.org)

Paper Walkthrough: Automated Circuit Discovery with Arthur Conmy

Neel Nanda 29 Aug 2023 22:07 UTC
36 points
1 comment · 1 min read · LW link
(www.youtube.com)

An OV-Coherent Toy Model of Attention Head Superposition

29 Aug 2023 19:44 UTC
21 points
0 comments · 6 min read · LW link

The Economics of the Asteroid Deflection Problem (Dominant Assurance Contracts)

moyamo 29 Aug 2023 18:28 UTC
77 points
70 comments · 15 min read · LW link

The Epistemic Authority of Deep Learning Pioneers

Dylan Bowman 29 Aug 2023 18:14 UTC
8 points
2 comments · 3 min read · LW link

Democratic Fine-Tuning

Joe Edelman 29 Aug 2023 18:13 UTC
24 points
2 comments · 1 min read · LW link
(open.substack.com)

Should rationalists (be seen to) win?

Will_Pearson 29 Aug 2023 18:13 UTC
6 points
7 comments · 1 min read · LW link

Frankfurt meetup

sultan 29 Aug 2023 18:10 UTC
2 points
0 comments · 1 min read · LW link

Istanbul meetup

sultan 29 Aug 2023 18:10 UTC
2 points
0 comments · 1 min read · LW link

Broken Benchmark: MMLU

awg 29 Aug 2023 18:09 UTC
23 points
5 comments · 1 min read · LW link
(www.youtube.com)

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities

29 Aug 2023 15:07 UTC
12 points
0 comments · 8 min read · LW link
(newsletter.safe.ai)

Loft Bed Fan Guard

jefftk 29 Aug 2023 13:30 UTC
16 points
3 comments · 1 min read · LW link
(www.jefftk.com)