Just be­cause an LLM said it doesn’t mean it’s true: an illus­tra­tive example

dirkAug 21, 2024, 9:05 PM
26 points
12 comments3 min readLW link

[Question] How do you finish your tasks faster?

CipollaAug 21, 2024, 8:01 PM
4 points
2 comments1 min readLW link

AI Safety Newslet­ter #40: Cal­ifor­nia AI Leg­is­la­tion Plus, NVIDIA De­lays Chip Pro­duc­tion, and Do AI Safety Bench­marks Ac­tu­ally Mea­sure Safety?

Aug 21, 2024, 6:09 PM
11 points
0 comments6 min readLW link
(newsletter.safe.ai)

[Question] Should LW sug­gest stan­dard metaprompts?

DagonAug 21, 2024, 4:41 PM
3 points
6 comments1 min readLW link

Eter­nal Ex­is­tence and Eter­nal Bore­dom: The Case for AI and Im­mor­tal Humans

Tuan Tu NguyenAug 21, 2024, 9:58 AM
−12 points
2 comments5 min readLW link

Please do not use AI to write for you

Richard_KennawayAug 21, 2024, 9:53 AM
69 points
34 comments4 min readLW link

Ap­ply to Aether—In­de­pen­dent LLM Agent Safety Re­search Group

RohanSAug 21, 2024, 9:47 AM
10 points
0 comments7 min readLW link
(forum.effectivealtruism.org)

the Giga Press was a mistake

bhauthAug 21, 2024, 4:51 AM
99 points
26 comments5 min readLW link
(bhauth.com)

Ex­plor­ing the Boundaries of Cog­ni­to­haz­ards and the Na­ture of Reality

Victor NovikovAug 21, 2024, 3:42 AM
−2 points
2 comments1 min readLW link

[Question] What is the point of 2v2 de­bates?

Axel AhlqvistAug 20, 2024, 9:59 PM
2 points
1 comment1 min readLW link

[Question] Where should I look for in­for­ma­tion on gut health?

FinalFormal2Aug 20, 2024, 7:44 PM
10 points
10 comments1 min readLW link

Would you benefit from, or ob­ject to, a page with LW users’ re­acts?

RaemonAug 20, 2024, 4:35 PM
23 points
6 comments1 min readLW link

Free­dom of Speech

Zero ContradictionsAug 20, 2024, 4:34 PM
−13 points
2 comments2 min readLW link
(thewaywardaxolotl.blogspot.com)

AGI Safety and Align­ment at Google Deep­Mind: A Sum­mary of Re­cent Work

Aug 20, 2024, 4:22 PM
222 points
33 comments9 min readLW link

Try­ing to be ra­tio­nal for the wrong reasons

ViliamAug 20, 2024, 4:18 PM
26 points
9 comments3 min readLW link

[Question] How great is the util­ity of “sav­ing” en­dan­gered lan­guages?

SpectrumDTAug 20, 2024, 1:14 PM
18 points
29 comments1 min readLW link

Guide to SB 1047

ZviAug 20, 2024, 1:10 PM
71 points
18 comments53 min readLW link
(thezvi.wordpress.com)

Find­ing De­cep­tion in Lan­guage Models

Aug 20, 2024, 9:42 AM
20 points
4 comments4 min readLW link

Next au­to­mated rea­son­ing grand challenge: CompCert

sanxiynAug 20, 2024, 5:27 AM
−5 points
0 comments1 min readLW link

Thiel on AI & Rac­ing with China

Ben PaceAug 20, 2024, 3:19 AM
55 points
10 comments12 min readLW link

Reflect­ing on the tran­shu­man­ist re­but­tal to AI ex­is­ten­tial risk and cri­tique of our de­bate method­olo­gies and mi­suse of statistics

catgirlsruletheworldAug 20, 2024, 1:59 AM
−5 points
0 comments4 min readLW link

Ar­tifi­cial In­tel­li­gence and Eter­nal Tor­ture and Suffering

Tuan Tu NguyenAug 20, 2024, 1:53 AM
−1 points
0 comments4 min readLW link

AI #77: A Few Upgrades

ZviAug 20, 2024, 12:20 AM
23 points
3 comments52 min readLW link
(thezvi.wordpress.com)

Monthly Roundup #21: Au­gust 2024

ZviAug 20, 2024, 12:20 AM
22 points
6 comments40 min readLW link
(thezvi.wordpress.com)

[Linkpost] Au­to­mated De­sign of Agen­tic Systems

Bogdan Ionut CirsteaAug 19, 2024, 11:06 PM
8 points
1 comment1 min readLW link
(arxiv.org)

Limi­ta­tions on For­mal Ver­ifi­ca­tion for AI Safety

Andrew DicksonAug 19, 2024, 11:03 PM
134 points
60 comments23 min readLW link

The Con­scious River: Con­scious Tur­ing ma­chines negate ma­te­ri­al­ism

blalloAug 19, 2024, 9:54 PM
0 points
4 comments7 min readLW link

LLM Ap­pli­ca­tions I Want To See

sarahconstantinAug 19, 2024, 9:10 PM
102 points
6 comments8 min readLW link
(sarahconstantin.substack.com)

Defin­ing al­ign­ment research

Richard_NgoAug 19, 2024, 8:42 PM
92 points
23 comments7 min readLW link

Vilnius – ACX Mee­tups Every­where Fall 2024

Aug 19, 2024, 5:38 PM
3 points
1 comment1 min readLW link

Can Cur­rent LLMs be Trusted To Pro­duce Paper­clips Safely?

Rohit ChatterjeeAug 19, 2024, 5:17 PM
4 points
0 comments9 min readLW link

A primer on why com­pu­ta­tional pre­dic­tive tox­i­col­ogy is hard

Abhishaike MahajanAug 19, 2024, 5:16 PM
63 points
2 comments12 min readLW link
(www.owlposting.com)

In­tro­duc­tion and Ex­plo­ra­tion of AI Ethics Through a Global Lens

ThePathYouWillChooseAug 19, 2024, 5:11 PM
1 point
0 comments1 min readLW link

Trust­wor­thy and un­trust­wor­thy models

Olli JärviniemiAug 19, 2024, 4:27 PM
47 points
3 comments8 min readLW link

Apart­ment Price Map Discontinuity

jefftkAug 19, 2024, 3:30 PM
12 points
0 comments1 min readLW link
(www.jefftk.com)

Will we ever run out of new jobs?

Kevin KohlerAug 19, 2024, 3:04 PM
17 points
7 comments7 min readLW link
(machinocene.substack.com)

[Question] What are the best re­sources for build­ing gears-level mod­els of how gov­ern­ments ac­tu­ally work?

adamShimiAug 19, 2024, 2:05 PM
19 points
6 comments1 min readLW link

[Cross-post] Book Re­view: Bureau­cracy, by James Q Wilson

davekastenAug 19, 2024, 1:57 PM
12 points
0 comments7 min readLW link

[Question] If AI is in a bub­ble and the bub­ble bursts, what would you do?

RemmeltAug 19, 2024, 10:56 AM
12 points
18 comments1 min readLW link

Think­ing About Propen­sity Evaluations

Aug 19, 2024, 9:23 AM
10 points
0 comments27 min readLW link

A Tax­on­omy Of AI Sys­tem Evaluations

Aug 19, 2024, 9:07 AM
13 points
0 comments14 min readLW link

Be­ware the sci­ence fic­tion bias in pre­dic­tions of the future

nsokolskyAug 19, 2024, 5:32 AM
25 points
20 comments4 min readLW link
(nsokolsky.substack.com)

In­ter­dic­tor Ship

lsusrAug 19, 2024, 4:59 AM
63 points
9 comments7 min readLW link

Why you should be us­ing a retinoid

GeneSmithAug 19, 2024, 3:07 AM
98 points
60 comments5 min readLW link

Li­a­bil­ity regimes for AI

Ege ErdilAug 19, 2024, 1:25 AM
153 points
34 comments5 min readLW link

Some­thing Is Lost When AI Makes Art

utilistrutilAug 18, 2024, 10:53 PM
17 points
1 comment10 min readLW link

Scal­ing Laws and Likely Limits to AI

DavidmanheimAug 18, 2024, 5:19 PM
19 points
0 commentsLW link

What is “True Love”?

johnswentworthAug 18, 2024, 4:05 PM
72 points
11 comments1 min readLW link

Quick look: ap­pli­ca­tions of chaos theory

Aug 18, 2024, 3:00 PM
79 points
51 comments8 min readLW link
(acesounderglass.com)

Restruc­tur­ing Pop Songs for Contra

jefftkAug 18, 2024, 2:10 PM
11 points
0 comments2 min readLW link
(www.jefftk.com)