Taboo Truth

Tomás B.
Jul 8, 2023, 11:23 PM
36 points
16 comments, 2 min read, LW link

“View”

herschel
Jul 8, 2023, 11:19 PM
6 points
0 comments, 2 min read, LW link

[Question] H5N1. Just how bad is the situation?

Q Home
Jul 8, 2023, 10:09 PM
16 points
8 comments, 1 min read, LW link

A Two-Part System for Practical Self-Care

Jonathan Moregård
Jul 8, 2023, 9:23 PM
11 points
0 comments, 3 min read, LW link
(honestliving.substack.com)

Really Strong Features Found in Residual Stream

Logan Riggs
Jul 8, 2023, 7:40 PM
69 points
6 comments, 2 min read, LW link

Eight Strategies for Tackling the Hard Part of the Alignment Problem

scasper
Jul 8, 2023, 6:55 PM
42 points
11 comments, 7 min read, LW link

“Concepts of Agency in Biology” (Okasha, 2023) - Brief Paper Summary

Nora_Ammann
Jul 8, 2023, 6:22 PM
40 points
3 comments, 7 min read, LW link

Blanchard’s Dangerous Idea and the Plight of the Lucid Crossdreamer

Zack_M_Davis
Jul 8, 2023, 6:03 PM
38 points
135 comments, 72 min read, LW link
(unremediatedgender.space)

Continuous Adversarial Quality Assurance: Extending RLHF and Constitutional AI

Benaya Koren
Jul 8, 2023, 5:32 PM
6 points
0 comments, 9 min read, LW link

Commentless downvoting is not a good way to fight infohazards

DirectedEvolution
Jul 8, 2023, 5:29 PM
6 points
9 comments, 3 min read, LW link

[Question] Why does anxiety (?) make me dumb?

TeaTieAndHat
Jul 8, 2023, 4:13 PM
18 points
14 comments, 3 min read, LW link

Economic Time Bomb: An Overlooked Employment Bubble Threatening the US Economy

Glenn Clayton
Jul 8, 2023, 3:19 PM
4 points
10 comments, 6 min read, LW link

What is everyone doing in AI governance

Igor Ivanov
Jul 8, 2023, 3:16 PM
12 points
0 comments, 5 min read, LW link

LLM misalignment can probably be found without manual prompt engineering

ProgramCrafter
Jul 8, 2023, 2:35 PM
1 point
0 comments, 1 min read, LW link

You must not fool yourself, and you are the easiest person to fool

Richard_Ngo
Jul 8, 2023, 2:05 PM
35 points
5 comments, 4 min read, LW link

Fixed Point: a love story

Richard_Ngo
Jul 8, 2023, 1:56 PM
99 points
2 comments, 7 min read, LW link

Announcing AI Alignment workshop at the ALIFE 2023 conference

rorygreig
Jul 8, 2023, 1:52 PM
16 points
0 comments, 1 min read, LW link
(humanvaluesandartificialagency.com)

3D Printed Talkbox Cap

jefftk
Jul 8, 2023, 1:00 PM
9 points
0 comments, 1 min read, LW link
(www.jefftk.com)

Writing this post as rationality case study

Ben Amitay
Jul 8, 2023, 12:24 PM
10 points
8 comments, 2 min read, LW link

[Question] What Does LessWrong/EA Think of Human Intelligence Augmentation as of mid-2023?

lukemarks
Jul 8, 2023, 11:42 AM
84 points
28 comments, 2 min read, LW link

[Question] Request for feedback—infohazards in testing LLMs for causal reasoning?

DirectedEvolution
Jul 8, 2023, 9:01 AM
16 points
0 comments, 2 min read, LW link

Views on when AGI comes and on strategy to reduce existential risk

TsviBT
Jul 8, 2023, 9:00 AM
133 points
61 comments, 14 min read, LW link, 1 review

Weekday Evening Beach Picnics

jefftk
Jul 8, 2023, 2:20 AM
2 points
4 comments, 1 min read, LW link
(www.jefftk.com)

ACI#4: Seed AI is the new Perpetual Motion Machine

Akira Pyinya
Jul 8, 2023, 1:17 AM
−1 points
0 comments, 6 min read, LW link

[Question] Links to discussions on social equilibrium and human value after (aligned) super-AI?

Michael Tontchev
Jul 8, 2023, 1:01 AM
7 points
3 comments, 1 min read, LW link

Notes from the Qatar Center for Global Banking and Finance 3rd Annual Conference

PixelatedPenguin
Jul 7, 2023, 11:48 PM
2 points
0 comments, 1 min read, LW link

Introducing bayescalc.io

Adele Lopez
Jul 7, 2023, 4:11 PM
115 points
29 comments, 1 min read, LW link
(bayescalc.io)

Meetup Tip: Ask Attendees To Explain It

Screwtape
Jul 7, 2023, 4:08 PM
10 points
0 comments, 4 min read, LW link

Interpreting Modular Addition in MLPs

Bart Bussmann
Jul 7, 2023, 9:22 AM
20 points
0 comments, 6 min read, LW link

Internal independent review for language model agent alignment

Seth Herd
Jul 7, 2023, 6:54 AM
55 points
30 comments, 11 min read, LW link

[Question] Can LessWrong provide me with something I find obviously highly useful to my own practical life?

agrippa
Jul 7, 2023, 3:08 AM
32 points
4 comments, 1 min read, LW link

ask me about technology

bhauth
Jul 7, 2023, 2:03 AM
23 points
42 comments, 1 min read, LW link

Apparently, of the 195 Million the DoD allocated in University Research Funding Awards in 2022, more than half of them concerned AI or compute hardware research

mako yass
Jul 7, 2023, 1:20 AM
41 points
5 comments, 2 min read, LW link
(www.defense.gov)

What are the best non-LW places to read on alignment progress?

Raemon
Jul 7, 2023, 12:57 AM
50 points
14 comments, 1 min read, LW link

Two paths to win the AGI transition

Nathan Helm-Burger
Jul 6, 2023, 9:59 PM
11 points
8 comments, 4 min read, LW link

Empirical Evidence Against “The Longest Training Run”

NickGabs
Jul 6, 2023, 6:32 PM
31 points
0 comments, 14 min read, LW link

Progress Studies Fellowship looking for members

jay ram
Jul 6, 2023, 5:41 PM
3 points
0 comments, 1 min read, LW link

BOUNTY AVAILABLE: AI ethicists, what are your object-level arguments against AI notkilleveryoneism?

Peter Berggren
Jul 6, 2023, 5:32 PM
18 points
6 comments, 2 min read, LW link

Layering and Technical Debt in the Global Wayfinding Model

herschel
Jul 6, 2023, 5:30 PM
14 points
0 comments, 3 min read, LW link

Localizing goal misgeneralization in a maze-solving policy network

Jan Betley
Jul 6, 2023, 4:21 PM
37 points
2 comments, 7 min read, LW link

Jesse Hoogland on Developmental Interpretability and Singular Learning Theory

Michaël Trazzi
Jul 6, 2023, 3:46 PM
42 points
2 comments, 4 min read, LW link
(theinsideview.ai)

Progress links and tweets, 2023-07-06: Terraformer Mark One, Israeli water management, & more

jasoncrawford
Jul 6, 2023, 3:35 PM
18 points
4 comments, 2 min read, LW link
(rootsofprogress.org)

Towards Non-Panopticon AI Alignment

Logan Zoellner
Jul 6, 2023, 3:29 PM
7 points
0 comments, 3 min read, LW link

A Defense of Work on Mathematical AI Safety

Davidmanheim
6 Jul 2023 14:15 UTC
28 points
13 comments, 3 min read, LW link
(forum.effectivealtruism.org)

Understanding the two most common mental health problems in the world

spencerg
6 Jul 2023 14:06 UTC
19 points
0 comments, LW link

Announcing the EA Archive

Aaron Bergman
6 Jul 2023 13:49 UTC
13 points
2 comments, LW link

Agency begets agency

Richard_Ngo
6 Jul 2023 13:08 UTC
60 points
1 comment, 4 min read, LW link

AI #19: Hofstadter, Sutskever, Leike

Zvi
6 Jul 2023 12:50 UTC
60 points
16 comments, 40 min read, LW link
(thezvi.wordpress.com)

Do you feel that AGI Alignment could be achieved in a Type 0 civilization?

Super AGI
6 Jul 2023 4:52 UTC
−2 points
1 comment, 1 min read, LW link

Open Thread—July 2023

Ruby
6 Jul 2023 4:50 UTC
11 points
35 comments, 1 min read, LW link