A very non-tech­ni­cal ex­pla­na­tion of the ba­sics of in­fra-Bayesianism

matolcsid26 Apr 2023 22:57 UTC
62 points
9 comments9 min readLW link

LM Si­tu­a­tional Aware­ness, Eval­u­a­tion Pro­posal: Vio­lat­ing Imitation

Jacob Pfau26 Apr 2023 22:53 UTC
13 points
2 comments2 min readLW link

Re­cent Database Mi­gra­tion—Re­port Bugs

RobertM26 Apr 2023 22:19 UTC
38 points
2 comments1 min readLW link

In­fra-Bayesi­anism nat­u­rally leads to the mono­ton­ic­ity prin­ci­ple, and I think this is a problem

matolcsid26 Apr 2023 21:39 UTC
17 points
6 comments4 min readLW link

Un­der­stand­ing new terms via etymology

corruptedCatapillar26 Apr 2023 20:48 UTC
4 points
1 comment2 min readLW link
(forum.effectivealtruism.org)

Chad Jones pa­per mod­el­ing AI and x-risk vs. growth

jasoncrawford26 Apr 2023 20:07 UTC
39 points
7 comments2 min readLW link
(web.stanford.edu)

I was Wrong, Si­mu­la­tor The­ory is Real

Robert_AIZI26 Apr 2023 17:45 UTC
75 points
7 comments3 min readLW link
(aizi.substack.com)

$250 prize for check­ing Jake Can­nell’s Brain Efficiency

Alexander Gietelink Oldenziel26 Apr 2023 16:21 UTC
123 points
170 comments2 min readLW link

My ver­sion of Si­mu­lacra Levels

Daniel Kokotajlo26 Apr 2023 15:50 UTC
41 points
14 comments3 min readLW link

[Question] Is the fact that we don’t ob­serve any ob­vi­ous glitch ev­i­dence that we’re not in a simu­la­tion?

Jim Buhler26 Apr 2023 14:57 UTC
8 points
16 comments1 min readLW link

Tran­script and Brief Re­sponse to Twit­ter Con­ver­sa­tion be­tween Yann LeCunn and Eliezer Yudkowsky

Zvi26 Apr 2023 13:10 UTC
187 points
50 comments10 min readLW link
(thezvi.wordpress.com)

What comes af­ter?

rogersbacon26 Apr 2023 12:44 UTC
2 points
0 comments2 min readLW link
(www.secretorum.life)

Ac­ci­den­tal Terraforming

Sable26 Apr 2023 6:49 UTC
9 points
16 comments5 min readLW link
(affablyevil.substack.com)

Philos­o­phy by Paul Gra­ham Link

EniScien26 Apr 2023 5:36 UTC
21 points
4 comments1 min readLW link

Box­ing at the gym

yakimoff26 Apr 2023 5:10 UTC
1 point
0 comments1 min readLW link

Si­be­lius + drinks

yakimoff26 Apr 2023 5:08 UTC
1 point
0 comments1 min readLW link

A sim­ple pre­sen­ta­tion of AI risk arguments

Seth Herd26 Apr 2023 2:19 UTC
16 points
0 comments2 min readLW link

Archety­pal Trans­fer Learn­ing: a Pro­posed Align­ment Solu­tion that solves the In­ner & Outer Align­ment Prob­lem while adding Cor­rigible Traits to GPT-2-medium

MiguelDev26 Apr 2023 1:37 UTC
14 points
5 comments10 min readLW link

[Question] How Many Bits Of Op­ti­miza­tion Can One Bit Of Ob­ser­va­tion Un­lock?

johnswentworth26 Apr 2023 0:26 UTC
61 points
32 comments3 min readLW link

Believe in Your­self and don’t stop Improving

Johannes C. Mayer25 Apr 2023 22:34 UTC
0 points
0 comments1 min readLW link

Should LW have an offi­cial list of norms?

Ruby25 Apr 2023 21:20 UTC
57 points
31 comments5 min readLW link

Im­ple­ment­ing a Trans­former from scratch in PyTorch—a write-up on my experience

Mislav Jurić25 Apr 2023 20:51 UTC
20 points
0 comments10 min readLW link

Ex­plor­ing the Lot­tery Ticket Hypothesis

Rauno Arike25 Apr 2023 20:06 UTC
50 points
3 comments11 min readLW link

Ge­netic Se­quenc­ing of Wastew­a­ter: Prevalence to Rel­a­tive Abundance

jefftk25 Apr 2023 19:30 UTC
17 points
2 comments2 min readLW link
(www.jefftk.com)

[Feed­back please] New User’s Guide to LessWrong

Ruby25 Apr 2023 18:54 UTC
38 points
18 comments6 min readLW link

Refram­ing the bur­den of proof: Com­pa­nies should prove that mod­els are safe (rather than ex­pect­ing au­di­tors to prove that mod­els are dan­ger­ous)

Akash25 Apr 2023 18:49 UTC
27 points
11 comments3 min readLW link
(childrenoficarus.substack.com)

LLMs for on­line dis­cus­sion moderation

Dave Lindbergh25 Apr 2023 16:53 UTC
12 points
3 comments3 min readLW link

AI Safety Newslet­ter #3: AI policy pro­pos­als and a new challenger approaches

ozhang25 Apr 2023 16:15 UTC
33 points
0 comments1 min readLW link

EA might sys­tem­at­i­cally gen­er­ate a scarcity mind­set that pro­duces low-in­tegrity actors

Severin T. Seehrich25 Apr 2023 15:50 UTC
26 points
2 comments1 min readLW link

Max Teg­mark’s new Time ar­ti­cle on how we’re in a Don’t Look Up sce­nario [Linkpost]

Jonas Hallgren25 Apr 2023 15:41 UTC
39 points
9 comments1 min readLW link
(time.com)

WHO Biolog­i­cal Risk warning

Jonas Kgomo25 Apr 2023 15:10 UTC
−6 points
2 comments1 min readLW link

A Rant on Calcu­lus III

Wofsen25 Apr 2023 14:51 UTC
−5 points
2 comments1 min readLW link

Briefly how I’ve up­dated since ChatGPT

rime25 Apr 2023 14:47 UTC
48 points
2 comments2 min readLW link

Dis­cuss AI Policy Recommendations

Giles25 Apr 2023 14:21 UTC
8 points
0 comments1 min readLW link

Ex­plain­ing the Trans­former Cir­cuits Frame­work by Example

Felix Hofstätter25 Apr 2023 13:45 UTC
8 points
0 comments15 min readLW link

Notes on Po­ten­tial Fu­ture AI Tax Policy

Zvi25 Apr 2023 13:30 UTC
33 points
5 comments9 min readLW link
(thezvi.wordpress.com)

Sen­tience in Sili­con: The Challenges of AI Consciousness

Hannes Thurnherr25 Apr 2023 13:15 UTC
5 points
2 comments5 min readLW link

Paths to failure

25 Apr 2023 8:03 UTC
29 points
1 comment8 min readLW link

My Assess­ment of the Chi­nese AI Safety Community

Lao Mein25 Apr 2023 4:21 UTC
245 points
94 comments3 min readLW link

Mak­ing Nanobots isn’t a one-shot pro­cess, even for an ar­tifi­cial superintelligance

dankrad25 Apr 2023 0:39 UTC
20 points
13 comments6 min readLW link

Men­tal Models Of Peo­ple Can Be People

Nox ML25 Apr 2023 0:03 UTC
12 points
55 comments8 min readLW link

Progress links and tweets, 2023-04-24

jasoncrawford24 Apr 2023 21:17 UTC
16 points
1 comment2 min readLW link
(rootsofprogress.org)

Ideas for AI labs: Read­ing list

Zach Stein-Perlman24 Apr 2023 19:00 UTC
11 points
0 comments4 min readLW link

Deep learn­ing mod­els might be se­cretly (al­most) linear

beren24 Apr 2023 18:43 UTC
110 points
28 comments4 min readLW link

Sub­jec­tive AI/​ML Digest: April II

Boris T24 Apr 2023 18:33 UTC
1 point
0 comments1 min readLW link
(borisagain.substack.com)

The Tox­o­plasma of AGI Doom and Ca­pa­bil­ities?

Robert_AIZI24 Apr 2023 18:11 UTC
68 points
12 comments1 min readLW link

[Question] Mea­sures of In­ter­net Viral­ity and News Popularity

Fer32dwt34r3dfsz24 Apr 2023 17:43 UTC
4 points
4 comments1 min readLW link

A con­cise sum-up of the ba­sic ar­gu­ment for AI doom

Mergimio H. Doefevmil24 Apr 2023 17:37 UTC
11 points
6 comments2 min readLW link

A re­sponse to Con­jec­ture’s CoEm proposal

Kristian Freed24 Apr 2023 17:23 UTC
7 points
0 comments4 min readLW link

Ca­ma­raderie at scale: in search of shared identity

eq24 Apr 2023 16:46 UTC
8 points
2 comments8 min readLW link