All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 101112 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] What should we tell an AI if it asks why it was created?

cSkeleton10 Apr 2024 20:37 UTC

1 point

1 comment1 min readLW link

RTFB: On the New Proposed CAIP AI Bill

Zvi10 Apr 2024 18:30 UTC

119 points

14 comments34 min readLW link

(thezvi.wordpress.com)

(Rational) Decision-Making In Wartime

Danylo Zhyrko10 Apr 2024 18:08 UTC

15 points

2 comments5 min readLW link

Thinking harder doesn’t work

Jakob Greenfeld10 Apr 2024 18:00 UTC

−12 points

7 comments6 min readLW link

(jakobgreenfeld.com)

Scaling Laws and Superposition

Pavan Katta10 Apr 2024 15:36 UTC

9 points

4 comments5 min readLW link

(www.pavankatta.com)

Responsible Advanced Artificial Intelligence Act

Anon4210 Apr 2024 14:35 UTC

4 points

0 comments1 min readLW link

(assets.caip.org)

Apply to the Pivotal Research Fellowship (AI Safety & Biosecurity)

Tobias H and tilmanr

10 Apr 2024 12:08 UTC

18 points

0 comments1 min readLW link

Is Consciousness Simulated?

Daniele De Nuntiis10 Apr 2024 9:02 UTC

−1 points

2 comments5 min readLW link

AI DOESN’T NEED TO KILL HUMANITY TO EXIST. IT WILL JUST SEE US IMPLODE. OR NOT. [2024]

BX10 Apr 2024 8:52 UTC

−37 points

0 comments27 min readLW link

How I select alignment research projects

Ethan Perez, Henry Sleight and Mikita Balesni

10 Apr 2024 4:33 UTC

37 points

4 comments24 min readLW link

[Question] How to accelerate recovery from sleep debt with biohacking?

exanova10 Apr 2024 1:27 UTC

10 points

2 comments1 min readLW link

[Question] What are some posthumanist/more-than-human approaches to definitions of intelligence and agency? Particularly in application to AI research.

Eli Hiton9 Apr 2024 21:52 UTC

1 point

0 comments1 min readLW link

Ophiology (or, how the Mamba architecture works)

Danielle Ensign, SrGonao and Adrià Garriga-alonso

9 Apr 2024 19:31 UTC

67 points

10 comments10 min readLW link

Apply to LASR Labs: a London-based technical AI safety research programme

Erin Robertson, Charlie Griffin, joehardie and LASR Labs

9 Apr 2024 17:34 UTC

45 points

1 comment3 min readLW link

“Decentralized Autonomous Education”—Call for Reviewers (Seeds of Science)

rogersbacon9 Apr 2024 14:39 UTC

6 points

0 comments1 min readLW link

D&D.Sci: The Mad Tyrant’s Pet Turtles [Evaluation and Ruleset]

abstractapplic9 Apr 2024 14:01 UTC

48 points

6 comments3 min readLW link

Medical Roundup #2

Zvi9 Apr 2024 13:40 UTC

37 points

18 comments16 min readLW link

(thezvi.wordpress.com)

[Closed] PIBBSS is hiring in a variety of roles (alignment research and incubation program)

Nora_Ammann, Lucas Teixeira and DusanDNesic

9 Apr 2024 8:12 UTC

54 points

0 comments3 min readLW link

Any evidence or reason to expect a multiverse / Everett branches?

lemonhope9 Apr 2024 5:26 UTC

9 points

127 comments1 min readLW link

Fermenting Form

koratkar9 Apr 2024 2:46 UTC

19 points

2 comments4 min readLW link

(careerscouting.substack.com)

[Question] Non-ultimatum game problem

numpyNaN8 Apr 2024 23:25 UTC

9 points

4 comments2 min readLW link

Pandemic Identification Simulator

jefftk8 Apr 2024 19:00 UTC

22 points

0 comments1 min readLW link

(www.jefftk.com)

How We Picture Bayesian Agents

johnswentworth and David Lorell

8 Apr 2024 18:12 UTC

73 points

14 comments7 min readLW link

CEA seeks co-founder for AI safety group support spin-off

agucova8 Apr 2024 15:42 UTC

18 points

0 comments4 min readLW link

Investigating the role of agency in AI x-risk

Corin Katzke8 Apr 2024 15:12 UTC

10 points

0 comments40 min readLW link

(www.convergenceanalysis.org)

Measuring Learned Optimization in Small Transformer Models

J Bostock8 Apr 2024 14:41 UTC

22 points

0 comments11 min readLW link

[Question] Can singularity emerge from transformers?

MP8 Apr 2024 14:26 UTC

3 points

1 comment1 min readLW link

Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition

cmathw, Dennis Akar and Lee Sharkey

8 Apr 2024 11:14 UTC

42 points

4 comments15 min readLW link

Math-to-English Cheat Sheet

nahoj8 Apr 2024 9:19 UTC

54 points

5 comments6 min readLW link

[Question] What does it take to transfer the knowledge to action?

EL_File41388 Apr 2024 6:23 UTC

3 points

7 comments1 min readLW link

Normalizing Sparse Autoencoders

Fengyuan Hu8 Apr 2024 6:17 UTC

22 points

18 comments13 min readLW link

A Dozen Ways to Get More Dakka

Davidmanheim8 Apr 2024 4:45 UTC

143 points

12 comments3 min readLW link 1 review

[Crosspost] Introducing the Hypermanifest: Redefining AI’s Role in Human Connection and Interaction

simulacra.exe7 Apr 2024 17:21 UTC

4 points

0 comments5 min readLW link

Applications Open: Elevate Your Mental Wellbeing with Rethink Wellbeing’s CBT Program

Inga G.7 Apr 2024 14:03 UTC

13 points

2 comments4 min readLW link

The Poker Theory of Poker Night

omark7 Apr 2024 9:47 UTC

29 points

13 comments9 min readLW link

(www.codeandbugs.com)

Centrists are (probably) less biased

Kevin Dorst7 Apr 2024 6:40 UTC

1 point

2 comments5 min readLW link

(kevindorst.substack.com)

on the dollar-yen exchange rate

bhauth7 Apr 2024 4:49 UTC

50 points

21 comments10 min readLW link

(www.bhauth.com)

Conflict in Posthuman Literature

Martín Soto6 Apr 2024 22:26 UTC

42 points

1 comment2 min readLW link

(twitter.com)

“Fractal Strategy” workshop report

Raemon6 Apr 2024 21:26 UTC

68 points

23 comments10 min readLW link

The 2nd Demographic Transition

Maxwell Tabarrok6 Apr 2024 14:10 UTC

68 points

20 comments4 min readLW link

(www.maximum-progress.com)

My intellectual journey to (dis)solve the hard problem of consciousness

Charbel-Raphaël6 Apr 2024 9:32 UTC

48 points

45 comments30 min readLW link

Measuring Predictability of Persona Evaluations

Thee Ho and evhub

6 Apr 2024 8:46 UTC

20 points

0 comments7 min readLW link

Privacy and writing

Neil 6 Apr 2024 8:20 UTC

20 points

1 comment5 min readLW link

[Question] How does the ever-increasing use of AI in the military for the direct purpose of murdering people affect your p(doom)?

Justausername6 Apr 2024 6:31 UTC

19 points

16 comments1 min readLW link

Two tools for rethinking existential risk

Arepo6 Apr 2024 2:55 UTC

2 points

0 comments25 min readLW link

Exploring Whole Brain Emulation

PeterMcCluskey6 Apr 2024 2:38 UTC

13 points

1 comment2 min readLW link

(bayesianinvestor.com)

Koan: divining alien datastructures from RAM activations

TsviBT5 Apr 2024 18:04 UTC

59 points

10 comments21 min readLW link

On the 2nd CWT with Jonathan Haidt

Zvi5 Apr 2024 17:30 UTC

27 points

3 comments33 min readLW link

(thezvi.wordpress.com)

End-to-end hacking with language models

tchauvin5 Apr 2024 15:06 UTC

29 points

0 comments8 min readLW link

Partial value takeover without world takeover

KatjaGrace5 Apr 2024 6:20 UTC

92 points

26 comments3 min readLW link 1 review

(worldspiritsockpuppet.com)