All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] Non-ultimatum game problem

numpyNaN8 Apr 2024 23:25 UTC

9 points

4 comments2 min readLW link

Pandemic Identification Simulator

jefftk8 Apr 2024 19:00 UTC

22 points

0 comments1 min readLW link

(www.jefftk.com)

How We Picture Bayesian Agents

johnswentworth and David Lorell

8 Apr 2024 18:12 UTC

73 points

14 comments7 min readLW link

CEA seeks co-founder for AI safety group support spin-off

agucova8 Apr 2024 15:42 UTC

18 points

0 comments4 min readLW link

Investigating the role of agency in AI x-risk

Corin Katzke8 Apr 2024 15:12 UTC

10 points

0 comments40 min readLW link

(www.convergenceanalysis.org)

Measuring Learned Optimization in Small Transformer Models

J Bostock8 Apr 2024 14:41 UTC

22 points

0 comments11 min readLW link

[Question] Can singularity emerge from transformers?

MP8 Apr 2024 14:26 UTC

3 points

1 comment1 min readLW link

Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition

cmathw, Dennis Akar and Lee Sharkey

8 Apr 2024 11:14 UTC

42 points

4 comments15 min readLW link

Math-to-English Cheat Sheet

nahoj8 Apr 2024 9:19 UTC

54 points

5 comments6 min readLW link

[Question] What does it take to transfer the knowledge to action?

EL_File41388 Apr 2024 6:23 UTC

3 points

7 comments1 min readLW link

Normalizing Sparse Autoencoders

Fengyuan Hu8 Apr 2024 6:17 UTC

22 points

18 comments13 min readLW link

A Dozen Ways to Get More Dakka

Davidmanheim8 Apr 2024 4:45 UTC

143 points

12 comments3 min readLW link 1 review

[Crosspost] Introducing the Hypermanifest: Redefining AI’s Role in Human Connection and Interaction

simulacra.exe7 Apr 2024 17:21 UTC

4 points

0 comments5 min readLW link

Applications Open: Elevate Your Mental Wellbeing with Rethink Wellbeing’s CBT Program

Inga G.7 Apr 2024 14:03 UTC

13 points

2 comments4 min readLW link

The Poker Theory of Poker Night

omark7 Apr 2024 9:47 UTC

29 points

13 comments9 min readLW link

(www.codeandbugs.com)

Centrists are (probably) less biased

Kevin Dorst7 Apr 2024 6:40 UTC

1 point

2 comments5 min readLW link

(kevindorst.substack.com)

on the dollar-yen exchange rate

bhauth7 Apr 2024 4:49 UTC

50 points

21 comments10 min readLW link

(www.bhauth.com)

Conflict in Posthuman Literature

Martín Soto6 Apr 2024 22:26 UTC

42 points

1 comment2 min readLW link

(twitter.com)

“Fractal Strategy” workshop report

Raemon6 Apr 2024 21:26 UTC

68 points

23 comments10 min readLW link

The 2nd Demographic Transition

Maxwell Tabarrok6 Apr 2024 14:10 UTC

68 points

20 comments4 min readLW link

(www.maximum-progress.com)

My intellectual journey to (dis)solve the hard problem of consciousness

Charbel-Raphaël6 Apr 2024 9:32 UTC

48 points

45 comments30 min readLW link

Measuring Predictability of Persona Evaluations

Thee Ho and evhub

6 Apr 2024 8:46 UTC

20 points

0 comments7 min readLW link

Privacy and writing

Neil 6 Apr 2024 8:20 UTC

20 points

1 comment5 min readLW link

[Question] How does the ever-increasing use of AI in the military for the direct purpose of murdering people affect your p(doom)?

Justausername6 Apr 2024 6:31 UTC

19 points

16 comments1 min readLW link

Two tools for rethinking existential risk

Arepo6 Apr 2024 2:55 UTC

2 points

0 comments25 min readLW link

Exploring Whole Brain Emulation

PeterMcCluskey6 Apr 2024 2:38 UTC

13 points

1 comment2 min readLW link

(bayesianinvestor.com)

Koan: divining alien datastructures from RAM activations

TsviBT5 Apr 2024 18:04 UTC

65 points

10 comments21 min readLW link

On the 2nd CWT with Jonathan Haidt

Zvi5 Apr 2024 17:30 UTC

27 points

3 comments33 min readLW link

(thezvi.wordpress.com)

End-to-end hacking with language models

tchauvin5 Apr 2024 15:06 UTC

29 points

0 comments8 min readLW link

Partial value takeover without world takeover

KatjaGrace5 Apr 2024 6:20 UTC

92 points

26 comments3 min readLW link 1 review

(worldspiritsockpuppet.com)

On Complexity Science

Garrett Baker5 Apr 2024 2:24 UTC

54 points

21 comments4 min readLW link

Using game theory to elect a centrist in the 2024 US Presidential Election

Ebenezer Dukakis5 Apr 2024 0:46 UTC

−1 points

0 comments8 min readLW link

New report: A review of the empirical evidence for existential risk from AI via misaligned power-seeking

Harlan and rosehadshar

4 Apr 2024 23:41 UTC

31 points

5 comments1 min readLW link

(blog.aiimpacts.org)

Quick evidence review of bulking & cutting

jp4 Apr 2024 21:43 UTC

41 points

9 comments4 min readLW link 2 reviews

LLMs for Alignment Research: a safety priority?

abramdemski4 Apr 2024 20:03 UTC

148 points

25 comments11 min readLW link

On Leif Wenar’s Absurdly Unconvincing Critique Of Effective Altruism

Bentham's Bulldog4 Apr 2024 19:01 UTC

8 points

2 comments14 min readLW link

Run evals on base models too!

orthonormal4 Apr 2024 18:43 UTC

51 points

6 comments1 min readLW link

Let’s Fund: Impact of our $1M crowdfunded grant to the Center for Clean Energy Innovation

Hauke Hillebrandt4 Apr 2024 16:28 UTC

5 points

0 comments5 min readLW link

(lets-fund.org)

The Buckling World Hypothesis—Visualising Vulnerable Worlds

Rosco-Hunter4 Apr 2024 15:51 UTC

−5 points

2 comments4 min readLW link

Can AI Transform the Electorate into a Citizen’s Assembly?

Rosco-Hunter4 Apr 2024 15:45 UTC

−6 points

0 comments4 min readLW link

AI Discrimination Requirements: A Regulatory Review

Deric Cheng and Elliot Mckernon

4 Apr 2024 15:43 UTC

7 points

0 comments6 min readLW link

Trying to Do More Good

jefftk4 Apr 2024 14:20 UTC

18 points

0 comments12 min readLW link

(www.jefftk.com)

Language and Capabilities: Testing LLM Mathematical Abilities Across Languages

Ethan Edwards4 Apr 2024 13:18 UTC

24 points

2 comments36 min readLW link

AI #58: Stargate AGI

Zvi4 Apr 2024 13:10 UTC

49 points

9 comments60 min readLW link

(thezvi.wordpress.com)

Cult of equilibrium

Templarrr4 Apr 2024 9:19 UTC

13 points

2 comments1 min readLW link

[Question] Should you refuse this bet in Technicolor Sleeping Beauty?

Ape in the coat4 Apr 2024 8:55 UTC

16 points

15 comments1 min readLW link

[Question] What’s with all the bans recently?

Gerald Monroe4 Apr 2024 6:16 UTC

63 points

83 comments4 min readLW link

Best in Class Life Improvement

sapphire4 Apr 2024 1:51 UTC

77 points

20 comments1 min readLW link

[Question] What is the purpose and application of AI Debate?

VojtaKovarik4 Apr 2024 0:38 UTC

13 points

9 comments1 min readLW link

Concrete empirical research projects in mechanistic anomaly detection

Erik Jenner, Viktor Rehnberg and Oliver Daniels

3 Apr 2024 23:07 UTC

43 points

3 comments10 min readLW link