All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 91011 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] What are some posthumanist/more-than-human approaches to definitions of intelligence and agency? Particularly in application to AI research.

Eli Hiton9 Apr 2024 21:52 UTC

1 point

0 comments1 min readLW link

Ophiology (or, how the Mamba architecture works)

Danielle Ensign, SrGonao and Adrià Garriga-alonso

9 Apr 2024 19:31 UTC

67 points

10 comments10 min readLW link

Apply to LASR Labs: a London-based technical AI safety research programme

Erin Robertson, Charlie Griffin, joehardie and LASR Labs

9 Apr 2024 17:34 UTC

45 points

1 comment3 min readLW link

“Decentralized Autonomous Education”—Call for Reviewers (Seeds of Science)

rogersbacon9 Apr 2024 14:39 UTC

6 points

0 comments1 min readLW link

D&D.Sci: The Mad Tyrant’s Pet Turtles [Evaluation and Ruleset]

abstractapplic9 Apr 2024 14:01 UTC

48 points

6 comments3 min readLW link

Medical Roundup #2

Zvi9 Apr 2024 13:40 UTC

37 points

18 comments16 min readLW link

(thezvi.wordpress.com)

[Closed] PIBBSS is hiring in a variety of roles (alignment research and incubation program)

Nora_Ammann, Lucas Teixeira and DusanDNesic

9 Apr 2024 8:12 UTC

54 points

0 comments3 min readLW link

Any evidence or reason to expect a multiverse / Everett branches?

lemonhope9 Apr 2024 5:26 UTC

9 points

127 comments1 min readLW link

Fermenting Form

koratkar9 Apr 2024 2:46 UTC

19 points

2 comments4 min readLW link

(careerscouting.substack.com)

[Question] Non-ultimatum game problem

numpyNaN8 Apr 2024 23:25 UTC

9 points

4 comments2 min readLW link

Pandemic Identification Simulator

jefftk8 Apr 2024 19:00 UTC

22 points

0 comments1 min readLW link

(www.jefftk.com)

How We Picture Bayesian Agents

johnswentworth and David Lorell

8 Apr 2024 18:12 UTC

73 points

14 comments7 min readLW link

CEA seeks co-founder for AI safety group support spin-off

agucova8 Apr 2024 15:42 UTC

18 points

0 comments4 min readLW link

Investigating the role of agency in AI x-risk

Corin Katzke8 Apr 2024 15:12 UTC

10 points

0 comments40 min readLW link

(www.convergenceanalysis.org)

Measuring Learned Optimization in Small Transformer Models

J Bostock8 Apr 2024 14:41 UTC

22 points

0 comments11 min readLW link

[Question] Can singularity emerge from transformers?

MP8 Apr 2024 14:26 UTC

3 points

1 comment1 min readLW link

Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition

cmathw, Dennis Akar and Lee Sharkey

8 Apr 2024 11:14 UTC

42 points

4 comments15 min readLW link

Math-to-English Cheat Sheet

nahoj8 Apr 2024 9:19 UTC

54 points

5 comments6 min readLW link

[Question] What does it take to transfer the knowledge to action?

EL_File41388 Apr 2024 6:23 UTC

3 points

7 comments1 min readLW link

Normalizing Sparse Autoencoders

Fengyuan Hu8 Apr 2024 6:17 UTC

22 points

18 comments13 min readLW link

A Dozen Ways to Get More Dakka

Davidmanheim8 Apr 2024 4:45 UTC

143 points

12 comments3 min readLW link 1 review

[Crosspost] Introducing the Hypermanifest: Redefining AI’s Role in Human Connection and Interaction

simulacra.exe7 Apr 2024 17:21 UTC

4 points

0 comments5 min readLW link

Applications Open: Elevate Your Mental Wellbeing with Rethink Wellbeing’s CBT Program

Inga G.7 Apr 2024 14:03 UTC

13 points

2 comments4 min readLW link

The Poker Theory of Poker Night

omark7 Apr 2024 9:47 UTC

29 points

13 comments9 min readLW link

(www.codeandbugs.com)

Centrists are (probably) less biased

Kevin Dorst7 Apr 2024 6:40 UTC

1 point

2 comments5 min readLW link

(kevindorst.substack.com)

on the dollar-yen exchange rate

bhauth7 Apr 2024 4:49 UTC

50 points

21 comments10 min readLW link

(www.bhauth.com)

Conflict in Posthuman Literature

Martín Soto6 Apr 2024 22:26 UTC

42 points

1 comment2 min readLW link

(twitter.com)

“Fractal Strategy” workshop report

Raemon6 Apr 2024 21:26 UTC

68 points

23 comments10 min readLW link

The 2nd Demographic Transition

Maxwell Tabarrok6 Apr 2024 14:10 UTC

68 points

20 comments4 min readLW link

(www.maximum-progress.com)

My intellectual journey to (dis)solve the hard problem of consciousness

Charbel-Raphaël6 Apr 2024 9:32 UTC

48 points

45 comments30 min readLW link

Measuring Predictability of Persona Evaluations

Thee Ho and evhub

6 Apr 2024 8:46 UTC

20 points

0 comments7 min readLW link

Privacy and writing

Neil 6 Apr 2024 8:20 UTC

20 points

1 comment5 min readLW link

[Question] How does the ever-increasing use of AI in the military for the direct purpose of murdering people affect your p(doom)?

Justausername6 Apr 2024 6:31 UTC

19 points

16 comments1 min readLW link

Two tools for rethinking existential risk

Arepo6 Apr 2024 2:55 UTC

2 points

0 comments25 min readLW link

Exploring Whole Brain Emulation

PeterMcCluskey6 Apr 2024 2:38 UTC

13 points

1 comment2 min readLW link

(bayesianinvestor.com)

Koan: divining alien datastructures from RAM activations

TsviBT5 Apr 2024 18:04 UTC

65 points

10 comments21 min readLW link

On the 2nd CWT with Jonathan Haidt

Zvi5 Apr 2024 17:30 UTC

27 points

3 comments33 min readLW link

(thezvi.wordpress.com)

End-to-end hacking with language models

tchauvin5 Apr 2024 15:06 UTC

29 points

0 comments8 min readLW link

Partial value takeover without world takeover

KatjaGrace5 Apr 2024 6:20 UTC

92 points

26 comments3 min readLW link 1 review

(worldspiritsockpuppet.com)

On Complexity Science

Garrett Baker5 Apr 2024 2:24 UTC

54 points

21 comments4 min readLW link

Using game theory to elect a centrist in the 2024 US Presidential Election

Ebenezer Dukakis5 Apr 2024 0:46 UTC

−1 points

0 comments8 min readLW link

New report: A review of the empirical evidence for existential risk from AI via misaligned power-seeking

Harlan and rosehadshar

4 Apr 2024 23:41 UTC

31 points

5 comments1 min readLW link

(blog.aiimpacts.org)

Quick evidence review of bulking & cutting

jp4 Apr 2024 21:43 UTC

41 points

9 comments4 min readLW link 2 reviews

LLMs for Alignment Research: a safety priority?

abramdemski4 Apr 2024 20:03 UTC

148 points

25 comments11 min readLW link

On Leif Wenar’s Absurdly Unconvincing Critique Of Effective Altruism

Bentham's Bulldog4 Apr 2024 19:01 UTC

8 points

2 comments14 min readLW link

Run evals on base models too!

orthonormal4 Apr 2024 18:43 UTC

51 points

6 comments1 min readLW link

Let’s Fund: Impact of our $1M crowdfunded grant to the Center for Clean Energy Innovation

Hauke Hillebrandt4 Apr 2024 16:28 UTC

5 points

0 comments5 min readLW link

(lets-fund.org)

The Buckling World Hypothesis—Visualising Vulnerable Worlds

Rosco-Hunter4 Apr 2024 15:51 UTC

−5 points

2 comments4 min readLW link

Can AI Transform the Electorate into a Citizen’s Assembly?

Rosco-Hunter4 Apr 2024 15:45 UTC

−6 points

0 comments4 min readLW link

AI Discrimination Requirements: A Regulatory Review

Deric Cheng and Elliot Mckernon

4 Apr 2024 15:43 UTC

7 points

0 comments6 min readLW link