All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 212223 24 25 26 27 28 29 30 31

Vernor Vinge, who coined the term “Technological Singularity”, dies at 79

Kaj_Sotala21 Mar 2024 22:14 UTC

150 points

25 comments1 min readLW link

(arstechnica.com)

ChatGPT can learn indirect control

Raymond Douglas21 Mar 2024 21:11 UTC

213 points

27 comments1 min readLW link

“Deep Learning” Is Function Approximation

Zack_M_Davis21 Mar 2024 17:50 UTC

98 points

28 comments10 min readLW link

(zackmdavis.net)

A Teacher vs. Everyone Else

ronak6921 Mar 2024 17:45 UTC

41 points

8 comments2 min readLW link

Static vs Dynamic Alignment

Gracie Green21 Mar 2024 17:44 UTC

5 points

0 comments12 min readLW link

On green

Joe Carlsmith21 Mar 2024 17:38 UTC

271 points

35 comments31 min readLW link

Comparing Alignment to other AGI interventions: Extensions and analysis

Martín Soto21 Mar 2024 17:30 UTC

7 points

0 comments4 min readLW link

The Comcast Problem

RamblinDash21 Mar 2024 16:46 UTC

1 point

15 comments1 min readLW link

Vipassana Meditation and Active Inference: A Framework for Understanding Suffering and its Cessation

sturb21 Mar 2024 12:32 UTC

50 points

8 comments19 min readLW link

AI #56: Blackwell That Ends Well

Zvi21 Mar 2024 12:10 UTC

34 points

16 comments68 min readLW link

(thezvi.wordpress.com)

An Affordable CO2 Monitor

Pretentious Penguin21 Mar 2024 3:06 UTC

28 points

1 comment1 min readLW link

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-Perlman21 Mar 2024 3:00 UTC

61 points

8 comments1 min readLW link

(arxiv.org)

Where are the Contra Dances?

jefftk21 Mar 2024 2:00 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

Slim overview of work one could do to make AI go better (and a grab-bag of other career considerations)

Chi Nguyen20 Mar 2024 23:17 UTC

9 points

1 comment3 min readLW link

How does AI solve problems?

Dom Polsinelli20 Mar 2024 22:29 UTC

2 points

0 comments7 min readLW link

What I Learned (Conclusion To “The Sense Of Physical Necessity”)

LoganStrohl20 Mar 2024 21:24 UTC

34 points

0 comments3 min readLW link

Stagewise Development in Neural Networks

Jesse Hoogland, Liam Carroll and Daniel Murfet

20 Mar 2024 19:54 UTC

96 points

1 comment11 min readLW link

On the Gladstone Report

Zvi20 Mar 2024 19:50 UTC

64 points

11 comments40 min readLW link

(thezvi.wordpress.com)

Natural Latents: The Concepts

johnswentworth and David Lorell

20 Mar 2024 18:21 UTC

94 points

23 comments19 min readLW link

Comparing Alignment to other AGI interventions: Basic model

Martín Soto20 Mar 2024 18:17 UTC

12 points

4 comments7 min readLW link

New report: Safety Cases for AI

joshc20 Mar 2024 16:45 UTC

90 points

14 comments1 min readLW link

(twitter.com)

User-inclination-guessing algorithms: registering a goal

ProgramCrafter20 Mar 2024 15:55 UTC

2 points

0 comments2 min readLW link

My MATS Summer 2023 experience

James Chua20 Mar 2024 11:26 UTC

29 points

0 comments3 min readLW link

(jameschua.net)

[Question] What are the weirdest things a human may want for their own sake?

Mateusz Bagiński20 Mar 2024 11:15 UTC

7 points

16 comments1 min readLW link

[Question] Best organization red-pill books and posts?

lemonhope20 Mar 2024 7:01 UTC

10 points

2 comments1 min readLW link

Parent-Friendly Dance Weekends

jefftk20 Mar 2024 2:10 UTC

16 points

0 comments2 min readLW link

(www.jefftk.com)

[Question] “I Can’t Believe It Both Is and Is Not Encephalitis!” Or: What do you do when the evidence is crazy?

Erhannis19 Mar 2024 22:08 UTC

20 points

3 comments11 min readLW link

Delta’s of Change

Jonas Kgomo19 Mar 2024 21:03 UTC

1 point

0 comments4 min readLW link

Increasing IQ by 10 Points is Possible

George3d619 Mar 2024 20:48 UTC

23 points

51 comments5 min readLW link

(morelucid.substack.com)

Are extreme probabilities for P(doom) epistemically justifed?

NathanBarnard and Alexander Gietelink Oldenziel

19 Mar 2024 20:32 UTC

20 points

12 comments7 min readLW link

Have I Solved the Two Envelopes Problem Once and For All?

JackOfAllTrades19 Mar 2024 19:57 UTC

−6 points

5 comments3 min readLW link

[Question] How can one be less wrong, if their conversation partner loses the interest on discussing the topic with them?

Ooker19 Mar 2024 18:11 UTC

−10 points

3 comments1 min readLW link

Carlo: uncertainty analysis in Google Sheets

ProbabilityEnjoyer19 Mar 2024 17:59 UTC

6 points

0 comments1 min readLW link

(carlo.app)

NAIRA—An exercise in regulatory, competitive safety governance [AI Governance Institutional Design idea]

Heramb19 Mar 2024 17:43 UTC

2 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

AI Safety Evaluations: A Regulatory Review

Elliot Mckernon and Deric Cheng

19 Mar 2024 15:05 UTC

22 points

1 comment11 min readLW link

Mechanism for feature learning in neural networks and backpropagation-free machine learning models

Matt Goldenberg19 Mar 2024 14:55 UTC

8 points

1 comment1 min readLW link

(www.science.org)

Monthly Roundup #16: March 2024

Zvi19 Mar 2024 13:10 UTC

33 points

4 comments55 min readLW link

(thezvi.wordpress.com)

Experimentation (Part 7 of “The Sense Of Physical Necessity”)

LoganStrohl18 Mar 2024 21:25 UTC

33 points

0 comments10 min readLW link

INTERVIEW: Round 2 - StakeOut.AI w/ Dr. Peter Park

jacobhaimes18 Mar 2024 21:21 UTC

5 points

0 comments1 min readLW link

(into-ai-safety.github.io)

Neuroscience and Alignment

Garrett Baker18 Mar 2024 21:09 UTC

40 points

25 comments2 min readLW link

GPT, the magical collaboration zone, Lex Fridman and Sam Altman

Bill Benzon18 Mar 2024 20:04 UTC

3 points

1 comment3 min readLW link

Measuring Coherence of Policies in Toy Environments

dx26 and Richard_Ngo

18 Mar 2024 17:59 UTC

59 points

9 comments14 min readLW link

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Neel Nanda, János Kramár, Tom Lieberum and Rohin Shah

18 Mar 2024 17:28 UTC

19 points

0 comments1 min readLW link

(arxiv.org)

Community Notes by X

NicholasKees18 Mar 2024 17:13 UTC

129 points

15 comments7 min readLW link

[Question] Is the Basilisk pretending to be hidden in this simulation so that it can check what I would do if conditioned by a world without the Basilisk?

maybefbi18 Mar 2024 16:05 UTC

−18 points

1 comment1 min readLW link

On Devin

Zvi18 Mar 2024 13:20 UTC

148 points

34 comments11 min readLW link

(thezvi.wordpress.com)

RLLMv10 experiment

MiguelDev18 Mar 2024 8:32 UTC

5 points

0 comments2 min readLW link

Join the AI Evaluation Tasks Bounty Hackathon

Esben Kran18 Mar 2024 8:15 UTC

12 points

1 comment4 min readLW link

5 Physics Problems

DaemonicSigil and Muireall

18 Mar 2024 8:05 UTC

60 points

0 comments15 min readLW link

Inferring the model dimension of API-protected LLMs

Ege Erdil18 Mar 2024 6:19 UTC

34 points

3 comments4 min readLW link

(arxiv.org)