All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 202122 23 24 25 26 27 28 29 30 31

Slim overview of work one could do to make AI go better (and a grab-bag of other career considerations)

Chi NguyenMar 20, 2024, 11:17 PM

9 points

1 comment LW link

How does AI solve problems?

Dom PolsinelliMar 20, 2024, 10:29 PM

2 points

0 comments7 min readLW link

What I Learned (Conclusion To “The Sense Of Physical Necessity”)

LoganStrohlMar 20, 2024, 9:24 PM

34 points

0 comments3 min readLW link

Stagewise Development in Neural Networks

Jesse Hoogland, Liam Carroll and Daniel Murfet

Mar 20, 2024, 7:54 PM

96 points

1 comment11 min readLW link

On the Gladstone Report

ZviMar 20, 2024, 7:50 PM

64 points

11 comments40 min readLW link

(thezvi.wordpress.com)

Natural Latents: The Concepts

johnswentworth and David Lorell

Mar 20, 2024, 6:21 PM

90 points

18 comments19 min readLW link

Comparing Alignment to other AGI interventions: Basic model

Martín SotoMar 20, 2024, 6:17 PM

12 points

4 comments7 min readLW link

New report: Safety Cases for AI

joshcMar 20, 2024, 4:45 PM

89 points

14 comments1 min readLW link

(twitter.com)

User-inclination-guessing algorithms: registering a goal

ProgramCrafterMar 20, 2024, 3:55 PM

2 points

0 comments2 min readLW link

My MATS Summer 2023 experience

James ChuaMar 20, 2024, 11:26 AM

29 points

0 comments3 min readLW link

(jameschua.net)

[Question] What are the weirdest things a human may want for their own sake?

Mateusz BagińskiMar 20, 2024, 11:15 AM

7 points

16 comments1 min readLW link

[Question] Best organization red-pill books and posts?

lemonhopeMar 20, 2024, 7:01 AM

10 points

2 comments1 min readLW link

Parent-Friendly Dance Weekends

jefftkMar 20, 2024, 2:10 AM

16 points

0 comments2 min readLW link

(www.jefftk.com)

[Question] “I Can’t Believe It Both Is and Is Not Encephalitis!” Or: What do you do when the evidence is crazy?

ErhannisMar 19, 2024, 10:08 PM

20 points

3 comments11 min readLW link

Delta’s of Change

Jonas KgomoMar 19, 2024, 9:03 PM

1 point

0 comments4 min readLW link

Increasing IQ by 10 Points is Possible

George3d6Mar 19, 2024, 8:48 PM

23 points

51 comments5 min readLW link

(morelucid.substack.com)

Are extreme probabilities for P(doom) epistemically justifed?

NathanBarnard and Alexander Gietelink Oldenziel

Mar 19, 2024, 8:32 PM

20 points

12 comments7 min readLW link

Have I Solved the Two Envelopes Problem Once and For All?

JackOfAllTradesMar 19, 2024, 7:57 PM

−6 points

5 comments3 min readLW link

[Question] How can one be less wrong, if their conversation partner loses the interest on discussing the topic with them?

OokerMar 19, 2024, 6:11 PM

−10 points

3 comments1 min readLW link

Carlo: uncertainty analysis in Google Sheets

ProbabilityEnjoyerMar 19, 2024, 5:59 PM

6 points

0 comments1 min readLW link

(carlo.app)

NAIRA—An exercise in regulatory, competitive safety governance [AI Governance Institutional Design idea]

HerambMar 19, 2024, 5:43 PM

2 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

AI Safety Evaluations: A Regulatory Review

Elliot Mckernon and Deric Cheng

Mar 19, 2024, 3:05 PM

22 points

1 comment11 min readLW link

Mechanism for feature learning in neural networks and backpropagation-free machine learning models

Matt GoldenbergMar 19, 2024, 2:55 PM

8 points

1 comment1 min readLW link

(www.science.org)

Monthly Roundup #16: March 2024

ZviMar 19, 2024, 1:10 PM

33 points

4 comments55 min readLW link

(thezvi.wordpress.com)

Experimentation (Part 7 of “The Sense Of Physical Necessity”)

LoganStrohlMar 18, 2024, 9:25 PM

33 points

0 comments10 min readLW link

INTERVIEW: Round 2 - StakeOut.AI w/ Dr. Peter Park

jacobhaimesMar 18, 2024, 9:21 PM

5 points

0 comments1 min readLW link

(into-ai-safety.github.io)

Neuroscience and Alignment

Garrett BakerMar 18, 2024, 9:09 PM

40 points

25 comments2 min readLW link

GPT, the magical collaboration zone, Lex Fridman and Sam Altman

Bill BenzonMar 18, 2024, 8:04 PM

3 points

1 comment3 min readLW link

Measuring Coherence of Policies in Toy Environments

dx26 and Richard_Ngo

Mar 18, 2024, 5:59 PM

59 points

9 comments14 min readLW link

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Neel Nanda, János Kramár, Tom Lieberum and Rohin Shah

Mar 18, 2024, 5:28 PM

19 points

0 comments1 min readLW link

(arxiv.org)

Community Notes by X

NicholasKeesMar 18, 2024, 5:13 PM

127 points

15 comments7 min readLW link

[Question] Is the Basilisk pretending to be hidden in this simulation so that it can check what I would do if conditioned by a world without the Basilisk?

maybefbiMar 18, 2024, 4:05 PM

−18 points

1 comment1 min readLW link

On Devin

ZviMar 18, 2024, 1:20 PM

148 points

34 comments11 min readLW link

(thezvi.wordpress.com)

RLLMv10 experiment

MiguelDevMar 18, 2024, 8:32 AM

5 points

0 comments2 min readLW link

Join the AI Evaluation Tasks Bounty Hackathon

Esben KranMar 18, 2024, 8:15 AM

12 points

1 comment LW link

5 Physics Problems

DaemonicSigil and Muireall

Mar 18, 2024, 8:05 AM

60 points

0 comments15 min readLW link

Inferring the model dimension of API-protected LLMs

Ege ErdilMar 18, 2024, 6:19 AM

34 points

3 comments4 min readLW link

(arxiv.org)

AI strategy given the need for good reflection

owencbMar 18, 2024, 12:48 AM

7 points

0 comments LW link

XAI releases Grok base model

Jacob G-WMar 18, 2024, 12:47 AM

11 points

3 comments1 min readLW link

(x.ai)

Toki pona FAQ

dkl9Mar 17, 2024, 9:44 PM

37 points

9 comments1 min readLW link

(dkl9.net)

EA ErFiN Project work

Max_He-HoMar 17, 2024, 8:42 PM

2 points

0 comments1 min readLW link

EA ErFiN Project work

Max_He-HoMar 17, 2024, 8:37 PM

2 points

0 comments1 min readLW link

[Question] Alice and Bob is debating on a technique. Alice says Bob should try it before denying it. Is it a fallacy or something similar?

OokerMar 17, 2024, 8:01 PM

0 points

19 comments2 min readLW link

Is there a way to calculate the P(we are in a 2nd cold war)?

cloakMar 17, 2024, 8:01 PM

−9 points

2 comments1 min readLW link

The Worst Form Of Government (Except For Everything Else We’ve Tried)

johnswentworthMar 17, 2024, 6:11 PM

135 points

47 comments4 min readLW link

Applying simulacrum levels to hobbies, interests and goals

DMMFMar 17, 2024, 4:18 PM

15 points

2 comments4 min readLW link

(danfrank.ca)

What is the best argument that LLMs are shoggoths?

JoshuaFoxMar 17, 2024, 11:36 AM

26 points

22 comments1 min readLW link

Invitation to the Princeton AI Alignment and Safety Seminar

Sadhika MalladiMar 17, 2024, 1:10 AM

6 points

1 comment1 min readLW link

Anxiety vs. Depression

SableMar 17, 2024, 12:15 AM

86 points

35 comments3 min readLW link

(affablyevil.substack.com)

Celiefs

TheLemmaLlamaMar 16, 2024, 11:56 PM

3 points

8 comments1 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer