Page 2

Running the Numbers on a Heat Pump (jefftk, Feb 9, 2024, 3:00 AM) · 30 points · 12 comments · 4 min read · LW link (www.jefftk.com)
[Question] How do high-trust societies form? (Shankar Sivarajan, Feb 9, 2024, 1:11 AM) · 23 points · 17 comments · 1 min read · LW link
[Question] How do health systems work in adequate worlds? (mukashi, Feb 9, 2024, 12:54 AM) · 10 points · 2 comments · 1 min read · LW link
Twin Cities ACX Meetup—February 2024 (Timothy M., Feb 8, 2024, 11:26 PM) · 1 point · 2 comments · 1 min read · LW link
A review of “Don’t forget the boundary problem...” (jessicata, Feb 8, 2024, 11:19 PM) · 12 points · 1 comment · 12 min read · LW link (unstablerontology.substack.com)
aintelope project update (Gunnar_Zarncke, Feb 8, 2024, 6:32 PM) · 24 points · 2 comments · 3 min read · LW link
Updatelessness doesn’t solve most problems (Martín Soto, Feb 8, 2024, 5:30 PM) · 135 points · 45 comments · 12 min read · LW link
Predicting Alignment Award Winners Using ChatGPT 4 (Shoshannah Tekofsky, Feb 8, 2024, 2:38 PM) · 16 points · 2 comments · 11 min read · LW link
AI #50: The Most Dangerous Thing (Zvi, Feb 8, 2024, 2:30 PM) · 53 points · 4 comments · 24 min read · LW link (thezvi.wordpress.com)
How to develop a photographic memory 3/3 (PhilosophicalSoul, Feb 8, 2024, 9:22 AM) · 6 points · 2 comments · 18 min read · LW link
Believing In (AnnaSalamon, Feb 8, 2024, 7:06 AM) · 241 points · 51 comments · 13 min read · LW link
Measuring pre-peer-review epistemic status (Jakub Smékal, Feb 8, 2024, 5:09 AM) · 1 point · 0 comments · 2 min read · LW link
A Chess-GPT Linear Emergent World Representation (Adam Karvonen, Feb 8, 2024, 4:25 AM) · 105 points · 14 comments · 7 min read · LW link (adamkarvonen.github.io)
Domestic Production vs International Wealth Creation (100YearPants, Feb 8, 2024, 4:25 AM) · 1 point · 0 comments · 1 min read · LW link
Conditional prediction markets are evidential, not causal (philh, Feb 7, 2024, 9:52 PM) · 55 points · 10 comments · 2 min read · LW link
A Back-Of-The-Envelope Calculation On How Unlikely The Circumstantial Evidence Around Covid-19 Is (Roko, Feb 7, 2024, 9:49 PM) · −1 points · 36 comments · 5 min read · LW link
Nitric oxide for covid and other viral infections (Elizabeth, Feb 7, 2024, 9:30 PM) · 39 points · 6 comments · 6 min read · LW link (acesounderglass.com)
Debating with More Persuasive LLMs Leads to More Truthful Answers (Akbir Khan, John Hughes, Dan Valentine, Sam Bowman and Ethan Perez, Feb 7, 2024, 9:28 PM) · 89 points · 14 comments · 9 min read · LW link (arxiv.org)
[Question] Choosing a book on causality (martinkunev, Feb 7, 2024, 9:16 PM) · 4 points · 3 comments · 1 min read · LW link
More Hyphenation (Arjun Panickssery, Feb 7, 2024, 7:43 PM) · 88 points · 19 comments · 1 min read · LW link (arjunpanickssery.substack.com)
Reading writing advice doesn’t make writing easier (Henry Sleight, Feb 7, 2024, 7:14 PM) · 17 points · 0 comments · 5 min read · LW link (open.substack.com)
[Question] What’s this 3rd secret directive of evolution called? (survive & spread & ___) (lemonhope, Feb 7, 2024, 2:11 PM) · 10 points · 11 comments · 1 min read · LW link
Training of superintelligence is secretly adversarial (quetzal_rainbow, Feb 7, 2024, 1:38 PM) · 15 points · 2 comments · 5 min read · LW link
The Math of Suspicious Coincidences (Roko, Feb 7, 2024, 1:32 PM) · 25 points · 3 comments · 4 min read · LW link
[Question] How to deal with the sense of demotivation that comes from thinking about determinism? (SpectrumDT, Feb 7, 2024, 10:53 AM) · 13 points · 71 comments · 1 min read · LW link
Quantum Darwinism, social constructs, and the scientific method (pchvykov, Feb 7, 2024, 7:04 AM) · 6 points · 12 comments · 9 min read · LW link
Why I think it’s net harmful to do technical safety research at AGI labs (Remmelt, Feb 7, 2024, 4:17 AM) · 26 points · 24 comments · 1 min read · LW link
story-based decision-making (bhauth, Feb 7, 2024, 2:35 AM) · 90 points · 11 comments · 4 min read · LW link
Full Driving Engagement Optional (jefftk, Feb 7, 2024, 2:30 AM) · 14 points · 0 comments · 1 min read · LW link (www.jefftk.com)
How to train your own “Sleeper Agents” (evhub, Feb 7, 2024, 12:31 AM) · 92 points · 11 comments · 2 min read · LW link
My guess at Conjecture’s vision: triggering a narrative bifurcation (Alexandre Variengien, Feb 6, 2024, 7:10 PM) · 75 points · 12 comments · 16 min read · LW link
Arrogance and People Pleasing (Jonathan Moregård, Feb 6, 2024, 6:43 PM) · 26 points · 7 comments · 4 min read · LW link (honestliving.substack.com)
What does davidad want from «boundaries»? (Chipmonk and davidad, Feb 6, 2024, 5:45 PM) · 47 points · 1 comment · 5 min read · LW link
[Question] How can I efficiently read all the Dath Ilan worldbuilding? (mike_hawke, Feb 6, 2024, 4:52 PM) · 10 points · 1 comment · 1 min read · LW link
Preventing model exfiltration with upload limits (ryan_greenblatt, Feb 6, 2024, 4:29 PM) · 71 points · 22 comments · 14 min read · LW link
Evolution is an observation, not a process (Neil, Feb 6, 2024, 2:49 PM) · 8 points · 11 comments · 5 min read · LW link
[Question] Why do we need an understanding of the real world to predict the next tokens in a body of text? (Valentin Baltadzhiev, Feb 6, 2024, 2:43 PM) · 2 points · 12 comments · 1 min read · LW link
On the Debate Between Jezos and Leahy (Zvi, Feb 6, 2024, 2:40 PM) · 64 points · 6 comments · 63 min read · LW link (thezvi.wordpress.com)
Why Two Valid Answers Approach is not Enough for Sleeping Beauty (Ape in the coat, Feb 6, 2024, 2:21 PM) · 6 points · 12 comments · 6 min read · LW link
Are most personality disorders really trust disorders? (chaosmage, Feb 6, 2024, 12:37 PM) · 20 points · 4 comments · 1 min read · LW link
From Conceptual Spaces to Quantum Concepts: Formalising and Learning Structured Conceptual Models (Roman Leventov, Feb 6, 2024, 10:18 AM) · 8 points · 1 comment · 4 min read · LW link (arxiv.org)
Fluent dreaming for language models (AI interpretability method) (tbenthompson, mikes and Zygi Straznickas, Feb 6, 2024, 6:02 AM) · 46 points · 5 comments · 1 min read · LW link (arxiv.org)
Selfish AI Inevitable (Davey Morse, Feb 6, 2024, 4:29 AM UTC) · 1 point · 0 comments · 1 min read · LW link
Toy models of AI control for concentrated catastrophe prevention (Fabien Roger and Buck, Feb 6, 2024, 1:38 AM UTC) · 51 points · 2 comments · 7 min read · LW link
Things You’re Allowed to Do: University Edition (Saul Munn, Feb 6, 2024, 12:36 AM UTC) · 97 points · 13 comments · 5 min read · LW link (www.brasstacks.blog)
Value learning in the absence of ground truth (Joel_Saarinen, Feb 5, 2024, 6:56 PM UTC) · 47 points · 8 comments · 45 min read · LW link
Implementing activation steering (Annah, Feb 5, 2024, 5:51 PM UTC) · 75 points · 8 comments · 7 min read · LW link
AI alignment as a translation problem (Roman Leventov, Feb 5, 2024, 2:14 PM UTC) · 22 points · 2 comments · 3 min read · LW link
Safe Stasis Fallacy (Davidmanheim, Feb 5, 2024, 10:54 AM UTC) · 54 points · 2 comments · LW link
[Question] How has internalising a post-AGI world affected your current choices? (yanni kyriacos, Feb 5, 2024, 5:43 AM UTC) · 10 points · 8 comments · 1 min read · LW link