[Question] Examples of Low Status Fun

niplav 10 Oct 2023 23:19 UTC

18 points

17 comments 1 min read LW link

A New Model for Compute Center Verification

Damin Curtis 10 Oct 2023 19:22 UTC

8 points

0 comments 5 min read LW link

Announcing MIRI’s new CEO and leadership team

Gretta Duleba 10 Oct 2023 19:22 UTC

222 points

52 comments 3 min read LW link

18 Heterodox lenses to look the world through

Shaurya Gupta 10 Oct 2023 18:33 UTC

−1 points

2 comments 5 min read LW link

Documenting Journey Into AI Safety

jacobhaimes 10 Oct 2023 18:30 UTC

17 points

4 comments 6 min read LW link

Looking for AI Art Collaborators!

elte 10 Oct 2023 18:24 UTC

1 point

0 comments 1 min read LW link

Childhood Roundup #3

Zvi 10 Oct 2023 14:30 UTC

49 points

3 comments 30 min read LW link

(thezvi.wordpress.com)

My simple model for Alignment vs Capability

ryan_b 10 Oct 2023 12:07 UTC

7 points

0 comments 7 min read LW link

Next year in Jerusalem: The brilliant ideas and radiant legacy of Miriam Lipschutz Yevick [in relation to current AI debates]

Bill Benzon 10 Oct 2023 9:06 UTC

1 point

0 comments 1 min read LW link

(3quarksdaily.com)

Become a PIBBSS Research Affiliate

Nora_Ammann and DusanDNesic

10 Oct 2023 7:41 UTC

24 points

6 comments 6 min read LW link

My 1st month at a “neurodivergent gifted school” called Minerva University

exanova 10 Oct 2023 3:34 UTC

4 points

1 comment 1 min read LW link

(inawe.substack.com)

Epistemic Motif of Abstract-Concrete Cycles & Domain Expansion

Dalcy 10 Oct 2023 3:28 UTC

26 points

2 comments 3 min read LW link

Simple Terminal Colors

jefftk 10 Oct 2023 0:40 UTC

11 points

1 comment 1 min read LW link

(www.jefftk.com)

The Handbook of Rationality (2021, MIT press) is now open access

romeostevensit 10 Oct 2023 0:30 UTC

48 points

4 comments 1 min read LW link

Non-superintelligent paperclip maximizers are normal

jessicata 10 Oct 2023 0:29 UTC

71 points

4 comments 9 min read LW link

(unstableontology.com)

The Witching Hour

Richard_Ngo 10 Oct 2023 0:19 UTC

116 points

1 comment 9 min read LW link

(www.narrativeark.xyz)

One: a story

Richard_Ngo 10 Oct 2023 0:18 UTC

36 points

0 comments 4 min read LW link

(www.narrativeark.xyz)

Truthseeking when your disagreements lie in moral philosophy

Elizabeth and T_W

10 Oct 2023 0:00 UTC

100 points

4 comments 4 min read LW link

(acesounderglass.com)

NYT on the Manifest forecasting conference

Austin Chen 9 Oct 2023 21:40 UTC

45 points

14 comments 2 min read LW link

(www.nytimes.com)

Forecasting and prediction markets

CarlJ 9 Oct 2023 20:43 UTC

3 points

0 comments 1 min read LW link

Comparing Two Forecasters in an Ideal World

nikos 9 Oct 2023 19:52 UTC

5 points

0 comments 6 min read LW link

The case for aftermarket blind spot mirrors

Brendan Long 9 Oct 2023 19:30 UTC

60 points

14 comments 2 min read LW link

(www.brendanlong.com)

New contractor role: Web security task force contractor for AI safety announcements

Ethan Ashkie and Andrew_Critch

9 Oct 2023 18:36 UTC

11 points

0 comments 2 min read LW link

(survivalandflourishing.com)

[Question] Anyone working on D. Amodei’s Bartlett show transcript?

Leopard 9 Oct 2023 18:17 UTC

10 points

0 comments 1 min read LW link

Knowledge Base 3: Shopping advisor and other uses of knowledge base about products

iwis 9 Oct 2023 11:53 UTC

0 points

0 comments 4 min read LW link

Knowledge Base 2: The structure and the method of building

iwis 9 Oct 2023 11:53 UTC

2 points

4 comments 7 min read LW link

We don’t understand what happened with culture enough

Jan_Kulveit 9 Oct 2023 9:54 UTC

88 points

22 comments 6 min read LW link 1 review

Leveraging Bayes’ Theorem to Supercharge Memory Techniques

disoha 9 Oct 2023 3:34 UTC

−15 points

1 comment 4 min read LW link

Paper: Identifying the Risks of LM Agents with an LM-Emulated Sandbox—University of Toronto 2023 - Benchmark consisting of 36 high-stakes tools and 144 test cases!

Singularian2501 9 Oct 2023 0:00 UTC

6 points

0 comments 1 min read LW link

AI Alignment Breakthroughs this week (10/08/23)

Logan Zoellner 8 Oct 2023 23:30 UTC

32 points

14 comments 6 min read LW link

“The Heart of Gaming is the Power Fantasy”, and Cohabitive Games

Raemon 8 Oct 2023 21:02 UTC

81 points

50 comments 4 min read LW link

(bottomfeeder.substack.com)

FAQ: What the heck is goal agnosticism?

porby 8 Oct 2023 19:11 UTC

66 points

38 comments 28 min read LW link

Time is homogeneous sequentially-composable determination

TsviBT 8 Oct 2023 14:58 UTC

15 points

0 comments 21 min read LW link

Linkpost: Are Emergent Abilities in Large Language Models just In-Context Learning?

Erich_Grunewald 8 Oct 2023 12:14 UTC

12 points

7 comments 2 min read LW link

(arxiv.org)

Bird-eye view visualization of LLM activations

Sergii 8 Oct 2023 12:12 UTC

11 points

2 comments 1 min read LW link

(grgv.xyz)

Perspective Based Reasoning Could Absolve CDT

dadadarren 8 Oct 2023 11:22 UTC

4 points

5 comments 5 min read LW link

The Gradient – The Artificiality of Alignment

mic 8 Oct 2023 4:06 UTC

12 points

1 comment 5 min read LW link

(thegradient.pub)

Comparing Anthropic’s Dictionary Learning to Ours

Robert_AIZI 7 Oct 2023 23:30 UTC

137 points

8 comments 4 min read LW link

A thought about the constraints of debtlessness in online communities

mako yass 7 Oct 2023 21:26 UTC

60 points

23 comments 1 min read LW link

Arguments for utilitarianism are impossibility arguments under unbounded prospects

MichaelStJules 7 Oct 2023 21:08 UTC

7 points

7 comments 21 min read LW link

Sam Altman’s sister claims Sam sexually abused her—Part 1: Introduction, outline, author’s notes

pythagoras5015 7 Oct 2023 21:06 UTC

96 points

108 comments 8 min read LW link

Griffin Island

jefftk 7 Oct 2023 18:40 UTC

14 points

3 comments 1 min read LW link

(www.jefftk.com)

Every Mention of EA in “Going Infinite”

KirstenH 7 Oct 2023 14:42 UTC

48 points

0 comments 8 min read LW link

(open.substack.com)

Fixing Insider Threats in the AI Supply Chain

Madhav Malhotra 7 Oct 2023 13:19 UTC

20 points

2 comments 5 min read LW link

Contra Nora Belrose on Orthogonality Thesis Being Trivial

tailcalled 7 Oct 2023 11:47 UTC

18 points

21 comments 1 min read LW link

Related Discussion from Thomas Kwa’s MIRI Research Experience

Raemon 7 Oct 2023 6:25 UTC

72 points

140 comments 1 min read LW link

[Question] Current State of Probabilistic Logic

Alexander Heckett 7 Oct 2023 5:06 UTC

3 points

2 comments 1 min read LW link

On the Relationship Between Variability and the Evolutionary Outcomes of Systems in Nature

Artyom Shaposhnikov 7 Oct 2023 3:06 UTC

2 points

0 comments 1 min read LW link

Announcing Dialogues

Ben Pace 7 Oct 2023 2:57 UTC

160 points

60 comments 4 min read LW link

Don’t Dismiss Simple Alignment Approaches

Chris_Leong 7 Oct 2023 0:35 UTC

139 points

9 comments 4 min read LW link