All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

how 2 tell if ur input is out of distribution given only model weights

dkirmani5 Aug 2023 22:45 UTC

49 points

10 comments1 min readLW link

Summary of Improving Global Decision Making (around AI)

Will_Pearson5 Aug 2023 18:46 UTC

−7 points

0 comments1 min readLW link

Ground-Truth Label Imbalance Impairs the Performance of Contrast-Consistent Search (and Other Contrast-Pair-Based Unsupervised Methods)

Tom Angsten and Ami Hays

5 Aug 2023 17:55 UTC

6 points

2 comments7 min readLW link

(drive.google.com)

Seattle Astral Codex Ten Monthly Social

a7x5 Aug 2023 17:55 UTC

1 point

0 comments1 min readLW link

AISafety.info’s Writing & Editing Hackathon

smallsilo5 Aug 2023 17:14 UTC

2 points

0 comments1 min readLW link

Join AISafety.info’s Writing & Editing Hackathon (Aug 25-28) (Prizes to be won!)

smallsilo5 Aug 2023 14:08 UTC

19 points

3 comments1 min readLW link

(forum.effectivealtruism.org)

Stomach Ulcers and Dental Cavities

Metacelsus5 Aug 2023 14:08 UTC

57 points

7 comments1 min readLW link

(denovo.substack.com)

video games > IQ tests

bhauth5 Aug 2023 13:27 UTC

34 points

46 comments3 min readLW link

[Linkpost] Applicability of scaling laws to vision encoding models

Bogdan Ionut Cirstea5 Aug 2023 11:10 UTC

11 points

2 comments1 min readLW link

A Naive Proposal for Constructing Interpretable AI

Chris_Leong5 Aug 2023 10:32 UTC

18 points

6 comments2 min readLW link

ACX Paris Meetup—August 11 2023

PoignardAzur5 Aug 2023 9:44 UTC

2 points

0 comments1 min readLW link

Meet Hyperion on Sunday Aug 6?

duck_master5 Aug 2023 4:36 UTC

1 point

0 comments1 min readLW link

[Question] What are the best published papers from outside the alignment community that are relevant to Agent Foundations?

Stephen Fowler5 Aug 2023 3:02 UTC

20 points

4 comments1 min readLW link

Announcing Squiggle Hub

ozziegooen and Slava Matyukhin

5 Aug 2023 1:00 UTC

49 points

4 comments5 min readLW link

(forum.effectivealtruism.org)

Read More Books but Pretend to Read Even More

Arjun Panickssery5 Aug 2023 0:07 UTC

26 points

12 comments4 min readLW link

(arjunpanickssery.substack.com)

The Sinews of Sudan’s Latest War

Tim Liptrot4 Aug 2023 18:17 UTC

43 points

12 comments12 min readLW link

Private notes on LW?

Raemon4 Aug 2023 17:35 UTC

61 points

33 comments1 min readLW link

When training AI, we should escalate the frequency of capability tests

Hauke Hillebrandt4 Aug 2023 16:07 UTC

2 points

0 comments1 min readLW link

Manifund: What we’re funding (weeks 2-4)

Austin Chen4 Aug 2023 16:00 UTC

44 points

2 comments5 min readLW link

(manifund.substack.com)

[Linkpost] Multimodal Neurons in Pretrained Text-Only Transformers

Bogdan Ionut Cirstea4 Aug 2023 15:29 UTC

11 points

0 comments1 min readLW link

Apollo Research is hiring evals and interpretability engineers & scientists

Marius Hobbhahn4 Aug 2023 10:54 UTC

25 points

0 comments2 min readLW link

[Question] Has anyone tried creating a YouTube or TikTok series covering the sequences?

Max Rossi4 Aug 2023 0:10 UTC

4 points

4 comments1 min readLW link

[Question] Is there any metric measuring ~”proportion of people creating extra value”?

Amal 3 Aug 2023 22:54 UTC

7 points

3 comments1 min readLW link

[Question] Hypothetical: what would you do?

JNS3 Aug 2023 22:39 UTC

4 points

2 comments1 min readLW link

[Linkpost] Deception Abilities Emerged in Large Language Models

Bogdan Ionut Cirstea3 Aug 2023 17:28 UTC

12 points

0 comments1 min readLW link

Embedding Ethical Priors into AI Systems: A Bayesian Approach

Justausername3 Aug 2023 15:31 UTC

−5 points

3 comments21 min readLW link

Password-locked models: a stress case for capabilities evaluation

Fabien Roger3 Aug 2023 14:53 UTC

156 points

14 comments6 min readLW link

AI #23: Fundamental Problems with RLHF

Zvi3 Aug 2023 12:50 UTC

59 points

9 comments41 min readLW link

(thezvi.wordpress.com)

Bad Imitation Instruments

jefftk3 Aug 2023 2:30 UTC

21 points

1 comment1 min readLW link

(www.jefftk.com)

Kolmogorov’s theory of Algorithmic Probability

Aidan Rocke3 Aug 2023 0:58 UTC

6 points

2 comments2 min readLW link

(keplerlounge.com)

Work culture creep

CrimsonChin3 Aug 2023 0:38 UTC

34 points

16 comments8 min readLW link

[Question] Boxing

Zach Stein-Perlman2 Aug 2023 23:38 UTC

6 points

1 comment1 min readLW link

External rationality vs. internal rationality

metachirality2 Aug 2023 23:29 UTC

7 points

0 comments1 min readLW link

When performing a dimensionality reduction on tensors, the trace is often zero.

Joseph Van Name2 Aug 2023 21:06 UTC

7 points

1 comment3 min readLW link

Progress links digest, 2023-08-02: Superconductor edition

jasoncrawford2 Aug 2023 20:27 UTC

13 points

0 comments3 min readLW link

(rootsofprogress.org)

[Question] What works for ADHD and/or related things?

TeaTieAndHat2 Aug 2023 18:37 UTC

9 points

13 comments1 min readLW link

[Question] Would you pay for a search engine limited to rationalist sites?

Conor2 Aug 2023 18:06 UTC

4 points

19 comments1 min readLW link

The Roots of Progress Blog-Building Intensive: advice for applicants, request for support

jasoncrawford2 Aug 2023 15:37 UTC

9 points

0 comments1 min readLW link

(rootsofprogress.org)

3 levels of threat obfuscation

HoldenKarnofsky2 Aug 2023 14:58 UTC

71 points

14 comments7 min readLW link

ChatGPT for translation

Varshul Gupta2 Aug 2023 11:57 UTC

1 point

0 comments3 min readLW link

(dubverseblack.substack.com)

Long-Term Future Fund: April 2023 grant recommendations

abergal, calebp99, Linch, habryka, Thomas Larsen and Vaniver

2 Aug 2023 7:54 UTC

81 points

3 comments50 min readLW link

[Question] Could we breed/engineer intelligent parrots?

lemonhope2 Aug 2023 7:32 UTC

9 points

18 comments1 min readLW link

Anthropical Motte and Bailey in two versions of Sleeping Beauty

Ape in the coat2 Aug 2023 7:08 UTC

32 points

57 comments6 min readLW link

solar-thermal and techno-economic analysis

bhauth2 Aug 2023 6:22 UTC

21 points

8 comments5 min readLW link

(www.bhauth.com)

South Bay ACX/SSC Meetup @ Whole Foods

allisona2 Aug 2023 3:44 UTC

1 point

0 comments1 min readLW link

“Is There Anything That’s Worth More”

Zack_M_Davis2 Aug 2023 3:28 UTC

64 points

6 comments1 min readLW link

Bay Winter Solstice: call for speech pitches!

tcheasdfjkl2 Aug 2023 3:24 UTC

9 points

0 comments1 min readLW link

(docs.google.com)

[Question] What is ontology?

Adam Zerner2 Aug 2023 0:54 UTC

28 points

19 comments1 min readLW link

My current LK99 questions

Eliezer Yudkowsky1 Aug 2023 22:48 UTC

211 points

38 comments5 min readLW link

Spiral Staircase

Michael Samoilov1 Aug 2023 21:51 UTC

21 points

2 comments2 min readLW link