[Question] How do I find all the items on LW that I’ve *fa­vor­ited* or up­voted?

Alex K. Chen (parrot)Aug 7, 2023, 11:51 PM
14 points
3 comments1 min readLW link

A plea for more fund­ing short­fall transparency

porbyAug 7, 2023, 9:33 PM
73 points
4 comments2 min readLW link

[Question] Tips for re­duc­ing think­ing branch­ing factor

Simon BerensAug 7, 2023, 8:21 PM
4 points
6 comments1 min readLW link

An in­ter­ac­tive in­tro­duc­tion to grokking and mechanis­tic interpretability

Aug 7, 2023, 7:09 PM
23 points
3 comments1 min readLW link
(pair.withgoogle.com)

Feed­back­loop-first Rationality

RaemonAug 7, 2023, 5:58 PM
205 points
69 comments8 min readLW link2 reviews

Grow­ing Bon­sai Net­works with RNNs

ameoAug 7, 2023, 5:34 PM
21 points
5 comments1 min readLW link
(cprimozic.net)

[Question] Should I test my­self for microplas­tics?

AugsAug 7, 2023, 5:31 PM
9 points
2 comments1 min readLW link

Op­ti­mi­sa­tion Mea­sures: Desider­ata, Im­pos­si­bil­ity, Proposals

Aug 7, 2023, 3:52 PM
36 points
9 comments1 min readLW link

An­nounc­ing the Clearer Think­ing micro-grants pro­gram for 2023

spencergAug 7, 2023, 3:21 PM
14 points
1 comment1 min readLW link
(www.clearerthinking.org)

What I’ve been read­ing, July–Au­gust 2023

jasoncrawfordAug 7, 2023, 2:22 PM
23 points
0 comments13 min readLW link
(rootsofprogress.org)

Monthly Roundup #9: Au­gust 2023

ZviAug 7, 2023, 1:20 PM
42 points
25 comments57 min readLW link
(thezvi.wordpress.com)

Strength­en­ing the Ar­gu­ment for In­trin­sic AI Safety: The S-Curves Per­spec­tive

avturchinAug 7, 2023, 1:13 PM
8 points
0 comments12 min readLW link

Overview of how AI might ex­ac­er­bate long-run­ning catas­trophic risks

Hauke HillebrandtAug 7, 2023, 11:53 AM
20 points
0 comments11 min readLW link
(aisafetyfundamentals.com)

Drinks at a bar

yakimoffAug 7, 2023, 2:52 AM
3 points
0 comments1 min readLW link

Prob­lems with Robin Han­son’s Quillette Ar­ti­cle On AI

DaemonicSigilAug 6, 2023, 10:13 PM
89 points
33 comments8 min readLW link

Yann LeCun on AGI and AI Safety

Chris_LeongAug 6, 2023, 9:56 PM
37 points
13 comments1 min readLW link
(drive.google.com)

Com­pu­ta­tional Thread Art

CallumMcDougallAug 6, 2023, 9:42 PM
76 points
2 comments6 min readLW link

‘We’re chang­ing the clouds.’ An un­fore­seen test of geo­eng­ineer­ing is fuel­ing record ocean warmth

AnnapurnaAug 6, 2023, 8:58 PM
60 points
6 comments1 min readLW link
(www.science.org)

[Linkpost] Will AI avoid ex­ploita­tion?

cdkgAug 6, 2023, 2:28 PM
22 points
1 comment1 min readLW link

Re­duc­ing the risk of catas­troph­i­cally mis­al­igned AI by avoid­ing the Sin­gle­ton sce­nario: the Many­ton Variant

GravitasGradientAug 6, 2023, 2:24 PM
−6 points
0 comments3 min readLW link

Re­boot­ing AI Gover­nance: An AI-Driven Ap­proach to AI Governance

utilonAug 6, 2023, 2:19 PM
1 point
1 comment29 min readLW link
(forum.effectivealtruism.org)

Model-Based Policy Anal­y­sis un­der Deep Uncertainty

utilonAug 6, 2023, 2:07 PM
16 points
1 comment23 min readLW link
(forum.effectivealtruism.org)

[Question] On be­ing in a bad place and too stub­born to leave.

TeaTieAndHatAug 6, 2023, 11:45 AM
12 points
14 comments3 min readLW link

Safety-First Agents/​Ar­chi­tec­tures Are a Promis­ing Path to Safe AGI

Brendon_WongAug 6, 2023, 8:02 AM
13 points
2 comments12 min readLW link

The Benev­olent Ruler’s Hand­book (Part 1): The Policy Problem

FCCCAug 6, 2023, 3:46 AM
11 points
3 comments4 min readLW link

Ex­plor­ing the Mul­ti­verse of Large Lan­guage Models

frankyAug 6, 2023, 2:38 AM
1 point
0 comments5 min readLW link

Align­ing my web server with de­vops prac­tices: part 2 (se­cu­rity)

VipulNaikAug 6, 2023, 1:30 AM
6 points
0 comments19 min readLW link

how 2 tell if ur in­put is out of dis­tri­bu­tion given only model weights

dkirmaniAug 5, 2023, 10:45 PM
48 points
10 comments1 min readLW link

Sum­mary of Im­prov­ing Global De­ci­sion Mak­ing (around AI)

Will_PearsonAug 5, 2023, 6:46 PM
−7 points
0 comments1 min readLW link

Ground-Truth La­bel Im­bal­ance Im­pairs the Perfor­mance of Con­trast-Con­sis­tent Search (and Other Con­trast-Pair-Based Un­su­per­vised Meth­ods)

Aug 5, 2023, 5:55 PM
6 points
2 comments7 min readLW link
(drive.google.com)

Seat­tle As­tral Codex Ten Monthly Social

a7xAug 5, 2023, 5:55 PM
1 point
0 comments1 min readLW link

AISafety.info’s Writ­ing & Edit­ing Hackathon

smallsiloAug 5, 2023, 5:14 PM
2 points
0 comments1 min readLW link

Join AISafety.info’s Writ­ing & Edit­ing Hackathon (Aug 25-28) (Prizes to be won!)

smallsiloAug 5, 2023, 2:08 PM
19 points
3 comments1 min readLW link
(forum.effectivealtruism.org)

Stomach Ulcers and Den­tal Cavities

MetacelsusAug 5, 2023, 2:08 PM
57 points
7 comments1 min readLW link
(denovo.substack.com)

video games > IQ tests

bhauthAug 5, 2023, 1:27 PM
35 points
46 comments3 min readLW link

[Linkpost] Ap­pli­ca­bil­ity of scal­ing laws to vi­sion en­cod­ing models

Bogdan Ionut CirsteaAug 5, 2023, 11:10 AM
11 points
2 comments1 min readLW link

A Naive Pro­posal for Con­struct­ing In­ter­pretable AI

Chris_LeongAug 5, 2023, 10:32 AM
18 points
6 comments2 min readLW link

ACX Paris Meetup—Au­gust 11 2023

PoignardAzurAug 5, 2023, 9:44 AM
2 points
0 comments1 min readLW link

Meet Hype­r­ion on Sun­day Aug 6?

duck_masterAug 5, 2023, 4:36 AM
1 point
0 comments1 min readLW link

[Question] What are the best pub­lished pa­pers from out­side the al­ign­ment com­mu­nity that are rele­vant to Agent Foun­da­tions?

Stephen FowlerAug 5, 2023, 3:02 AM
20 points
4 comments1 min readLW link

An­nounc­ing Squig­gle Hub

Aug 5, 2023, 1:00 AM
49 points
4 comments5 min readLW link
(forum.effectivealtruism.org)

Read More Books but Pre­tend to Read Even More

Arjun PanicksseryAug 5, 2023, 12:07 AM
26 points
12 comments4 min readLW link
(arjunpanickssery.substack.com)

The Sinews of Su­dan’s Lat­est War

Tim LiptrotAug 4, 2023, 6:17 PM
43 points
12 comments12 min readLW link

Pri­vate notes on LW?

RaemonAug 4, 2023, 5:35 PM
61 points
33 comments1 min readLW link

When train­ing AI, we should es­ca­late the fre­quency of ca­pa­bil­ity tests

Hauke HillebrandtAug 4, 2023, 4:07 PM
2 points
0 comments1 min readLW link

Man­i­fund: What we’re fund­ing (weeks 2-4)

Austin ChenAug 4, 2023, 4:00 PM
44 points
2 commentsLW link
(manifund.substack.com)

[Linkpost] Mul­ti­modal Neu­rons in Pre­trained Text-Only Transformers

Bogdan Ionut CirsteaAug 4, 2023, 3:29 PM
11 points
0 comments1 min readLW link

Apollo Re­search is hiring evals and in­ter­pretabil­ity en­g­ineers & scientists

Marius HobbhahnAug 4, 2023, 10:54 AM
25 points
0 comments2 min readLW link

[Question] Has any­one tried cre­at­ing a YouTube or TikTok se­ries cov­er­ing the se­quences?

Max RossiAug 4, 2023, 12:10 AM
4 points
4 comments1 min readLW link

[Question] Is there any met­ric mea­sur­ing ~”pro­por­tion of peo­ple cre­at­ing ex­tra value”?

Amal Aug 3, 2023, 10:54 PM
7 points
3 comments1 min readLW link