[Question] What ML gears do you like?

Ulisse MiniNov 11, 2023, 7:10 PM
25 points
4 comments1 min readLW link

Smart Ses­sions—Fi­nally a (kinda) win­dow-cen­tric ses­sion manager

Eli TyreNov 11, 2023, 6:54 PM
14 points
3 comments5 min readLW link

AISC pro­ject: Satis­fIA – AI that satis­fies with­out over­do­ing it

Jobst HeitzigNov 11, 2023, 6:22 PM
12 points
0 comments1 min readLW link
(docs.google.com)

Con­trol Sym­me­try: why we might want to start in­ves­ti­gat­ing asym­met­ric al­ign­ment interventions

domenicrosatiNov 11, 2023, 5:27 PM
25 points
1 comment2 min readLW link

Game The­ory with­out Argmax [Part 2]

Cleo NardoNov 11, 2023, 4:02 PM
31 points
14 comments13 min readLW link

Game The­ory with­out Argmax [Part 1]

Cleo NardoNov 11, 2023, 3:59 PM
70 points
18 comments19 min readLW link

It’s OK to be bi­ased to­wards humans

dr_sNov 11, 2023, 11:59 AM
54 points
69 comments6 min readLW link

The Top AI Safety Bets for 2023: GiveWiki’s Lat­est Recommendations

Dawn DrescherNov 11, 2023, 9:04 AM
3 points
2 commentsLW link

Ar­tifi­cial Gen­eral Horsiness

robotelvisNov 11, 2023, 5:15 AM
4 points
0 comments5 min readLW link
(messyprogress.substack.com)

Pal­isade is hiring Re­search Engineers

Nov 11, 2023, 3:09 AM
23 points
0 comments3 min readLW link

Open Phil re­leases RFPs on LLM Bench­marks and Forecasting

LawrenceCNov 11, 2023, 3:01 AM
53 points
0 comments2 min readLW link
(www.openphilanthropy.org)

Memo on some ne­glected topics

Lukas FinnvedenNov 11, 2023, 2:01 AM
28 points
2 commentsLW link
(open.substack.com)

Who is Sam Bankman-Fried (SBF) re­ally, and how could he have done what he did? - three the­o­ries and a lot of evidence

spencergNov 11, 2023, 1:04 AM
36 points
28 commentsLW link
(www.spencergreenberg.com)

Sur­vey on the ac­cel­er­a­tion risks of our new RFPs to study LLM capabilities

Ajeya CotraNov 10, 2023, 11:59 PM
27 points
1 commentLW link

Rat Fest 2024

LoganChipkinNov 10, 2023, 11:25 PM
7 points
6 comments1 min readLW link

How I Think, Part Three: Weigh­ing Cryonics

Richard HenageNov 10, 2023, 10:21 PM
4 points
1 comment2 min readLW link

Lin­ear en­cod­ing of char­ac­ter-level in­for­ma­tion in GPT-J to­ken embeddings

Nov 10, 2023, 10:19 PM
34 points
4 comments28 min readLW link

Fol­low-up sur­vey: inositol

ElizabethNov 10, 2023, 7:30 PM
13 points
1 comment1 min readLW link
(acesounderglass.com)

We have promis­ing al­ign­ment plans with low taxes

Seth HerdNov 10, 2023, 6:51 PM
44 points
9 comments5 min readLW link

[Question] Vec­tor search on a large dataset?

camsdixonNov 10, 2023, 6:43 PM
−1 points
2 comments1 min readLW link

About Me

Abe DillonNov 10, 2023, 6:32 PM
3 points
0 comments1 min readLW link

Me­tac­u­lus In­tro­duces AI-Pow­ered Com­mu­nity In­sights to Re­veal Fac­tors Driv­ing User Forecasts

ChristianWilliamsNov 10, 2023, 5:57 PM
6 points
0 commentsLW link
(www.metaculus.com)

Joy in the Here and Real

ScrewtapeNov 10, 2023, 5:22 PM
18 points
0 comments2 min readLW link

Arte­facts gen­er­ated by mode col­lapse in GPT-4 Turbo serve as ad­ver­sar­ial at­tacks.

Sohaib ImranNov 10, 2023, 3:23 PM
11 points
0 comments2 min readLW link

Wastew­a­ter RNA Read Lengths

jefftkNov 10, 2023, 3:20 PM
13 points
0 comments4 min readLW link
(www.jefftk.com)

Up­date on the UK AI Sum­mit and the UK’s Plans

Elliot MckernonNov 10, 2023, 2:47 PM
11 points
0 comments8 min readLW link

Liv Bo­eree Ted Talk Moloch & AI

Neil Nov 10, 2023, 2:04 PM
10 points
2 comments1 min readLW link
(m.youtube.com)

Pick­ing Men­tors For Re­search Programmes

Raymond DouglasNov 10, 2023, 1:01 PM
105 points
8 comments4 min readLW link

GPT-2030 and Catas­trophic Drives: Four Vignettes

jsteinhardtNov 10, 2023, 7:30 AM
50 points
5 comments10 min readLW link
(bounded-regret.ghost.io)

Crock, Crocker, Crockiest

ScrewtapeNov 10, 2023, 6:14 AM
21 points
4 comments6 min readLW link

AI Timelines

Nov 10, 2023, 5:28 AM
300 points
135 comments51 min readLW link2 reviews

ACI#6: A Non-Dual­is­tic ACI Model

Akira PyinyaNov 9, 2023, 11:01 PM
10 points
2 comments6 min readLW link

How I got so ex­cited about HowTruthful

Bruce LewisNov 9, 2023, 6:49 PM
17 points
3 comments5 min readLW link

The case for “Gen­er­ous Tit for Tat” as the ul­ti­mate game the­ory strategy

positivesumNov 9, 2023, 6:41 PM
2 points
3 comments8 min readLW link
(tryingtruly.substack.com)

Text Posts from the Kids Group: 2021

jefftkNov 9, 2023, 5:50 PM
38 points
1 comment8 min readLW link
(www.jefftk.com)

AI #37: Mov­ing Too Fast

ZviNov 9, 2023, 5:50 PM
53 points
5 comments76 min readLW link
(thezvi.wordpress.com)

Learn­ing-the­o­retic agenda read­ing list

Vanessa KosoyNov 9, 2023, 5:25 PM
103 points
1 comment2 min readLW link1 review

​​ Open-ended/​Phenom­e­nal ​Ethics ​(TLDR)

Ryo Nov 9, 2023, 4:58 PM
3 points
0 comments1 min readLW link

Poly­se­man­tic At­ten­tion Head in a 4-Layer Transformer

Nov 9, 2023, 4:16 PM
51 points
0 comments6 min readLW link

On OpenAI Dev Day

ZviNov 9, 2023, 4:10 PM
60 points
0 comments15 min readLW link
(thezvi.wordpress.com)

An­trop­i­cal Prob­a­bil­ities Are Fully Ex­plained by Differ­ence in Pos­si­ble Outcomes

Ape in the coatNov 9, 2023, 3:34 PM
19 points
7 comments5 min readLW link

A free to en­ter, 240 char­ac­ter, open-source iter­ated pris­oner’s dilemma tournament

Isaac KingNov 9, 2023, 8:24 AM
64 points
19 comments1 min readLW link
(manifold.markets)

Into AI Safety Epi­sodes 1 & 2

jacobhaimesNov 9, 2023, 4:36 AM
2 points
0 comments1 min readLW link
(into-ai-safety.github.io)

Mak­ing Bad De­ci­sions On Purpose

ScrewtapeNov 9, 2023, 3:36 AM
49 points
8 comments5 min readLW link

Me­tac­u­lus’s New Side­bar Helps You Find Fore­casts Faster

ChristianWilliamsNov 8, 2023, 8:56 PM
15 points
0 commentsLW link
(www.metaculus.com)

Open-ended ethics of phe­nom­ena (a desider­ata with uni­ver­sal moral­ity)

Ryo Nov 8, 2023, 8:10 PM
1 point
0 comments8 min readLW link

Open Agency model can solve the AI reg­u­la­tion dilemma

Roman LeventovNov 8, 2023, 8:00 PM
22 points
1 comment2 min readLW link

Gothen­burg LW /​ ACX meetup

StefanNov 8, 2023, 7:52 PM
1 point
0 comments1 min readLW link

[Question] Why is less­wrong block­ing wget and curl (scrape)?

nick lacombeNov 8, 2023, 7:42 PM
21 points
15 comments1 min readLW link

[Question] Is there a less­wrong archive of all pub­lic posts?

nick lacombeNov 8, 2023, 7:26 PM
12 points
7 comments1 min readLW link