[Question] Train­ing a RL Model with Con­tin­u­ous State & Ac­tion Space in a Real-World Scenario

Alexander RiesOct 15, 2023, 10:59 PM
0 points
0 comments1 min readLW link

On Fre­quen­tism and Bayesian Dogma

Oct 15, 2023, 10:23 PM
59 points
27 comments6 min readLW link

More or Fewer Fights over Prin­ci­ples and Values?

Oct 15, 2023, 9:35 PM
24 points
10 comments14 min readLW link

Map­ping ChatGPT’s on­tolog­i­cal land­scape, gra­di­ents and choices [in­ter­pretabil­ity]

Bill BenzonOct 15, 2023, 8:12 PM
1 point
0 comments18 min readLW link

The Hid­den Per­ils of Hydrogen

Yudhister KumarOct 15, 2023, 7:51 PM
17 points
3 comments3 min readLW link
(ykumar.org)

Ar­gu­ments for op­ti­mism on AI Align­ment (I don’t en­dorse this ver­sion, will re­u­pload a new ver­sion soon.)

Noosphere89Oct 15, 2023, 2:51 PM
28 points
129 comments25 min readLW link

Hyper­re­als in a Nutshell

Yudhister KumarOct 15, 2023, 2:23 PM
35 points
27 comments5 min readLW link
(ykumar.org)

Dis­cov­er­ing La­tent Knowl­edge in the Hu­man Brain: Part 1 – Clar­ify­ing the con­cepts of be­lief and knowledge

Joseph EmersonOct 15, 2023, 9:02 AM
5 points
0 comments12 min readLW link

[Question] Ra­tion­al­ist hor­ror movies

ElizabethOct 15, 2023, 7:42 AM
46 points
35 comments1 min readLW link

Unity Gridworlds

WillPetilloOct 15, 2023, 4:36 AM
9 points
0 comments1 min readLW link

In mem­ory of Louise Glück

Joe CarlsmithOct 15, 2023, 2:59 AM
41 points
1 comment8 min readLW link

Book Re­view: In­visi­ble China

Yudhister KumarOct 14, 2023, 9:51 PM
4 points
0 comments4 min readLW link
(ykumar.org)

Book Re­view: Rad­i­cal Markets

Yudhister KumarOct 14, 2023, 9:41 PM
12 points
0 comments15 min readLW link
(ykumar.org)

[Question] One-on-one tu­tor­ing for any subject

yakimoffOct 14, 2023, 8:58 PM
8 points
5 comments1 min readLW link

The Pu­ri­tans would one-box: ev­i­den­tial de­ci­sion the­ory in the 17th century

Jacob G-WOct 14, 2023, 8:23 PM
86 points
5 comments3 min readLW link
(jacobgw.com)

Nat­u­ral Ab­strac­tion: Con­ver­gent Prefer­ences Over In­for­ma­tion Structures

paulomOct 14, 2023, 6:34 PM
28 points
1 comment36 min readLW link

ChatGPT tells 20 ver­sions of its pro­to­typ­i­cal story, with a short note on method

Bill BenzonOct 14, 2023, 3:27 PM
7 points
0 comments5 min readLW link

Will no one rid me of this tur­bu­lent pest?

MetacelsusOct 14, 2023, 3:27 PM
154 points
23 comments10 min readLW link
(denovo.substack.com)

Which Anaes­thetic To Choose?

dadadarrenOct 14, 2023, 2:55 PM
10 points
15 comments1 min readLW link

Is the Wave non-dis­par­age­ment thingy okay?

Oct 14, 2023, 5:31 AM
29 points
13 comments11 min readLW link

The Gods of Straight Lines

Richard_NgoOct 14, 2023, 4:10 AM
69 points
13 comments5 min readLW link
(www.narrativeark.xyz)

Eight Magic Lamps

Richard_NgoOct 14, 2023, 4:10 AM
41 points
0 comments6 min readLW link
(www.narrativeark.xyz)

RSPs are pauses done right

evhubOct 14, 2023, 4:06 AM
164 points
73 comments7 min readLW link1 review

Dishon­or­able Gos­sip and Go­ing Crazy

Oct 14, 2023, 4:00 AM
29 points
31 comments23 min readLW link

Disen­tan­gling Our Ter­mi­nal and In­stru­men­tal Values

PeterMcCluskeyOct 14, 2023, 3:35 AM
11 points
1 comment4 min readLW link
(bayesianinvestor.com)

Global Pause AI Protest 10/​21

Oct 14, 2023, 3:20 AM
5 points
0 comments1 min readLW link

[Question] Liter­a­ture On Ex­is­ten­tial Risk From At­mo­spheric Con­tam­i­na­tion?

YitzOct 13, 2023, 10:27 PM
6 points
3 comments1 min readLW link

How to par­ti­tion teams to move fast? De­bat­ing “low-di­men­sional cuts”

Oct 13, 2023, 9:43 PM
41 points
2 comments11 min readLW link

Gothen­burg LW /​ ACX meetup

StefanOct 13, 2023, 9:39 PM
2 points
0 comments1 min readLW link

Meta-Regulations

SableOct 13, 2023, 9:23 PM
18 points
5 comments10 min readLW link
(affablyevil.substack.com)

Hiring: Lighthaven Events & Venue Lead

RaemonOct 13, 2023, 9:02 PM
69 points
3 comments4 min readLW link

Pre­dic­tion mar­kets cov­ered in the NYT pod­cast “Hard Fork”

Austin ChenOct 13, 2023, 6:43 PM
56 points
6 comments9 min readLW link
(www.nytimes.com)

[Paper] All’s Fair In Love And Love: Copy Sup­pres­sion in GPT-2 Small

Oct 13, 2023, 6:32 PM
82 points
4 comments8 min readLW link

FLI pod­cast se­ries, “Imag­ine A World”, about as­pira­tional fu­tures with AGI

Jackson WagnerOct 13, 2023, 4:07 PM
9 points
0 comments4 min readLW link

To open-source or to not open-source, that is (an over­sim­plifi­ca­tion of) the ques­tion.

Justin BullockOct 13, 2023, 3:10 PM
12 points
5 comments5 min readLW link

Com­bi­na­tion Lock Boxes

jefftkOct 13, 2023, 12:50 PM
17 points
9 comments1 min readLW link
(www.jefftk.com)

Cir­cle of Sup­port (Oct 14th @ 10am PST)

AlexeiOct 13, 2023, 9:24 AM
19 points
1 comment1 min readLW link

[Question] How can the world han­dle the HAMAS situ­a­tion?

AnnapurnaOct 13, 2023, 9:15 AM
5 points
43 comments1 min readLW link

UVic AI Ethics Conference

Oct 13, 2023, 7:31 AM
3 points
1 comment1 min readLW link

LW UI fea­tures you might not have tried

ElizabethOct 13, 2023, 3:04 AM
49 points
6 comments1 min readLW link

Re­vis­it­ing Guide Dogs and Blind­ness Prevention

jefftkOct 13, 2023, 2:30 AM
22 points
0 comments2 min readLW link
(www.jefftk.com)

Paper: Un­der­stand­ing and Con­trol­ling a Maze-Solv­ing Policy Network

Oct 13, 2023, 1:38 AM
70 points
0 comments1 min readLW link
(arxiv.org)

OPTIC: An­nounc­ing In­ter­col­le­giate Fore­cast­ing Tour­na­ments in SF, DC, Boston

Oct 13, 2023, 1:36 AM
6 points
0 comments1 min readLW link

Progress links di­gest, 2023-10-12: Dyson sphere ther­mo­dy­nam­ics and a cure for cavities

jasoncrawfordOct 13, 2023, 12:41 AM
15 points
1 comment10 min readLW link
(rootsofprogress.org)

What do Marginal Grants at EAIF Look Like? Fund­ing Pri­ori­ties and Grant­mak­ing Thresh­olds at the EA In­fras­truc­ture Fund

LinchOct 12, 2023, 9:40 PM
20 points
0 comments5 min readLW link

unRLHF—Effi­ciently un­do­ing LLM safeguards

Oct 12, 2023, 7:58 PM
117 points
15 comments20 min readLW link

LoRA Fine-tun­ing Effi­ciently Un­does Safety Train­ing from Llama 2-Chat 70B

Oct 12, 2023, 7:58 PM
151 points
29 comments14 min readLW link

[Question] Look­ing for read­ing recom­men­da­tions: The­o­ries of right/​jus­tice that safe­guard against hav­ing one’s job au­to­mated?

bulKlubOct 12, 2023, 7:40 PM
−1 points
1 comment1 min readLW link

The In­ter­na­tional PauseAI Protest: Ac­tivism un­der uncertainty

Joseph MillerOct 12, 2023, 5:36 PM
32 points
1 comment4 min readLW link

AI #33: Cool New In­ter­pretabil­ity Paper

ZviOct 12, 2023, 4:20 PM
46 points
18 comments46 min readLW link
(thezvi.wordpress.com)