Search­ing for Search­ing for Search

Rubi J. Hudson14 Feb 2024 23:51 UTC
21 points
4 comments7 min readLW link

Some ques­tions for the peo­ple at 80,000 Hours

yanni kyriacos14 Feb 2024 23:15 UTC
1 point
0 comments1 min readLW link
(forum.effectivealtruism.org)

Dis­rupt­ing mal­i­cious uses of AI by state-af­fili­ated threat actors

agucova14 Feb 2024 21:28 UTC
11 points
2 comments1 min readLW link
(openai.com)

Cri­tiques of the AI con­trol agenda

Jozdien14 Feb 2024 19:25 UTC
47 points
14 comments9 min readLW link

Bad busi­ness advice

Logan Kieller14 Feb 2024 17:01 UTC
12 points
2 comments3 min readLW link
(logankieller.substack.com)

Ex­am­ples of gov­ern­ments do­ing good in house (or con­tracted) tech­ni­cal research

NathanBarnard14 Feb 2024 16:22 UTC
12 points
2 comments2 min readLW link

[Question] How can we legally/​ille­gally en­hance the progress of the law of ac­cel­er­at­ing re­turns in AI learn­ing?

Gabi QUENE14 Feb 2024 11:06 UTC
−25 points
0 comments1 min readLW link

[Question] What ex­per­i­ment set­tles the Gary Mar­cus vs Ge­offrey Hin­ton de­bate?

Valentin Baltadzhiev14 Feb 2024 9:06 UTC
12 points
8 comments1 min readLW link

[Question] Op­ti­miz­ing for Agency?

Michael Soareverix14 Feb 2024 8:31 UTC
8 points
4 comments2 min readLW link

Re­quire­ments for a Basin of At­trac­tion to Alignment

RogerDearnaley14 Feb 2024 7:10 UTC
21 points
6 comments31 min readLW link

FTX ex­pects to re­turn all cus­tomer money; claw­backs may go away

Mikhail Samin14 Feb 2024 3:43 UTC
33 points
1 comment1 min readLW link
(www.nytimes.com)

Scale Was All We Needed, At First

Gabe M14 Feb 2024 1:49 UTC
266 points
31 comments8 min readLW link
(aiacumen.substack.com)

CFAR Take­aways: An­drew Critch

Raemon14 Feb 2024 1:37 UTC
215 points
62 comments5 min readLW link

Meetup In a Box: Year In Review

Czynski14 Feb 2024 1:18 UTC
26 points
0 comments4 min readLW link

An EA used de­cep­tive mes­sag­ing to ad­vance their pro­ject; we need mechanisms to avoid de­on­tolog­i­cally du­bi­ous plans

Mikhail Samin13 Feb 2024 23:15 UTC
16 points
1 comment1 min readLW link

Use­ful start­ing code for interpretability

eggsyntax13 Feb 2024 23:13 UTC
19 points
2 comments1 min readLW link

Masterpiece

Richard_Ngo13 Feb 2024 23:10 UTC
147 points
20 comments4 min readLW link
(www.narrativeark.xyz)

A Bridge Between Utili­tar­i­anism & Stoicism

Jonathan Moregård13 Feb 2024 22:46 UTC
5 points
0 comments5 min readLW link
(honestliving.substack.com)

The “con­text win­dow” anal­ogy for hu­man minds

Ruby13 Feb 2024 19:29 UTC
26 points
0 comments2 min readLW link

More on the Ap­ple Vi­sion Pro

Zvi13 Feb 2024 17:40 UTC
33 points
5 comments8 min readLW link
(thezvi.wordpress.com)

Lin­ear White

Teja Prabhu13 Feb 2024 16:31 UTC
−3 points
3 comments3 min readLW link
(krez.expert)

Causal­ity is Everywhere

silentbob13 Feb 2024 13:44 UTC
25 points
12 comments8 min readLW link

Tech­nolo­gies and Ter­minol­ogy: AI isn’t Soft­ware, it’s… Deep­ware?

13 Feb 2024 13:37 UTC
40 points
9 comments8 min readLW link

Be­come a Bet­ter Sto­ry­tel­ler by...Pausing

Declan Molony13 Feb 2024 7:59 UTC
3 points
3 comments2 min readLW link

[Question] LessWrong Is Very Wrong: Ul­ti­mately All So­cial Me­dia Plat­forms Are The Same

Amritesh Kumar13 Feb 2024 6:53 UTC
−16 points
2 comments1 min readLW link

Lsusr’s Ra­tion­al­ity Dojo

lsusr13 Feb 2024 5:52 UTC
97 points
17 comments3 min readLW link

[Question] Where is the Town Square?

Gretta Duleba13 Feb 2024 3:53 UTC
43 points
8 comments1 min readLW link

My cover story in Ja­cobin on AI cap­i­tal­ism and the x-risk debates

garrison12 Feb 2024 23:34 UTC
91 points
5 comments1 min readLW link
(jacobin.com)

What is On­tol­ogy?

martinkunev12 Feb 2024 23:01 UTC
4 points
0 comments4 min readLW link

Thank you for trig­ger­ing me

Cissy12 Feb 2024 20:09 UTC
5 points
1 comment6 min readLW link
(www.moremyself.xyz)

In­ter­pret­ing Quan­tum Me­chan­ics in In­fra-Bayesian Physicalism

Yegreg12 Feb 2024 18:56 UTC
28 points
6 comments32 min readLW link

I played the AI box game as the Gate­keeper — and lost

datawitch12 Feb 2024 18:39 UTC
24 points
51 comments4 min readLW link

The Last Laugh: Ex­plor­ing the Role of Hu­mor as a Bench­mark for Large Lan­guage Models

Greg Robison12 Feb 2024 18:34 UTC
4 points
5 comments11 min readLW link

Nat­u­ral ab­strac­tions are ob­server-de­pen­dent: a con­ver­sa­tion with John Wentworth

Martín Soto12 Feb 2024 17:28 UTC
38 points
13 comments7 min readLW link

Tort Law Can Play an Im­por­tant Role in Miti­gat­ing AI Risk

Gabriel Weil12 Feb 2024 17:17 UTC
37 points
9 comments5 min readLW link

On the Pro­posed Cal­ifor­nia SB 1047

Zvi12 Feb 2024 16:40 UTC
45 points
17 comments12 min readLW link
(thezvi.wordpress.com)

Thoughts on “The Offense-Defense Balance Rarely Changes”

Cullen12 Feb 2024 3:26 UTC
46 points
4 comments1 min readLW link

Skep­ti­cism About Deep­Mind’s “Grand­mas­ter-Level” Chess Without Search

Arjun Panickssery12 Feb 2024 0:56 UTC
53 points
13 comments3 min readLW link

[Question] What are the known difficul­ties with this al­ign­ment ap­proach?

tailcalled11 Feb 2024 22:52 UTC
18 points
24 comments1 min readLW link

[Question] What are the de­cid­ing fac­tors of hu­man cog­ni­tive en­durance?

koratkar11 Feb 2024 21:56 UTC
22 points
3 comments1 min readLW link

Carl Shul­man On Dwarkesh Pod­cast June 2023

Moonicker11 Feb 2024 21:02 UTC
12 points
0 comments159 min readLW link

How do you ac­tu­ally ob­tain and re­port a like­li­hood func­tion for sci­en­tific re­search?

Peter Berggren11 Feb 2024 17:42 UTC
55 points
4 comments1 min readLW link

The en­tropy maxim for bi­nary questions

dkl911 Feb 2024 17:17 UTC
2 points
1 comment1 min readLW link
(dkl9.net)

GPT2XL_RLLMv3 vs. Bet­terDAN, AI Machi­avelli & Oppo Jailbreaks

MiguelDev11 Feb 2024 11:03 UTC
16 points
4 comments14 min readLW link

[Question] What’s the the­ory of im­pact for ac­ti­va­tion vec­tors?

Chris_Leong11 Feb 2024 7:34 UTC
57 points
12 comments1 min readLW link

Ex­per­i­ment­ing With Foot­board Piezos

jefftk11 Feb 2024 3:00 UTC
11 points
2 comments2 min readLW link
(www.jefftk.com)

The Core Values of Life—A pro­posal for a uni­ver­sal the­ory of ethics

Thomas Gjøstøl10 Feb 2024 21:48 UTC
2 points
4 comments18 min readLW link

And All the Shog­goths Merely Players

Zack_M_Davis10 Feb 2024 19:56 UTC
139 points
57 comments12 min readLW link

Sam Alt­man’s Chip Am­bi­tions Un­der­cut OpenAI’s Safety Strategy

garrison10 Feb 2024 19:52 UTC
198 points
52 comments1 min readLW link
(garrisonlovely.substack.com)

The lat­tice of par­tial updatelessness

Martín Soto10 Feb 2024 17:34 UTC
21 points
5 comments5 min readLW link