RSS

Iknownothing

Karma: 80

Making a research platform for AI Alignment at https://​​ai-plans.com/​​
Come critique AI Alignment plans and get feedback on your alignment plan!

AI Law-a-Thon

Iknownothing28 Jan 2024 2:30 UTC
5 points
3 comments1 min readLW link

Re­view of Align­ment Plan Cri­tiques- De­cem­ber AI-Plans Cri­tique-a-Thon Re­sults

Iknownothing15 Jan 2024 19:37 UTC
24 points
0 comments25 min readLW link
(aiplans.substack.com)

Cri­tique-a-Thon of AI Align­ment Plans

Iknownothing5 Dec 2023 20:50 UTC
12 points
3 comments1 min readLW link

Pro­posal for im­prov­ing state of al­ign­ment research

Iknownothing6 Nov 2023 13:55 UTC
2 points
0 comments1 min readLW link

Look­ing for judges for cri­tiques of Align­ment Plans

Iknownothing17 Aug 2023 22:35 UTC
5 points
0 comments1 min readLW link

[Question] Spe­cific Ar­gu­ments against open source LLMs?

Iknownothing30 Jul 2023 14:27 UTC
4 points
2 comments1 min readLW link

AI-Plans.com 10-day Cri­tique-a-Thon

Iknownothing27 Jul 2023 11:44 UTC
8 points
2 comments2 min readLW link
(manifund.org)

Sim­ple al­ign­ment plan that maybe works

Iknownothing18 Jul 2023 22:48 UTC
4 points
8 comments1 min readLW link

Even briefer sum­mary of ai-plans.com

Iknownothing16 Jul 2023 23:25 UTC
10 points
6 comments2 min readLW link
(www.ai-plans.com)

LeCun says mak­ing a util­ity func­tion is intractable

Iknownothing28 Jun 2023 18:02 UTC
2 points
3 comments1 min readLW link

Brief sum­mary of ai-plans.com

Iknownothing28 Jun 2023 0:33 UTC
9 points
4 comments2 min readLW link
(ai-plans.com)

An overview of the points system

Iknownothing27 Jun 2023 9:09 UTC
3 points
4 comments1 min readLW link
(ai-plans.com)

AI-Plans.com—a con­tributable compendium

Iknownothing25 Jun 2023 14:40 UTC
39 points
7 comments4 min readLW link
(ai-plans.com)

A more effec­tive Ele­va­tor Pitch for AI risk

Iknownothing15 Jun 2023 12:39 UTC
2 points
0 comments1 min readLW link

A more grounded idea of AI risk

Iknownothing11 May 2023 9:48 UTC
3 points
4 comments1 min readLW link

An Ig­no­rant View on Ineffec­tive­ness of AI Safety

Iknownothing7 Jan 2023 1:29 UTC
14 points
7 comments3 min readLW link