Making a research platform for AI Alignment at https://ai-plans.com/
Come critique AI Alignment plans and get feedback on your alignment plan!
Iknownothing
AI-Plans.com—a contributable compendium
Review of Alignment Plan Critiques: December AI-Plans Critique-a-Thon Results
An Ignorant View on Ineffectiveness of AI Safety
Critique-a-Thon of AI Alignment Plans
Even briefer summary of ai-plans.com
Brief summary of ai-plans.com
No. Humans are not large networks that can be quickly and easily controlled. Among many, many other differences.
AI-Plans.com 10-day Critique-a-Thon
When I say media, I mean social media, movies, videos, books, etc.: any type of recording that you're consuming as entertainment.
I'm trying this myself. I've done single days before, sometimes 2 or 3 in a row, but failed to keep it consistent. I did find that when I stuck to it, my work output was far higher and of much greater quality, I had a much better sleep schedule, and I was generally in a much more enjoyable mood.
I also ended up spending more time with friends and family, meeting new people, trying interesting things, spending time outdoors, etc.
This time I'm building up to it: starting with one media-free hour a day, then two hours, then three, and so on.
I think building up to it will let me form new habits that stick better.
A challenge for folks interested: spend 2 weeks without media-based entertainment.
It’s not directly about AGI, no. But it could be a way to change a skeptic’s mind about AI risk. Which could be useful if they’re a regulator/politician.
This plan originated from the idea of trying to have a hackathon to disprove alignment plans. I’m still very interested in that!
Looking for judges for critiques of Alignment Plans
AI Law-a-Thon
Do you disagree with doomerism as a mindset, or as a factual likelihood? Or both?
I think doomerism as a mindset isn't great, but in terms of likelihood, there are ~3 things likely to kill humanity at the moment, with AI being the first.
[Question] Specific Arguments against open source LLMs?
Simple alignment plan that maybe works
I'm generally disincentivized from posting, or putting effort into a post, by a system where someone can just heavily downvote it without even giving a reason.
“CESI’s Artificial Intelligence Standardization White Paper released in 2018 states that “AI systems that have a direct impact on the safety of humanity and the safety of life, and may constitute threats to humans” must be regulated and assessed, suggesting a broad threat perception (Section 4.5.7).42 In addition, a TC260 white paper released in 2019 on AI safety/security worries that “emergence” (涌现性) by AI algorithms can exacerbate the black box effect and “autonomy” can lead to algorithmic “self-improvement” (Section 3.2.1.3).43”
From https://concordia-consulting.com/wp-content/uploads/2023/10/State-of-AI-Safety-in-China.pdf