I’m bumping into walls, but hey, now I know what the maze looks like.
Neil
The smallest possible button (or: moth traps!)
If you are too stressed, walk away from the front lines
“Natural is better” is a valuable heuristic
Some rules for life (v0.0)
You don’t get to have cool flaws
Consequentialism is a compass, not a judge
The Sequences on YouTube
[Question] How does it feel to switch from earn-to-give?
I’m French. “Pétard” is a very minor swear word, on par with “Great Scott!”
It’s not meant as an insult at all. The most common French swear word is probably “putain” (used like “fuck” is), and “pétard” is used as an attenuated version (like saying “fudge”).
(As a Frenchman, I also admit to the existence of a writhing snake inside my gut telling me to downvote this heretical post, which dares to compare French cuisine with German cuisine. Luckily, I have learned enough rationality to override my primal instincts.)
Privacy and writing
Taboo “procrastination”
Puffer-pope reality check
Politics are not serious by default
You can rack up massive amounts of data quickly by asking questions to all your friends
Detachment vs attachment [AI risk and mental health]
Very insightful post. Here are some personal thoughts with low epistemic status and high rambling potential:
These all feel to me like corollaries of the belief “AGI is so important that I can’t gauge the value of anything else except in terms of how it affects AGI”. Hence: “everything else is meaningless because AGI will change everything soon” or “nobody around me is looking up at the meteor about to hit us, and that makes me feel kind of insane. (*Cough* so I hang out with rationalists, whose entire shtick is learning how not to be insane.)”
As for other non-obvious effects: I personally sense a kind of perceived fragility around the whole field. There are arguments on this site for why AGI alignment should not be discussed in politics, or why attempting to convince OpenAI or DeepMind employees to switch jobs can easily backfire (e.g. this post, for caution advice). These make any outreach at all seem risky. There are also people I know wondering whether they should attempt to do anything related to alignment at all, because they perceive themselves as probable dead weights.

The relatively short timelines, the sheer scope, and the aura of impossibility around alignment seem to make people more cautious than they should be. Obviously the whole point of the field is to be cautious; but while it’s true that the tried-and-tested scientific method isn’t safe for AGI in general, I’m not sure stressing the rationalist-tools solve-problems-before-you-experiment approach is healthy everywhere. Caution is right there in the description of the field, but you have to contain it well, so that it doesn’t infect places where it would do you good to be reckless and use trial and error. I am probably quite wrong about this, but I don’t see many people talking about it, so if there’s any reasonable doubt we should figure it out.
Alignment work should probably be perceived as less fragile. Unlike the AI field in general, alignment projects specifically don’t pose much of a risk to the world, so we can probably afford to be looser here than elsewhere. Right now, though, alignment feels to me like a pack of delicate butterflies flying together, with every flap of wings sending dozens of comrades spiraling out of the sky, which might or might not set off a domino/Rube Goldberg machine that blows up the world.
Bonus song in “I have been a good Bing”: “Claude’s Anguish”, a 3-minute death-metal song whose lyrics were written by Claude when prompted with “how does the AI feel?”: https://app.suno.ai/song/40fb1218-18fa-434a-a708-1ce1e2051bc2/ (not for the faint of heart)
AI as a natural disaster
Interesting, thanks for posting that! One of the reasons I like this forum is that there are people running around on here who’ve read papers like “Salivary Digestion Extends the Range of Sugar-Aversions in the German Cockroach”, and you get to talk to them for free.
So, if I understand the abstract and my skim of the paper correctly: we’re seeing more saliva-based aversion to pure glucose because pure glucose is a superstimulus (the roaches still accept “complex glucose”), and human trap designs are fond of superstimuli as cheap ways to radically increase the probability that a trap works, so the traps end up selecting for pure-glucose aversion. Given how short insect reproductive cycles are, and how many insects there are, we’ll probably observe this kind of evolution everywhere, every time we switch traps.
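(To gesture at how fast that selection could run, here’s a toy model. Everything in it is made up for illustration: the 20% trap mortality, the starting frequency, and the one-generation-per-step assumption are not from the paper; real German cockroach generations take a couple of months, and real trap mortality varies.)

```ts
// Toy model, illustrative numbers only: frequency of a glucose-aversion
// trait under trap-driven selection. Each step is one generation, and
// traps kill an extra fraction of non-averse roaches per generation.
function simulateAversion(
  generations: number,
  initialFreq: number, // starting frequency of averse roaches
  trapMortality: number // extra death rate for non-averse roaches
): number {
  let p = initialFreq;
  for (let g = 0; g < generations; g++) {
    const averse = p; // relative fitness 1
    const nonAverse = (1 - p) * (1 - trapMortality); // reduced fitness
    p = averse / (averse + nonAverse); // renormalize after selection
  }
  return p;
}

// Even a modest 20% extra mortality takes aversion from 1% of the
// population to roughly 97% within 36 generations.
console.log(simulateAversion(36, 0.01, 0.2).toFixed(2)); // "0.97"
```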
A functionality I’d like to see on LessWrong: the ability to give quick feedback on a post in the same way you can react to comments. When you strong-upvote or strong-downvote a post, a little popup menu would appear offering some basic feedback options. The feedback would be private, visible only to the author.
I’ve often found myself drowning in downvotes or upvotes without knowing why. Karma is a one-dimensional measure, and writing a public comment is a trivial inconvenience; this is an attempt at a middle ground, and I expect it would make post reception clearer.
My crude diagrams are below.
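And, to make the popup concrete beyond the diagrams, here’s a minimal front-end sketch. Everything in it is hypothetical: the component, the feedback options, and the `sendPrivateFeedback` endpoint are my inventions, not part of LessWrong’s actual codebase; I’m only assuming a React/TypeScript stack.

```tsx
// Hypothetical sketch only: none of these identifiers exist on LessWrong.
import React, { useState } from "react";

const FEEDBACK_OPTIONS = [
  "Unclear writing",
  "Weak arguments",
  "Great examples",
  "Changed my mind",
] as const;

type Feedback = (typeof FEEDBACK_OPTIONS)[number];

// Assumed API: stores feedback that only the post's author can read.
async function sendPrivateFeedback(postId: string, feedback: Feedback) {
  await fetch(`/api/posts/${postId}/private-feedback`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ feedback }),
  });
}

// Shown right after the user casts a strong upvote or strong downvote.
export function PostFeedbackPopup({ postId }: { postId: string }) {
  const [sent, setSent] = useState(false);
  if (sent) return <span>Feedback sent privately to the author.</span>;
  return (
    <div role="menu">
      {FEEDBACK_OPTIONS.map((option) => (
        <button
          key={option}
          onClick={async () => {
            await sendPrivateFeedback(postId, option);
            setSent(true);
          }}
        >
          {option}
        </button>
      ))}
    </div>
  );
}
```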