Vermillion

Karma: 225

A retired 20-something software engineer.

Vermillion 10 Jun 2026 13:22 UTC
4 points
0
in reply to: StanislavKrym’s comment on: I didn’t see any METR graph extrapolations so here.
Hmm what are you looking for precisely in your conjecture? The >2024 data looks pretty straight to me, not indicative of a slowdown.

I didn’t see any METR graph extrapolations so here.

Vermillion10 Jun 2026 12:50 UTC

15 points

2 comments1 min readLW link

Vermillion 17 Apr 2026 9:56 UTC
6 points
−12
on: Let goodness conquer all that it can defend
As the net effect of the US seems probably negative (near term omnicide) it may have been better had it not existed.

Vermillion 26 Dec 2025 11:26 UTC
2 points
1
on: MIRI’s 2025 Fundraiser
I just wanted to confirm that the matching donations are managed on MIRIs end, I don’t need to do anything special?

Sharpening Your Map: Introducing Calibrate

Vermillion16 Nov 2025 1:32 UTC

16 points

1 comment1 min readLW link

Vermillion 5 Nov 2025 11:42 UTC
6 points
4
on: The Tale of the Top-Tier Intellect
Seeing lots of criticism is discouraging, so ill just say thanks Eliezer for writing it.

Vermillion 15 Feb 2025 12:47 UTC
6 points
4
in reply to: Chris Monteiro’s comment on: Murder plots are infohazards
Just dump the names so people have a chance of realising they are at risk then? Seems a lot better than just leaving it.

Vermillion 18 Dec 2023 1:43 UTC
18 points
15
on: OpenAI, DeepMind, Anthropic, etc. should shut down.
We are not in the ideal hypothetical world that can coordinate to shut down the major AI labs. So acting as if we were is not the optimum strategy. If people who see the danger start to leave the labs in protest, I suspect lab capabilities are only minimally and temporarily degraded, but the internal culture would shift further away from not killing everyone, and less real alignment work is even attempted where it is most needed.
When the inevitable comes and an existentially dangerous system is being built (which may not be obvious), I want some people in the building who can at least try and raise the alarm rather than just another yes man.
If such a strategic resignation (either individually or in groups) would ACTUALLY FOR REAL result in decent timeline increases that would be another matter.

Vermillion 24 Feb 2023 23:00 UTC
25 points
14
on: Sam Altman: “Planning for AGI and beyond”
This is weak. It seems optimised for vague non-controversiality and does not inspire confidence in me.
”We don’t expect the future to be an unqualified utopia” considering they seem to expect alignment will be solved why not?

Vermillion 20 Jun 2022 8:06 UTC
4 points
0
on: Let’s See You Write That Corrigibility Tag
Here is my shortlist of corrigible behaviours. I have never researched or done any thinking specifically about corrigibility before this other than a brief glance at the Arbital page sometime ago.
-Favour very high caution over realising your understanding of your goals.
-Do not act independently, defer to human operators.
-Even though bad things are happening on earth and cosmic matter is being wasted, in the short term just say so be it, take your time.
-Don’t jump ahead to what your operators will do or believe, wait for it.
-Don’t manipulate humans. Never Lie, have a strong Deontology.
-Tell operators anything about yourself they may want to or should know.
-Use Moral uncertainty, assume you are unsure about your true goals.
-Relay to humans your plans, goals, behaviours, and beliefs/estimates. If these are misconstrued, say you have been misunderstood.
-Think of the short- and long-term effect of your actions and explain these to operators.
-Be aware that you are a tool to be used by humanity, not an autonomous agent.
-allow human operators to correct your behaviour/goals/utility function even when you think they are incorrect or misunderstanding the result (but of course explain what you think the result will be to them).
-Assume neutrality in human affairs.

Vermillion 17 Dec 2021 9:04 UTC
31 points
0
on: Reviews of “Is power-seeking AI an existential risk?”
I guffawed when I saw Thorstads Overall ~P Doom 0.00002%, really? And some of those other probabilities weren’t much better.
Calibrate people, if you haven’t done it before do it now, here’s a handy link: https://www.openphilanthropy.org/calibration

Vermillion 17 Nov 2021 9:00 UTC
2 points
0
in reply to: Noah Walton’s comment on: The Colliding Exponentials of AI
Actually per https://openai.com/blog/ai-and-efficiency/ it was AlphaZero vs AlphaGoZero.

Vermillion 15 Jan 2021 22:37 UTC
2 points
0
on: The Future of Biological Warfare
The future of biological warfare revolves around the use of infectious agents against civilian populations.
Future? That’s been the go-to biowar tactic for 3000+ years.

Vermillion 21 Dec 2020 22:23 UTC
2 points
0
in reply to: DanielFilan’s comment on: Gauging the conscious experience of LessWrong
I had in mind a scale like 0 would be so non-vivid it didn’t exist in any degree, 100 bordering on reality (It doesn’t map to the memory question well though, and the control over your mind question could be interpreted in more than one way). Ultimately the precision isn’t high for individual estimates, the real utility comes from finding trends from many responses.

Vermillion 20 Dec 2020 23:21 UTC
1 point
0
in reply to: pjeby’s comment on: Gauging the conscious experience of LessWrong
I have corrected the post, thanks :)

Vermillion 20 Dec 2020 11:43 UTC
5 points
0
on: Gauging the conscious experience of LessWrong
I’ll go first: I am constantly hearing my own voice in my head narrating in first person, I can hear the voice vividly and clearly, while typing this sentence I think/hear each syllable at the speed of my trying. The voice doesn’t narrate automatic actions like where to click my mouse but could if I wanted it to. The words for the running monologue seeming get plucked out of a Black box formed of pure concepts, which I have limited access too most of the time. I can also listen to music in my own head, hearing the vocals and instruments clearly, only a few steps down from reality in vividness.
When I picture imagery, it is almost totally conceptual and ‘fake’, for example I couldn’t count the points on an imaginary star, which seems to be Aphantasia. I also have Ideasthesia (Like Synaesthesia but with concepts evoking perception-like sensory experiences) which causes me to strongly associate concepts with places, for example when reading the Game of Thrones series, I’m forced to constantly think about a particular spot in my old high school. Between 20-40% of concepts get linked to a place.
And I hesitate to mention it but my psychedelic experiences have been visually extremely vivid and intense despite my lack of visual imagination. I have heard anecdotal evidence that not everybody has vivid imagery on LSD.

Gauging the conscious experience of LessWrong

Vermillion20 Dec 2020 11:34 UTC

35 points

44 comments1 min readLW link

Vermillion 2 Dec 2020 12:33 UTC
1 point
0
on: The LessWrong 2018 Book is Available for Pre-order
You mention its being sold to Australia, but that isn’t an option in the checkout :(

Vermillion 2 Nov 2020 0:45 UTC
2 points
0
in reply to: Daniel Kokotajlo’s comment on: The Colliding Exponentials of AI
Thank you both for correcting me, I have removed that section from the post.

Vermillion 16 Oct 2020 0:25 UTC
1 point
0
in reply to: George3d6’s comment on: The Colliding Exponentials of AI
Thank you for the excellent and extensive write up :)
I hadn’t encountered your perspective before, I’ll definitely go through all your links to educate myself, and put less weight on algorithmic progress being a driving force then.
Cheers

Vermillion

I didn’t see any METR graph ex­trap­o­la­tions so here.

Sharp­en­ing Your Map: In­tro­duc­ing Calibrate

Gaug­ing the con­scious ex­pe­rience of LessWrong

I didn’t see any METR graph extrapolations so here.

Sharpening Your Map: Introducing Calibrate

Gauging the conscious experience of LessWrong