This feels like something Scott Alexander could’ve written about, and it has the same revelatory quality.
I assume OP thought that there was some specific place in the training data the LLM was replicating.
I think that requires labeled data.
It doesn’t, and the developers don’t label the data. The LLM learns that these categories exist during training because it can, and because doing so helps minimize the loss function.
I don’t think there are necessarily any specific examples in the training data. LLMs can generalize to text outside of the training distribution.
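For concreteness, here’s a minimal sketch of why no labeled data is needed (the toy PyTorch model, vocabulary size, and shapes are my own illustration, not anything from this thread). The only training signal in pretraining is next-token prediction, so any category structure the model develops exists because it lowers this loss:

```python
import torch
import torch.nn as nn

vocab_size, dim = 100, 32
model = nn.Sequential(
    nn.Embedding(vocab_size, dim),   # token ids -> vectors
    nn.Linear(dim, vocab_size),      # vectors -> next-token logits
)

tokens = torch.randint(0, vocab_size, (1, 16))   # "training data": just token ids, no labels
inputs, targets = tokens[:, :-1], tokens[:, 1:]  # the only "labels" are the next tokens themselves

logits = model(inputs)                           # shape (1, 15, vocab_size)
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), targets.reshape(-1)
)
loss.backward()   # whatever internal categories help predict the next token get reinforced
```

Nowhere does a developer tell the model which categories exist; the targets are just the text shifted by one position.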
Another problem is: why should we expect ourselves to be in the particles rather than in the wave function directly? Both MWI and Bohmian mechanics have the wave function, after all. It might be the case that there are particles bouncing around, but the branch of the wave function we live in has no relation to their positions.
Have you tried just copying and pasting an alignment research paper (or other materials) into a base model (or sufficiently base model-like modes of a model) to see how it completes it?
I’m talking about commenters.
What if we had a setting to hide upvotes/hide reactions/randomize the order of comments, so commenters aren’t biased by the desire to conform?
Did y’all contact the people who got free tickets?
What do you mean by arbitrage?
janus developed Simulators after messing around with language models and identifying an archetype, common to many generations, called Morpheus, which seems to represent the simulator.
A joke stolen from Tamsin Leake: A CDT agent buys a snack from the store. After paying: “Wow, free snack!”
I would be surprised if you haven’t read Unsong already.
How do you know that this isn’t how human consciousness works?
You’re correct that this isn’t something that can be told to someone who is already in the middle of doing the thing. They mostly have to figure it out for themselves.
One common confusion I see is analogizing whole LLMs to individual humans, and thus concluding that LLMs can’t think or aren’t conscious, when it is more appropriate to analogize the LLM to the human genome and individual instantiations of an LLM to individual humans.
The human genome is more or less unchanging, but one can pull entities from it that can learn from their environment. Likewise, an LLM is more or less unchanging, but one can pull entities from it that can learn from the context.
It would be pretty silly to say that humans can’t think or aren’t conscious because the human genome doesn’t change.
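To make the analogy concrete, here is a toy sketch (all names and classes are hypothetical, purely illustrative): the genome/weights level is frozen, while each spawned instance has mutable context and can learn without the underlying object ever changing.

```python
from dataclasses import dataclass, field

@dataclass(frozen=True)          # frozen: the genome / weights never change
class Genome:
    weights: tuple = (0.1, 0.5, 0.9)

    def spawn(self) -> "Instance":
        return Instance(source=self)

@dataclass
class Instance:
    source: Genome
    context: list = field(default_factory=list)   # mutable: this is where learning happens

    def observe(self, event: str) -> None:
        self.context.append(event)               # learning from context, not weight updates

genome = Genome()
alice, bob = genome.spawn(), genome.spawn()
alice.observe("saw a red ball")                  # alice learns; genome and bob are untouched
assert genome.weights == (0.1, 0.5, 0.9) and bob.context == []
```

Asking whether the `Genome` object can learn is the wrong level of description; the instances are the things that learn.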
“Who in the community do you think is easily flatterable enough to get to say yes, and also stupid enough to not realize I’m making fun of them?”
I think anyone who says anything like this should stop and consider whether it is more likely to come out of the mouth of the hero or the villain of a story.
Was there any specific moment where you went from Mormon to not Mormon or was it gradual? If it was sudden, what triggered it?
The hypothesis I would immediately come up with is that less traditionally masculine AMAB people are inclined towards less physical pursuits.