Looking at this and this, I’d guess that it’s just harder to produce super toxic toxins artificially than it is to produce super sweet sweeteners. IIRC the mass of neotame it takes to taste any sweetness is lower than the mass of VX it takes to kill someone.
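As a rough back-of-envelope check (ballpark figures from memory, not from the linked posts): sucrose's detection threshold is around 3 g/L, neotame is roughly 8,000 times sweeter than sucrose by weight, and the human dermal LD50 of VX is commonly cited at around 10 mg. For a 30 mL sip, that gives

$$m_{\text{neotame}} \approx \frac{3\ \text{g/L} \times 0.03\ \text{L}}{8000} \approx 10\ \mu\text{g} \;\ll\; m_{\text{VX}} \approx 10\ \text{mg},$$

so the sweetness-detection dose comes out about three orders of magnitude below the lethal dose.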
An explanation of decision theories
A nod to Lena Forsen, a photo of whom is often used as a test image in image processing papers.
This is the first LW census I’ve taken!
[Question] What’s up with psychonetics?
What if we had a setting to hide upvotes/hide reactions/randomize order of comments so they aren’t biased by the desire to conform?
[Question] Is anyone working on formally verified AI toolchains?
Aw man, there go my weekend plans
Notice your everything
My immediate thought is that the cat is already out of the bag: whatever risk there was of AI safety people accelerating capabilities is by now far outweighed by capabilities hype and, more generally, much larger incentives, so the most we can do is continue building awareness of AI risk. Something about this line of reasoning strikes me as uncritical, though.
I actually dislike making everything a set; it feels similar to programming in Brainfuck. Sure, it’s Turing complete, but the way programs are structured doesn’t map cleanly onto how a human would conceptualize them, and you need to write a lot of boilerplate for things you ordinarily don’t even think about.
In practice, this leads to confusing notation like using “subset of” or “element of” for “less than”, which makes it harder to see whether to think of something as a number or just a generic set. Here, since X is not “typed”, it is hard to tell that it should be thought of as a set of sets rather than just a generic set.
Also, you get weird pathological stuff like {1, 2} being a topological space.
As a formalism for mathematics, I much prefer type theory, which not only maps more cleanly onto how humans think but also uses simpler axioms. It also has connections to logic, computer science, and category theory (and, by extension, many other fields of math).
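To make the contrast concrete, here’s a minimal Lean 4 sketch of the “typed vs. untyped” point (core `Nat` only, no Mathlib; my own illustration):

```lean
-- In type theory, 2 is a natural number, not a set:
#check (2 : Nat)      -- 2 : Nat

-- Ordering on naturals uses the order relation `<`, not `∈` or `⊆`:
example : 1 < 2 := by decide

-- In ZFC with the von Neumann encoding, 2 = {∅, {∅}} and 1 = {∅},
-- so "1 ∈ 2" is a (junk) theorem. Here the analogous claim is not
-- even well-formed: there is no `Membership Nat Nat` instance, so
-- this line would be rejected at type-checking time if uncommented:
-- #check 1 ∈ 2
```

The point isn’t that you can’t prove things about numbers in set theory, it’s that the encoding leaks: questions that should be type errors become theorems.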
[Question] Has anyone thought about how to proceed now that AI notkilleveryoneism is becoming more relevant/is approaching the Overton window?
I feel like just “gender” would be better than “physical sex” here. For instance, I’d expect trans women to fall in the female cluster (being trans intersects with that in ways that strain this model, but it still rounds to what I said).
I don’t think the specific corner of decision theory where people argue over Newcomb’s problem is large enough as a field to be subject to the EMH, and I don’t think the incentives are awfully strong either. I’d compare it to ordinal analysis, a field which does have PhDs but has very few experts overall and not many strong incentives. One significant recent result (if the proof holds, the ordinal notation in question would be the most powerful one proven well-founded) was done entirely by an amateur building off of work by other amateurs (see the section on the Bashicu Matrix System): https://cp4space.hatsya.com/2023/07/23/miscellaneous-discoveries/
We don’t actually know for sure that it’s GPT-4.5. It could be an alternative training run that preceded the current version of GPT-4, or even a different model entirely.
How do you know that this isn’t how human consciousness works?
I think it makes more sense to word this as “others are not remarkably more irrational than you are” rather than saying that disagreements are not caused by irrationality.
Formal alignment proposals avoid this problem by doing metaethics, mostly something like determining what a person would want if they were perfectly rational (so no cognitive biases or logical errors), otherwise basically omniscient, and had an unlimited amount of time to think about it. This is called reflective equilibrium. I think this approach would work for most people, even pretty terrible ones. If you extrapolated a terrorist who commits acts of violence for some supposed greater good, for example, they’d realize that the reasoning they used to conclude those acts were good was wrong.
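Schematically (my own notation, not from any particular proposal): writing $V^*(p)$ for person $p$’s extrapolated values,

$$V^*(p) \;=\; \lim_{t \to \infty} V\big(p \mid \text{no reasoning errors, full relevant information, time } t \text{ to reflect}\big),$$

the claim is that the terrorist’s endorsement of violence doesn’t survive the limit, even though their current values $V(p)$ include it.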
Corrigibility, on the other hand, is more susceptible to this problem, so you’d want to get the AI to perform a pivotal act, for example destroying every GPU to prevent other people from deploying harmful (or, for that matter, merely unaligned) AI.
Realistically, I think that most entities who’d want to use a superintelligent AI like a nuke would probably be too short-sighted to care about alignment, but don’t quote me on that.
I think artificial sweeteners are so often discovered serendipitously because they tend to be insanely sweet (you usually find them mixed with a higher volume of filler because of how sweet they are), which makes them easy to notice even with standard safety measures.