der

Karma: 62

der Mar 2, 2025, 5:44 AM
−1 points
0
on: der’s Shortform
how about gary marcus as a situational awareness dampening, counter-panic psyop

der Jan 30, 2025, 2:49 PM
5 points
0
on: der’s Shortform
Prediction (influenced by R1-Zero): By EOY, expert-level performance will be reported on outcome prediction for a certain class of AI experiments—those that can be specified concisely in terms of code and data sets that:
1. are frequently used and can be referenced by name, e.g. MNIST digits, or
2. are small enough to be given explicitly, or
3. are synthetic, specified by their exact distribution in code.

der’s Shortform

derJan 30, 2025, 2:49 PM

2 points

2 comments LW link

der Jul 26, 2024, 2:03 AM
1 point
0
in reply to: O O’s comment on: “AI achieves silver-medal standard solving International Mathematical Olympiad problems”
We don’t know how narrow it is yet. If they did for algebra and number theory something like what they did for geometry in alphageometry (v1), providing it a well-chosen set of operations, then I’ll be more inclined to agree.

der Jul 25, 2024, 10:30 PM
3 points
2
on: “AI achieves silver-medal standard solving International Mathematical Olympiad problems”
I don’t understand why people aren’t freaking out from this news. Waiting for the paper I guess.

der Jul 19, 2024, 5:50 PM
1 point
0
in reply to: Richard_Kennaway’s comment on: The Potential Impossibility of Subjective Death
What we want is orthogonal though, right? Unless you think that metaphysics is so intractable to reason about logically that the best we can do is go by aesthetics.

der Jul 19, 2024, 5:43 PM
1 point
0
in reply to: JBlack’s comment on: The Potential Impossibility of Subjective Death
Unfortunately the nature of reality belongs to the collection of topics that we can’t expect the scientific method alone to guide us on. But perhaps you agree with that, since in your second paragraph you essentially point out that practically all of mathematics belongs to the same collection.

der Jul 19, 2024, 5:29 PM
1 point
0
on: The Potential Impossibility of Subjective Death
It’s not necessary to bring quantum physics into it. Isomorphic consciousness-structures have the same experience (else they wouldn’t be isomorphic, since we make their experience part of them). The me up to the point of waking up tomorrow (or the point of my apparent death) is a such a structure (with no canonical language unfortunately; there are infinitely many that suffice), and so it has an elementary class, the structures that elementarily extend it, in particular that extend its experience past tomorrow morning.

der Jul 19, 2024, 4:57 PM
1 point
0
on: The Potential Impossibility of Subjective Death
+2 for brevity! A couple more explorations of this idea that I didn’t see linked yet. They are more verbose, but in a way I appreciate.
- The mathematical universe: the map that is the territory. I’d love to meet the author of this. They also wrote the excellent If a tree falls on Sleeping Beauty.… Sadly they haven’t used that account in many years.
- Simulation, Consciousness, Existence (Hans Moravec)
If you want to explore this idea further, I’d love you join you.

der Jul 6, 2024, 10:38 PM
1 point
0
in reply to: Radford Neal’s comment on: Fertility Roundup #2

But “more people are better” ought to be a belief of everyone, whether pro-fertility or not. It’s an “other things being equal” statement, of course—more people at no cost or other tradeoff is good. One can believe that and still think that less people would be a good idea in the current situation. But if you don’t think more people are good when there’s no tradeoff, I don’t see what moral view you can have other than nihilism or some form of extreme egoism.

Do all variants of downside focused ethics get dismissed as extreme egoism? Hard to see them as nihilistic.

I suspect clarity and consensus on the meaning of “more people at no cost or other tradeoff” to be difficult. If “more people” means more happy people preoccupied with the welfare of the least fortunate, then sure “at no cost or other tradeoff” should suffice for practically everyone to get behind it. But that seems like quite a biased distribution for a default meaning of “more people.”

der Jul 6, 2024, 9:19 PM
1 point
0
on: Rapid capability gain around supergenius level seems probable even without intelligence needing to improve intelligence
When capability is performing unusually quickly
Assuming you meant “capability is improving.” I expect capability will always feel like it’s improving slowly in an AI researcher’s own work, though… :-/ I’m sure you’re aware that many commenters have suggested this as an explanation for why AI researchers seem less concerned than outsiders.

der Nov 2, 2023, 3:20 PM
0 points
2
in reply to: trevor’s comment on: AI Safety is Dropping the Ball on Clown Attacks, and Mind Control in General
“Clown attack” is a phenomenal term, for a probably real and serious thing. You should be very proud of it.

der Jul 5, 2023, 11:01 PM
5 points
2
in reply to: FeepingCreature’s comment on: Douglas Hofstadter changes his mind on Deep Learning & AI risk (June 2023)?
This was thought provoking. While I believe what you said is currently true for the LLMs I’ve used, a sufficiently expensive decoding strategy would overcome it. Might be neat to try this for the specific case you describe. Ask it a question that it would answer correctly with a good prompt style, but use the bad prompt style (asking to give an answer that starts with Yes or No), and watch how the ratio of the cumulative probabilities of Yes* and No* sequences changes as you explore the token sequence tree.

der Jun 27, 2023, 4:05 PM
1 point
on: The mathematical universe: the map that is the territory
Anybody know who the author is? I’m trying to get in contact, but they haven’t posted on LW in 12 years, so they might not get message notifications.

der Apr 10, 2023, 9:46 AM
1 point
0
in reply to: Richard_Ngo’s comment on: Policy discussions follow strong contextualizing norms
I see. I guess hadn’t made the connection of attributing benefits to high-contextualizing norms. Only got as far as observing that certain conversations go better with comp lit friends than with comp sci peers. That was the only sentence that gave me a parse failure. I liked the post a lot.

der Apr 9, 2023, 7:16 PM
−7 points
0
in reply to: Mateusz Bagiński’s comment on: Ng and LeCun on the 6-Month Pause (Transcript)
@lc and @Mateusz, keep up that theorizing. This needs a better explanation.

der Apr 6, 2023, 4:14 PM
1 point
0
in reply to: der’s comment on: Policy discussions follow strong contextualizing norms
Ah, no line number. Context:

To me it seems analogous to how there are many statements that need to be said very carefully in order to convey the intended message under high-decoupling norms, like claims about how another person’s motivations or character traits affect their arguments.

der Apr 6, 2023, 4:13 PM
1 point
0
on: Policy discussions follow strong contextualizing norms
high-decoupling
Did you mean high-contextualizing here?

der Mar 3, 2023, 2:04 PM
1 point
0
on: AGI will have learnt utility functions
Interestingly, learning a reward model for use in planning has a subtle and pernicious effect we will have to deal with in AGI systems, which AIXI sweeps under the rug: with an imperfect world or reward model, the planner effectively acts as an adversary to the reward model. The planner will try very hard to push the reward model off distribution so as to get it to move into regions where it misgeneralizes and predicts incorrect high reward.
Remix: With an imperfect world… the mind effectively acts as an adversary to the heart.
Think of a person who pursues wealth as an instrumental goal for some combination of doing good, security, comfort, and whatever else their value function ought to be rewarding (“ought” in a personal coherent extrapolated volition sense). They achieve it but then, apparently it’s less uncomfortable to go on accumulating more wealth than it is to get back to the thorny question of what their value function ought to be.

der Sep 23, 2022, 7:28 PM
1 point
on: Writeup: Progress on AI Safety via Debate
Is there a more-formal statement somewhere of the theorem in Complexity theory of team games without secrets? Specifically, one that only uses terms with standard meanings in complexity theory? I find that document hard to parse.
If concreteness is helpful, take “terms with standard meanings in Complexity Theory” to be any term defined in any textbook on complexity theory.