This may be just me (I’ve barely interacted with you), but you create the impression of being intentional in your actions and busy. When I saw you carrying the ball around, I found it funny, but assumed you have a deliberate goal in mind, rather than simply looking for people to be playful (I wasn’t at the area near the roof). That being said, improvisation is a skill a lot of people at lessonline (me included) could improve at.
martinkunev
balls
Studies show that humans have shape bias compared to some other animals (e.g. dogs). Dogs tend to categorize more based on size and texture. I’m not sure what this tells us. Maybe it just points to useful concepts being goal-dependent?
It may be interesting to compare languages and look for concepts only present in some. The examples I could find seem to be from categories other than nouns (e.g. left/right, specific colors, numbers, time).
In this example the program generating the data from both S1 and S2 is the more general abstraction, but can we really think of it as natural?
Maybe we have M3, M4 and so forth and if we try use the corresponding strings S1, … Sn to produce the data, we get something longer (each Si is small but there are many of them). In that case, there will be no single abstraction which is the most “natural”. I’m handwaving a lot, but does this make sense?
Not exactly what you’re asking for, but maybe in the right direction?
Things like >>= != =~ ++ ← /* <% are arguably new symbols, typically built on top of ASCII but only for convenience
I think the hard part is actually finding the right books / resources when you don’t have the skill (obviously) you’re trying to learn.
I don’t think I’ve heard people calling 1 wireheading. It certainly isn’t whatI have in mind when I hear the term.
For 2, I’m interested to see a non-embedded example. If the agent can tamper the input to its reward function, wouldn’t that make it embedded?
The Death in Damascus thought experiment shows some conceptual shortcomings of CDT which precommitment cannot deal with.
In this thought experiment, CDT could (depending on implementation) enter into an infinite loop or keep on wasting resources to move between Damascus and Aleppo.
I like the idea a lot. There are some details I’d question (coming up with concepts of attention, boredom or gravity doesn’t feel easy to me). I don’t think there is enough information to reconstruct a human face. I appreciate the post, because I would have strived for perfection and either gave up or never finished it.
That’s an interesting approach. My thinking at the time was that the number of vaccines administered gives legitimacy to the government’s decisions. There were several other factors I considered besides the mandate:
All this happened while the vaccine was still in experimental stage in France.
There was political motivation to use all the vaccines already ordered. As far as I’m aware, the frequency of boosting was not justified by any science.
The benefits of the vaccines were exaggerated in public discourse (“you’ll never get sick”, “the virus will disappear”, etc.). The priority was never honest communication but manipulating people.
There was an attempt to ridicule any opposition to the vaccine mandate as being motivated by woo-woo.
Just to be clear, I still wasn’t sure I was making the right choice at the time and I would have gotten vaccinated if there was no mandate.
What is the relationship between convergent factorization and natural latent?
this sounds like 3. Context Refresh
60% resolved my social anxiety
I’m honestly curious how you measure that. I’ve been asking myself “am I progressing with respect to social anxiety” and I don’t have a good answer.
To me this reads as a definition of the term “free will” and then arguing against its existence.
If something can predict your action with better than random accuracy no matter how hard you try to prevent them, you don’t have free will over that action
The word “try” here hides lots of complexity and does some heavy lifting. Can you taboo it and rephrase that? Also, are you rejecting the computational theory of mind?
In the case of the brain-scanner, I’d say “the second before” just reflects the fact that taking an action in the physical world takes >0 time, and once you’ve made the decision to start the action you are no longer free to reverse it
If you take an agent embedded in the environment, I don’t think there is a clear-cut boundary between “not yet made a decision” and “have made the decision”. This abstraction just breaks down.
My fear with the teleporter has always been the engineering details—can it get a consistent snapshot of me? what about the last moments after the snapshot and before the old copy is destroyed? can it reliably reconstruct me? what happens in case of a failure?
Assuming many worlds quantum mechanics, we should have similar anticipations for forking into two and for tossing a quantum coin.
There is no rollback when open-weight models are almost SOTA. Could we convince people like Zuck that open weights are too risky? I seriously doubt it.
There are some differences between what women consider attractive female traits and what men actually find attractive. The example I like to give is how some women enlarge their lips even though men (usually) don’t find that attractive.
I think these are the relevant points:
Maybe women’s sense of each other’s beauty is more discriminating than men’s.
Some women probably don’t distinguish a sex symbol’s actual physical attractiveness from the other characteristics that would make her rarely and especially appealing to women (like fame, wealth, etc.), and they’re neglecting to account for these factors being less salient to men.
This:
Might agent rewrite agent ‘s brain to make agent better satisfy agent ’s utility function? Most forms of wire-heading inherently limit the ability of agents to affect the future
and this
We have not proved that agent does not try to affect agent ‘s utility function (in fact, I expect in many cases agent does try to influence agent ’s utility function).
appear to be in conflict. Are you trying to say that depending on the circumstances b may try to influence a’s utility function or avoid doing so?
How important is it to keep the system prompt short? I guess this would depend on the model, but does anybody have useful tips on that?
I’m also interested in trying such a discussion group