Seems understandable to me (although I guess I’m somewhat primed by reading the previous versions).
npostavs
I think most of “you” can be omitted in English as well:
Imagine: you study an immature AI in depth. Decode its mind entirely. Develop a great theory of how it works. Validate this theory on a bunch of examples. Use that theory to predict how the AI’s mind will change as it ascends to superintelligence and gains (for the first time) the very real option of grabbing the world for itself. Even then, you are, fundamentally, using a new and untested scientific theory to predict the results of an experiment that has not yet run, about what the AI will do when it really, actually, for real has the opportunity to grab power from the humans.
This seems to be an accidental repost of https://www.lesswrong.com/posts/9TPEjLH7giv7PuHdc/crime-and-punishment-1 from April. (It’s also reposted on https://thezvi.wordpress.com/2025/11/03/crime-and-punishment-1-2/, but not thezvi.substack.com/).
“Von Neumann was pronounced, by a peer, to be smarter than Albert Einstein to his face and got no objection” interpretation feels off to me
I see that it’s a bit ambiguous, but I read “to his face” as most likely referring to Einstein’s face, which is consistent with your interpretation of Wigner.
The thing that makes hypnosis so bizarre and seemingly powerful is its ability to keep attention, [...] [...] In full blown hypnosis [...] they are putting their attention where I specify without doubt or hesitation.
This sounds like it corresponds to “the idea of a state of focused attention”, so I don’t understand why you rejected it. Just because he talks about it as a spectrum (vs a state)? Or something else?
I tried several things without success (each in Claude Opus 4.1, Gemini Pro 2.5, and GPT-5):
Yeah, for now you probably need something more specialized. https://electricalexis.github.io/notagen-demo/ can compose music of semi-decent quality, so with the right training a model ought to be able to manage recognition too (although more unconventional music would be harder).
that I wrote out twice as fast as it actually goes,
Music notation rhythms are relative, so I don’t think this has a real meaning? Like, it might be nicer to use half notes as the main beat, and write the tune mostly in quarters, as you did in the Musescore typeset version. But the hand-written version using eighth notes to a quarter note beat conveys basically the same thing (ignoring the triplet issue).
Your last two Musescore files are missing some separation between 1st and 2nd endings. Compare the images at https://musescore.org/en/handbook/4/voltas
Underdogs lose. If you win, you weren’t the underdog.
Is it not more like, p(underdog_loses) > 0.5? Sometimes the thing with lesser probability happens even if the prediction was well-calibrated.
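To make the calibration point concrete, here's a quick toy simulation (my own made-up numbers, nothing from the post): a well-calibrated forecaster who gives the favorite a 70% chance should still see the underdog win about 30% of the time, so an occasional underdog win doesn't contradict the forecast.

```python
import random

random.seed(0)

# Hypothetical calibrated forecast: favorite wins with probability 0.7.
P_FAVORITE = 0.7
TRIALS = 100_000

# Count how often the lower-probability outcome (underdog win) occurs.
underdog_wins = sum(random.random() >= P_FAVORITE for _ in range(TRIALS))
print(f"underdog win rate: {underdog_wins / TRIALS:.3f}")  # roughly 0.3
```

So "underdogs lose" is only a better-than-even bet, not a tautology.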
I don’t think this interpretation can hold up: the body of titotal’s post doesn’t deal with the good vs bad timeline. It’s just about the uncertainty of modelling AI progress which applies for both the good and bad timelines.
I think it’s an intentional pun, like, “whether forecasters” are people who predict whether something will happen or not.
Maybe I should have asked: In what sense are machines “fully doing” first-order logic? I think I understand the part where valid first-order logic formulas are recursively enumerable, in theory, but isn’t that intractable to the point of being useless and irrelevant in practice?
Unlike first-order logic, second-order logic is not recursively enumerable—less computationally tractable, more fluid, more human. It operates in a space that, for now, remains beyond the reach of machines still bound to the strict determinism of their logic gates.
In what sense is second-order logic “beyond the reach of machines”? Is it non-deterministic? Or what are you trying to say here? (Maybe some examples would help)
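For what it's worth, here's a toy sketch of what "recursively enumerable" buys you (my own illustration, not anything from the post): a procedure that eventually answers "yes" for every member of the set, but may loop forever on non-members. For first-order logic the enumeration is over proofs; here, to keep it self-contained, the "proofs" are just integers and the "valid formulas" are perfect squares.

```python
from itertools import count

def is_proof_of(n: int, target: int) -> bool:
    """Stand-in proof checker: does candidate n 'prove' target?
    (Here: n is a 'proof' that target is a perfect square.)"""
    return n * n == target

def semi_decide(target: int) -> bool:
    """Halts (returning True) iff target is in the set; loops otherwise.
    This is the shape of proof search in first-order logic:
    enumerate all candidate proofs and check each one."""
    for n in count():
        if is_proof_of(n, target):
            return True
        # For non-members this loop never terminates -- enumeration
        # can give you "yes" answers but never a "no".

print(semi_decide(49))  # True, after checking candidates 0..7
# semi_decide(50) would loop forever
```

And even when a proof exists, blind enumeration can take astronomically long to find it, which is the "intractable in practice" worry above.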
What about tuning the fiddle strings down 1 tone?
You say this:
If you’re thinking, “Wait no, I’m pretty sure my group is fundamentally about X, which is fundamentally good,” then you’re probably still in Red or Blue.
But you also say this:
First, the Grey tribe is about something, [...] things that people already think are good in themselves.
Doesn’t the first statement completely undermine the second one?
I guess you meant jukebox, not jutebox. Unless there is some kind of record-playing box made of jute fiber that I haven’t heard of...
but I recently tried again to see if it could learn at runtime not to lose in the same way multiple times. It couldn’t. I was able to play the same strategy over and over again in the same chat history and win every time.
I wonder if having the losses in the chat history would instead be training/reinforcing it to lose every time.
Yes, my understanding is that the system prompt isn’t really privileged in any way by the LLM itself, just in the scaffolding around it.
But regardless, this sounds to me less like maintaining or forming a sense of purpose, and more like retrieving information from the context window.
That is, if the LLM has previously seen (through system prompt or first instruction or whatever) “your purpose is to assist the user”, and later sees “what is your purpose?” an answer saying “my purpose is to assist the user” doesn’t seem like evidence of purposefulness. Same if you run the exercise with “flurbles are purple”, and later “what color are flurbles?” with the answer “purple”.
#2: Purposefulness. The Big 3 LLMs typically maintain or can at least form a sense of purpose or intention throughout a conversation with you, such as to assist you.
Isn’t this just because the system prompt is always saying something along the lines of “your purpose is to assist the user”?
by saying their name aloud: [...] …but it’s a lot more difficult to use active recall to remember people’s names.
I’m confused, isn’t saying their name in a sentence an example of active recall?
This is an accidental double post of https://www.lesswrong.com/posts/FJxc4Lk6mijiFiPp2/the-big-nonprofits-post-2025 (also double posted on the wordpress site: https://thezvi.wordpress.com/2025/11/26/the-big-nonprofits-post-2025/ and https://thezvi.wordpress.com/2025/11/27/the-big-nonprofits-post-2025-2/)