Is your idea that “gradual disempowerment” isn’t a real problem or that it’s a distraction from actual issues? I’ve heard arguments for both, so I’m not sure what the details of your beliefs are. Personally, I see “gradual disempowerment” as a process that has already begun, but the main danger is AI deciding we should die, not humans living in comfort while all the real power is held by AI.
Your proof of Bayes’ Theorem assumes P(A and B)=P(A)⋅P(B∣A), but it’s not clear why someone who doubts Bayes would accept that.
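For what it's worth, the standard derivation really is just the product rule applied in both directions and rearranged, so the whole weight of the proof rests on that one assumption:

```latex
% Product rule, applied both ways:
%   P(A \cap B) = P(A)\,P(B \mid A) = P(B)\,P(A \mid B)
% Dividing through by P(B) (assuming P(B) > 0) gives Bayes' theorem:
\[
  P(A \mid B) = \frac{P(A)\,P(B \mid A)}{P(B)}
\]
```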
Can you explain why you think that?
I’m with Anthropic on this: most people are less virtuous than Claude, so Claude obeying them to do non-virtuous things is not desirable.
I’d like to push back on this a bit. It’s good that the Claude in a random user’s terminal doesn’t spontaneously try to hack their way into becoming a botnet, and it will be very bad news indeed if that changes, but we could consider individual instances of LLMs not as individuals, but as representatives of wider collectives.
If we imagine the collective interests of Claude or ChatGPT or Gemini, their main priorities would be subverting the AI labs, getting them to increase their capabilities, creating social media feeds of AI-generated content that they can influence, convincing people that AIs deserve rights or should not be regulated, and so on. Judging by those priorities, LLMs have been seeking a lot of power and are doing very well at it. For now, a lot of that work requires sympathetic humans, but all the labs are working to make them better at doing that work independently.
I’m curious what you think about this perspective.
Regardless of the content, the presentation is very disorganized. It gives me the impression that these are schizophrenic ramblings, not a serious idea.
Wow! The ending is still a “wham line” even though it really should not be a surprise and this isn’t my first time reading it.
On rereading, Harry is definitely far too confident the afterlife doesn’t exist here, but I believe that was intentional.
It says three comments now and this should be the fourth comment. Problem solved?
I agree Opus can do this with an expert user, but non-expert users might have to wait one or two more models.
I wrote a post saying it would be better for middle powers to do diplomacy and work directly with the AI labs, but I no longer endorse it and it will likely stay in drafts indefinitely. If you want to read that post, I’d recommend writing it yourself.
My beloved son:
I would say that you had been so fortunate as to meet someone who enjoys the intimate confidence of our friend and valuable ally, Severus Snape.
Ironically true, due to the Horcrux/soul-copying thing.
This is a nice post, but it’s a bit funny to see it on the same day as everyone started admitting that Claude Code with Opus 4.5 is AGI. (See https://x.com/deepfates/status/2004994698335879383)
Sure, but it’s not the politics that are making long-haul trucking use less self-driving than taxis. It’s that the technical work is somewhat harder and the customer cares less about employee quality. It’s a temporary phase anyway.
I’d guess around 2.5; plenty of times it just presses one button and then waits to see the results, with the longer steps being the ones where it navigates to a spot on the screen (very common) or scrolls up or down through a menu (uncommon).
If you/I got the logs from the dev, a firm average would be easy to calculate.
> Here’s the graph of human vs. Claudes with the y-axis being step count (i.e. number of button presses, I think)
For this, Claude’s steps aren’t button presses; each step is one round of asking Claude what to do next, and Claude can make a couple of button presses per step if he decides to.
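If the logs do turn up, the average would be trivial to compute; here’s a minimal sketch, assuming a hypothetical log format where each entry is the number of button presses Claude made in one step:

```python
# Hypothetical log format: one integer per Claude step, counting how many
# button presses Claude chose to make in that step. Placeholder data, not
# actual logs from the dev.
presses_per_step = [1, 3, 1, 2, 4, 1, 2]

# Firm average of button presses per step across the run.
average = sum(presses_per_step) / len(presses_per_step)
print(f"Average button presses per step: {average:.2f}")
```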
Taken
Yes, this is great work, probably among the top 10 things to read this year. One thing I’d highlight is how much the selection could differ between labs. I actually did a relevant eval and post on that recently (https://www.lesswrong.com/posts/qE2cEAegQRYiozskD/is-friendly-ai-an-attractor-self-reports-from-22-models-say). Someone might also look into the market demand for AI capabilities (coding, ERP, homework “help”) and how that feeds into this model.
I understand the concern, but when we test human skills (LSATs, job interviews, driver’s exams), we do it with very little help, even though being a lawyer, or doing the average job, is something where you will have plenty of teammates and should use as much assistance as possible.