Nicholas / Heather Kross

Karma: 2,040

Theoretical AI alignment (and relevant upskilling) in my free time. My current view of the field is here (part 1) and here (part 2).

Nicholas / Heather Kross Feb 6, 2025, 12:07 AM
2 points
0
on: We Fell For It
Something else I just realized: Georgism is a leftish idea that recognizes some (but not all) leftish ideas I’ve discussed or referenced above, and its modern form is currently rationalist-adjacent. Progress!

Nicholas / Heather Kross Feb 6, 2025, 12:06 AM
4 points
2
in reply to: sloonz’s comment on: We Fell For It
Ah, sorry yeah I think it was a mistake on my part to mostly make the post a verbatim Discord reply. Lots of high-context stuff that I didn’t explain well.

This specific part is (in my usage/interpretation; if you click the link, the initial context was an Emmett Shear tweet) basically a shorthand for one or more “basic” leftist views, along the lines of these similar-but-somewhat-distinct claims:
- Capitalism more-reliably rewards power-maximizers than social-utility-maximizers.
- Under capitalism and similar incentive-structures, we’d expect conflict theory to predict entities’ wealth better than mistake theory.
- General outcomes, under capitalism and similar incentive-structures, are downstream of “brute power” (from guns to monopolies) far more than the things we’d “want” to reward (innovation, good service, helping people, etc).

Nicholas / Heather Kross Feb 5, 2025, 6:27 PM
2 points
2
on: We Fell For It
In hindsight, I over-updated on my previous success with a poorly-written angry short post with a clickbait title and lots of inline links criticizing the rationality community. Oops.

Nicholas / Heather Kross Feb 5, 2025, 12:50 AM
2 points
0
in reply to: Dacyn’s comment on: Rationalist Movie Reviews
I said “one of the best movies about”, not “one of the best movies showing you how to”.

Rationalist Movie Reviews

Nicholas / Heather KrossFeb 1, 2025, 11:10 PM

16 points

2 comments4 min readLW link

(www.thinkingmuchbetter.com)

Nicholas / Heather Kross Dec 22, 2024, 7:31 PM
3 points
1
in reply to: Nicholas / Heather Kross’s comment on: NicholasKross’s Shortform
The punchline is “alignment could productively use more funding”. Many of us already know that, but I felt like putting a mildly-opinionated spin on what kind of things, at the margin, may help top researchers. (Also I spent several minutes editing/hedging the joke)

Nicholas / Heather Kross Dec 22, 2024, 7:22 PM
3 points
1
on: NicholasKross’s Shortform
Virgin 2030s [sic] MIRI fellow:
- is cared for so they can focus on research
- has staff to do their laundry
- soyboys who don’t know *real* struggle
- 3 LDT-level alignment breakthroughs per week
CHAD 2010s Yudkowsky:
- founded a whole movement to support himself
- “IN A CAVE, WITH A BOX OF SCRAPS”
- walked uphill both ways to Lightcone offices.
- alpha who knows *real* struggle
- 1 LDT-level alignment breakthrough per decade

Nicholas / Heather Kross Nov 30, 2024, 2:18 AM
2 points
0
in reply to: Zheng Wang’s comment on: Why and When Interpretability Work is Dangerous
Kinda, my current mainline-doom-case is “some AI gets controlled --> powerful people use it to prop themselves up --> world gets worse until AI gets uncontrollably bad --> doom”. I would call it a different yet also-important doom case of “perpetual low-grade-AI dictatorship where the AI is controlled by humans in a surveillance state”.

Nicholas / Heather Kross Nov 10, 2024, 5:36 PM
3 points
1
in reply to: Nicholas / Heather Kross’s comment on: An AI crash is our best bet for restricting AI
EDIT: Due to the incoming administration’s ties to tech investors, I no longer think an AI crash is so likely. Several signs IMHO point to “they’re gonna go all-in on racing for AI, regardless of how ‘needed’ it actually is”.

Nicholas / Heather Kross Oct 18, 2024, 5:44 PM
5 points
2
on: An AI crash is our best bet for restricting AI
For more details on (the business side of) a potential AI crash, see recent articles by the blog Where’s Your Ed At, which wrote the sorta-well-known post “The Man Who Killed Google Search”.
For his AI-crash posts, start here and here and click on links to his other posts. Sadly, the author falls into the trap of “LLMs will never get to reasoning because they don’t, like, know stuff, man”, but luckily his core competencies (the business side, analyzing reporting) show why an AI crash could still very much happen.
What links here?
- OpenAI defected, but we can take honest actions by Remmelt (EA Forum; Oct 21, 2024, 8:41 AM; 19 points)
- OpenAI defected, but we can take honest actions by Remmelt (Oct 21, 2024, 8:41 AM; 17 points)

Nicholas / Heather Kross Jul 18, 2024, 10:53 PM
3 points
1
on: AI #73: Openly Evil AI
Further context on the Scott Adams thing lol: He claims to have taken hypnosis lessons decades ago and has referred to using it multiple times. His, uh, personality also seems to me like it’d be more susceptible to hypnosis than average (and even he’d probably admit this in a roundabout way).

Nicholas / Heather Kross May 11, 2024, 6:05 PM
2 points
0
in reply to: Nicholas / Heather Kross’s comment on: Please stop publishing ideas/insights/research about AI
Further observation about that second sentence.

Nicholas / Heather Kross May 10, 2024, 9:44 PM
3 points
0
in reply to: leogao’s comment on: Please stop publishing ideas/insights/research about AI
I think deeply understanding top tier capabilities researchers’ views on how to achieve AGI is actually extremely valuable for thinking about alignment. Even if you disagree on object level views, understanding how very smart people come to their conclusions is very valuable.
I think the first sentence is true (especially for alignment strategy), but the second sentence seems sort of… broad-life-advice-ish, instead of a specific tip? It’s a pretty indirect help to most kinds of alignment.
Otherwise, this comment’s points really do seem like empirical things that people could put odds or ratios on. Wondering if a more-specific version of those “AI Views Snapshots” would be warranted, for these sorts of “research meta-knowledge” cruxes. Heck, it might be good to have lots of AI Views Snapshot DLC Mini-Charts, from for-specific-research-agendas(?) to internal-to-organizations(?!?!?!?).

Nicholas / Heather Kross May 10, 2024, 1:49 AM
2 points
0
on: LessOnline (May 31—June 2, Berkeley, CA)
I can’t make this one, but I’d love to be at future LessOnline events when I’m less time/budget-constrained! :)

Nicholas / Heather Kross May 2, 2024, 11:19 PM
2 points
0
on: Unintentionally Creating Value
First link is broken.

Nicholas / Heather Kross May 2, 2024, 4:17 PM
−5 points
−8
on: Please stop publishing ideas/insights/research about AI
“But my ideas are likely to fail! Can I share failed ideas?”: If you share a failed idea, that saves the other person time/effort they would’ve spent chasing that idea. This, of course, speeds up that person’s progress, so don’t even share failed ideas/experiments about AI, in the status quo.
“So where do I privately share such research?” — good question! There is currently no infrastructure for this. I suggest keeping your ideas/insights/research to yourself. If you think that’s difficult for you to do, then I suggest not thinking about AI, and doing something else with your time, like getting into factorio 2 or something.
“But I’m impatient about the infrastructure coming to exist!”: Apply for a possibly-relevant grant and build it! Or build it in your spare time. Or be ready to help out if/when someone develops this infrastructure.
“But I have AI insights and I want to convert them into money/career-capital/personal-gain/status!”: With that kind of brainpower/creativity, you can get any/all of those things pretty efficiently without publishing AI research, working at a lab, advancing a given SOTA, or doing basically (or literally) anything that differentially speeds up AI capabilities. This, of course, means “work on the object-level problem, without routing that work through AI capabilities”, which is often as straightforward “do it yourself”.
“But I’m wasting my time if I don’t get involved in something related to AGI!”: “I want to try LSD, but it’s only available in another country. I could spend my time traveling to that country, or looking for mushrooms, or even just staying sober. Therefore, I’m wasting my time unless I immediately inject 999999 fentanyl.”

Nicholas / Heather Kross Apr 19, 2024, 5:37 PM
5 points
0
in reply to: Ben Pace’s comment on: LessOnline Updates Thread
How scarce are tickets/”seats”?

Nicholas / Heather Kross Apr 2, 2024, 12:05 AM
10 points
0
on: Introducing Open Asteroid Impact
I will carefully hedge my investment in this company by giving it $325823e7589245728439572380945237894273489, in exchange for a board seat so I can keep an eye on it.

Nicholas / Heather Kross Apr 2, 2024, 12:03 AM
6 points
1
on: Introducing Open Asteroid Impact
I have over 5 Twitter followers, I’ll take my board seat when ur ready

Nicholas / Heather Kross Mar 13, 2024, 5:20 PM
7 points
0
in reply to: Kaj_Sotala’s comment on: Why I no longer identify as transhumanist
Giving up on transhumanism as a useful idea of what-to-aim-for or identify as, separate from how much you personally can contribute to it.
More directly: avoiding “pinning your hopes on AI” (which, depending on how I’m supposed to interpret this, could mean “avoiding solutions that ever lead to aligned AI occurring” or “avoiding near-term AI, period” or “believing that something other than AI is likely to be the most important near-future thing”, which are pretty different from each other, even if the end prescription for you personally is (or seems, on first pass, to be) the same.), separate from how much you personally can do to positively affect AI development.
Then again, I might’ve misread/misinterpreted what you wrote. (I’m unlikely to reply to further object-level explanation of this, sorry. I mainly wanted to point out the pattern. It’d be nice if your reasoning did turn out correct, but my point is that its starting-place seems/seemed to be rationalization as per the pattern.)

Nicholas / Heather Kross

Ra­tion­al­ist Movie Reviews

Rationalist Movie Reviews