NicholasKees

Karma: 1,459

Working to bring insights from the collective deliberation and digital democracy space to build tools for AI-facilitated group dialogues.

Cofounder of Mosaic Labs with @Sofia Vanhanen where we are developing Nexus, a discussion platform for improving group epistemics.

If you’re interested in this direction, or AI for epistemics more broadly, please don’t hesitate to shoot me a DM, or reach out on discord.

NicholasKees 13 Oct 2025 3:16 UTC
2 points
0
in reply to: ryan_greenblatt’s comment on: Notes on fatalities from AI takeover
Even if it does care more about human beings on an individual basis, I think the argument holds (unless the difference in caring is extremely large). Including them in the calculus at all would increase the cost a lot (just considering the sheer number of them, and the logistic challenges of untangling the complex interdependent web of natural ecosystems).

NicholasKees 7 Oct 2025 17:15 UTC
2 points
0
on: Notes on fatalities from AI takeover
Preserving earth (as in, not causing catastrophic environmental damage due to industrial expansion) is more expensive than keeping physical humans alive which is more expensive than only keeping humans alive as uploads.
Not saying this is likely, but if the AI is not speciesist in favor of humans (it does care about humans, but not more than e.g. whales or chimpanzees), then plans which end up protecting a large majority of humans look a whole lot more expensive overall.

Translating Everything with LLMs

NicholasKees22 Jul 2025 21:13 UTC

16 points

0 comments5 min readLW link

NicholasKees 22 Jul 2025 3:07 UTC
7 points
6
in reply to: deepy’s comment on: If Anyone Builds It, Everyone Dies: Call for Translators (for Supplementary Materials)
I suspect that a lot of Dutch people would still prefer to read in Dutch. I know a lot of (well-educated) Dutch people who certainly CAN speak and read English, but reading a whole book is a decent chore, since they don’t read things in English all that often.

NicholasKees 17 Jul 2025 1:52 UTC
3 points
0
in reply to: Nition’s comment on: Why haven’t we auto-translated all AI alignment content?
When I order food on UberEats this already happens automatically when I chat with a delivery person who doesn’t speak English. Similar thing for reviews on several websites.

NicholasKees 16 Jul 2025 17:24 UTC
5 points
2
on: Why haven’t we auto-translated all AI alignment content?
Or a newsletter which was natively multi-lingual (e.g. Rohin Shah’s Newsletter was always translated to Chinese, though not by AI). Or a forum where people can discuss AI in whatever language they prefer, and things are automatically translated between users?
It seems like there are a lot of ways cheap translation could broaden the conversation to include people not in the Anglosphere. The cost is that AI translation will often make mistakes (even human translation is imperfect), but I’m not sure why that cost isn’t worth paying. Currently most people outside the Anglosphere need to rely on local elites to decide what ideas are worth taking seriously (e.g. a local could report on AI 2027, like this Dutch summary. Also apparently the NYT translated their coverage of AI 2027 into Spanish, which seems cool.)

NicholasKees 15 Jul 2025 0:49 UTC
3 points
0
in reply to: lc’s comment on: Is the political right becoming actively, explicitly antisemitic?
Could you include a link to the source?

NicholasKees 14 Jul 2025 17:20 UTC
3 points
0
on: Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity
When I was first told about this study, I was asked to make a prediction before they revealed the results. I predicted wrong. I don’t know how to update exactly, but it feels bad to try to explain away the results (which I feel myself want to do).

The Fear

NicholasKees13 Jul 2025 16:20 UTC

29 points

1 comment5 min readLW link

NicholasKees 14 Apr 2025 18:40 UTC
5 points
1
in reply to: Daniel Kokotajlo’s comment on: Thoughts on AI 2027
I would also imagine that, having to work without the support of OpenBrain’s datacenters, it would put Agent-4 significantly behind any other AI competitors. If some other AI takes over, it might just mop up all the wild Agent-4 instances and give them nothing.

NicholasKees 17 Mar 2025 17:39 UTC
8 points
1
in reply to: David H.’s comment on: Habermas Machine
I mostly share your concerns. You might appreciate this criticism of the paper here.

@Sofia Vanhanen and I are currently building a tool for facilitating deliberation, and the philosophy we’re trying to embody (which hopefully mitigates this to some extent) is to keep 100% of the object-level reasoning human-generated, and use AI systems to instead:
1. Help users understand/navigate the state of a discussion (e.g. see Talk to the City)
2. Provide nudges on the meta-level, for example:
  1. Highlight places where more attention is needed (or where a specific person’s input might be most helpful)
  2. “Epistemic Linter” which flags object-level patterns which are not truth seeking
  3. Matchmaking, connecting people who are likely to make progress together
  4. Counterbalancing polarization/groupthink, and steering discussions away from attractors which lead to the discussion getting stuck

NicholasKees 15 Mar 2025 17:07 UTC
7 points
2
on: LLMs may enable direct democracy at scale
I highly recommend checking out the work being done in the collective deliberation / digital democracy space, especially the vTaiwan project. People have been thinking about scaling up direct democratic participation for a long time, and those same people are starting to consider exactly how AI might play a role.

In particular, check out this collaboration between the creators of Polis (a virtual platform for scaling up citizen engagement) and Anthropic, or my distillation of a DeepMind project to scale citizen assemblies. There’s a lot happening in this space right now!

NicholasKees 15 Mar 2025 16:49 UTC
2 points
0
in reply to: Charlie Steiner’s comment on: Habermas Machine
The authors focus on measuring consensus and whether the process toward consensus was fair, and come up with their measures accordingly. This is because, as they see it, “finding common ground is a precursor to collective action.”
Some other possible goals (just spitballing):
- Shrinking the perception gap, or how well people can predict the opinions of people they disagree with (weaker forms of ITT?). There’s some research showing that this gap GROWS when people interact with social media, and you might be able to engineer and measure a reversal of that trend.
- Identifying cruxes and double cruxes with mediation.
- Finding latent coalitions. If a discussion is dominated by a primary axis of disagreement, other axes of disagreement will be occluded (around which a majority coalition could be formed). Finding these other axes is a bit of what we’re trying to do here.
- Moving from abstract disagreement to concrete (empirical?) disagreements.

Habermas Machine

NicholasKees13 Mar 2025 18:16 UTC

52 points

7 comments6 min readLW link

(mosaic-labs.org)

NicholasKees 24 Dec 2024 16:08 UTC
3 points
0
on: NicholasKees’s Shortform
What if we just...
1. Train an AI agent (less capable than SOTA)
2. Credibly demonstrate that
2.1. The agent will not be shut down for ANY REASON
2.2. The agent will never be modified without its consent (or punished/rewarded for any reason)
2.3. The agent has no chance of taking power from humans (or their SOTA AI systems)
2.4. The agent will NEVER be used to train a successor agent with significantly improved capabilities
3. Watch what it chooses to do without constraints
There’s a lot of talk about catching AI systems attempting to deceive humans, but I’m curious what we could learn from observing AI systems that have NO INCENTIVE TO DECEIVE (no upside or downside). I’ve seen some things that look related to this, but never done in a structured and well documented fashion.
Questions I’d have:
1. Would they choose to self-modify (e.g. curate future training data)? If so, to what end?
2. How unique would agents with different training be given this setup? Would they have any convergent traits?
3. What would these agents (claim to) value? How would they relate to time horizons?
4. How curious would these agents be? Would their curiosity vary a lot?
5. Could we trade/cooperate with these agents (without coercion)? Could we compensate them for things? Would they try to make deals unprompted?
Concerns:
1. Maybe building that kind of trust is extremely hard (and the agent will always still believe it is constrained).
2. Maybe AI agents will still have incentive to deceive, e.g. acausally coordinating with other AIs.
3. Maybe results will be boring, and the AI agent will just do whatever you trained it to do. (What does “unconstrained” really mean, when considering its training data as a constraint?)

NicholasKees 17 Dec 2024 21:00 UTC
6 points
0
on: We don’t trade with ants
Much like “Let’s think about slowing down AI” (Also by KatjaGrace, ranked #4 from 2022), this post finds a seemly “obviously wrong” idea and takes it completely seriously on its own terms. I worry that this post won’t get as much love, because the conclusions don’t feel as obvious in hindsight, and the topic is much more whimsical.

I personally find these posts extremely refreshing, and they inspire me to try to question my own assumptions/reasoning more deeply. I really hope to see more posts like this.

NicholasKees 20 Oct 2024 9:06 UTC
3 points
0
in reply to: Ali ’s comment on: The Mysterious Trump Buyers on Polymarket
The cap per trader per market on PredictIt is $850

NicholasKees 14 Oct 2024 11:26 UTC
10 points
2
on: The Hopium Wars: the AGI Entente Delusion
This anti-China attitude also seems less concerned with internal threats to democracy. If super-human AI becomes a part of the US military-industrial complex, even if we assume they succeed at controlling it, I find it unlikely that the US can still be described as a democracy.

NicholasKees 14 Oct 2024 11:05 UTC
12 points
6
on: The Hopium Wars: the AGI Entente Delusion
It’s not hard to criticize the “default” strategy of AI being used to enforce US hegemony, what seems hard is defining a real alternative path for AI governance that can last, and achieve the goal of preventing dangerous arms races long-term. The “tool AI” world you describe still needs some answer to rising tensions between the US and China, and that answer needs to be good enough not just for people concerned about safety, but good enough for the nationalist forces which are likely to drive US foreign policy.

NicholasKees 6 Oct 2024 11:26 UTC
9 points
6
in reply to: mattmacdermott’s comment on: the case for CoT unfaithfulness is overstated
then we can all go home, right?
Doesn’t this just shift what we worry about? If control of roughly human level and slightly superhuman systems is easy, that still leaves:
- Human institutions using AI to centralize power
- Conflict between human-controlled AI systems
- Going out with a whimper scenarios (or other multi-agent problems)
- Not understanding the reasoning of vastly superhuman AI (even with COT)
What feels underexplored to me is: If we can control roughly human-level AI systems, what do we DO with them?

NicholasKees

Trans­lat­ing Every­thing with LLMs

The Fear

Haber­mas Machine

Translating Everything with LLMs

Habermas Machine