Rewards and Utilities are different concepts. Rejecting the claim that reward is necessary to get or build agency is not the same thing as rejecting EU maximization as a basin of idealized agency.
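To make the distinction concrete, here is a minimal sketch (in Python, with a made-up toy choice problem; the outcomes, utilities, and beliefs are all assumptions for illustration) of an expected-utility maximizer that never touches a reward signal: it only needs a utility function over outcomes and beliefs about what each action causes. No reward-based training appears anywhere, yet the agent is a textbook EU maximizer.

```python
# Toy illustration (hypothetical setup): an EU maximizer needs only
# a utility function over outcomes and beliefs about action effects.
# No scalar reward signal, and no reward-based learning, is involved.

outcomes = ["apple", "orange", "nothing"]

# Utility over outcomes: a fixed preference ordering, not a learned reward.
utility = {"apple": 1.0, "orange": 0.6, "nothing": 0.0}

# Beliefs: P(outcome | action), assumed given rather than trained.
beliefs = {
    "reach_left":  {"apple": 0.8, "orange": 0.1, "nothing": 0.1},
    "reach_right": {"apple": 0.1, "orange": 0.7, "nothing": 0.2},
}

def expected_utility(action: str) -> float:
    return sum(p * utility[o] for o, p in beliefs[action].items())

# Idealized agency as EU maximization: pick the action with highest EU.
best = max(beliefs, key=expected_utility)
print(best, expected_utility(best))  # reach_left 0.86
```

The point of the sketch is that "utility" here is just a representation of coherent preferences over outcomes, while "reward" names a training signal; denying that the latter is needed for agency leaves the former picture of idealized agency untouched.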
Announcing “Key Phenomena in AI Risk” (facilitated reading group)
As an addendum, it seems to me that you may not necessarily need a 'long-term planner' (or 'time-unbounded agent') in the environment. A similar outcome may also be attainable if the environment contains a tiling of time-bound agents who can all trade with one another in ways such that the overall trade network implements long-term power seeking.
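A toy simulation may make this legible. Below is a minimal sketch (Python; the growth factor, fee, and handover dynamics are entirely made-up assumptions, not a model from the original comment) of a chain of one-step agents. Each agent is purely myopic, comparing only its own immediate payoffs, yet because each one's trade passes a grown resource stock forward, the network as a whole accumulates resources without bound, which serves here as a crude proxy for long-term power seeking.

```python
# Toy model (all parameters are assumptions for illustration): a chain
# of one-step ("time-bound") agents. Each agent only compares its own
# immediate payoffs, yet the trade network accumulates resources.

GROWTH = 3.0   # assumed one-step return on invested stock
FEE = 0.4      # fraction of the grown stock paid back to the seller

def myopic_choice(stock: float) -> tuple[str, float, float]:
    """One agent's decision; no planning beyond its own lifetime."""
    consume_payoff = stock                # eat the stock now
    sell_payoff = FEE * GROWTH * stock    # invest, sell the claim forward
    if sell_payoff > consume_payoff:      # holds whenever FEE * GROWTH > 1
        passed_on = (1 - FEE) * GROWTH * stock
        return "sell", sell_payoff, passed_on
    return "consume", consume_payoff, 0.0

stock = 1.0
for t in range(5):
    action, payoff, stock = myopic_choice(stock)
    print(f"t={t}: {action}, payoff={payoff:.2f}, passed on={stock:.2f}")

# Each agent acts in its own short-term interest, but the passed-on
# stock grows by (1 - FEE) * GROWTH = 1.8x per step, indefinitely.
```

The design choice worth noting: no single agent in the chain has a long horizon or a power-seeking objective; the accumulation is a property of the trade structure, which is the claim the addendum is making.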
Reflections on the PIBBSS Fellowship 2022
Concept Dictionary.
Concepts that I intend to use or invoke in my later writings, or that are part of my reasoning about AI risk or related complex-systems phenomena.
My understanding of Steel Late Wittgenstein's response would be that you could agree that words and concepts are distinct, and that the mapping between them is not always one-to-one, while still holding that which concepts get used is significantly influenced by which features of the world are useful in particular contexts of language (/word) use.