Noosphere89

Karma: 1,493

Noosphere89 26 Jul 2024 16:01 UTC
2 points
0
in reply to: chasmani’s comment on: Confusing the metric for the meaning: Perhaps correlated attributes are “natural”
This wasn’t specifically connected to the post, just providing general commentary.

Noosphere89 25 Jul 2024 16:14 UTC
3 points
1
on: Confusing the metric for the meaning: Perhaps correlated attributes are “natural”
If I were to take anything away from this, it’s that you can have cognition/intelligence that is efficient, or rational/unexploitable cognition like full-blown Bayesianism, but not both.

And that given the constraints of today, it is far better to have efficient cognition than rational/unexploitable cognition, because the former can actually be implemented, while the latter can’t be implemented at all.

Noosphere89 18 Jul 2024 20:07 UTC
6 points
1
in reply to: Dweomite’s comment on: Optimistic Assumptions, Longterm Planning, and “Cope”
My point isn’t that the easier option always exists, or even that a problem can’t be impossible.

My point is that if you are facing a problem that requires 1-shot complete plans, and there’s no second try, you need to do something else.

There is a line where a problem becomes too difficult to productively work on, and that constraint is a great sign of an impossible problem (if it exists.)

Noosphere89 18 Jul 2024 16:54 UTC
3 points
1
in reply to: mishka’s comment on: Francois Chollet inadvertently limits his claim on ARC-AGI
I was focusing on runs eligible for the prize in this short linkpost.

Noosphere89 18 Jul 2024 16:53 UTC
5 points
2
in reply to: sunwillrise’s comment on: Optimistic Assumptions, Longterm Planning, and “Cope”
Plans obviously need some robustness to things going wrong, and in a sense I agree with John Wentworth, if weakly, that some robustness is a necessary feature of a plan, and some verification is actually necessary.

But I have to agree that there is a real failure mode identified by moridinamael and Quintin Pope, and that is perfectionism, meaning that you discard ideas too quickly as not useful, and this constraint is the essence of perfectionism:

I have an exercise where I give people the instruction to play a puzzle game (“Baba is You”), but where you normally have the ability to move around and interact with the world to experiment and learn things, instead, you need to make a complete plan for solving the level, and you aim to get it right on your first try.

It asks for both a complete plan to solve the whole level, and also asks for the plan to work on the first try, which outside of this context implies either the problem is likely unsolvable or you are being too perfectionist with your demands.

In particular, I think that Quintin Pope’s comment here is genuinely something that applies in lots of science and problem solving, and that it’s actually quite difficult to reasoin well about the world in general without many experiments.

Noosphere89 18 Jul 2024 16:36 UTC
4 points
0
on: Optimistic Assumptions, Longterm Planning, and “Cope”
What I take away from this is that they should have separated the utility from an assumption being true, from the probability/likelihood of an assumption being true, and indeed this shows some calibration problems.

There is slipping into more convenient worlds for reasons based on utility rather than evidence, which is a problem (assuming it’s solvable for you.)

This is an important takeaway, but I don’t think your other takeaways help as much as this one.

That said, this constraint IRL makes almost all real-life problems impossible for humans and AIs:

I have an exercise where I give people the instruction to play a puzzle game (“Baba is You”), but where you normally have the ability to move around and interact with the world to experiment and learn things, instead, you need to make a complete plan for solving the level, and you aim to get it right on your first try.

In particular, if such a constraint exists, then it’s a big red flag that the problem you are solving is impossible to solve, given that constraint.

Almost all plans fail on the first try, even for really competent plans and humans, and outside of very constrained regimes, 0 plans work out on the first try.

Thus, if you are truly in a situation where you are encountering such constraints, you should give up on the problem ASAP, and rest a little to make sure that the constraint actually exists.

So while this is a fun experiment, with real takeaways, I’d warn people that constraining a plan to work on the first try and requiring completeness makes lots of problems impossible to solve for us humans and AIs.

Francois Chollet inadvertently limits his claim on ARC-AGI

Noosphere8916 Jul 2024 17:32 UTC

12 points

3 comments1 min readLW link

(x.com)

Noosphere89 12 Jul 2024 15:23 UTC
2 points
0
in reply to: gwern’s comment on: Paper: LLMs trained on “A is B” fail to learn “B is A”
Very interesting. Yeah, I’m starting to doubt the idea that Reversal Curse is any sort of problem for LLMs at all, and is probably trivial to fix.

Noosphere89 12 Jul 2024 4:19 UTC
2 points
0
in reply to: jacquesthibs’s comment on: jacquesthibs’s Shortform
In retrospect, I probably should have updated much less than i did, I though that it was actually testing a real LLM, which makes me less confident in the paper.

Should have responded long ago, but responding now.

Noosphere89 11 Jul 2024 21:14 UTC
2 points
0
in reply to: Steven Byrnes’s comment on: Daniel Kokotajlo’s Shortform
Where are your DMs so I can get the links?

Noosphere89 6 Jul 2024 3:27 UTC
2 points
0
in reply to: Dalcy’s comment on: When Are Results from Computational Complexity Not Too Coarse?
Yeah, I probably messed up here quite a bit, sorry.

Noosphere89 4 Jul 2024 15:48 UTC
2 points
0
in reply to: Vladimir_Nesov’s comment on: Static Analysis As A Lifestyle
The point is that it can get really, really hard for a static analyzer to be complete if you ask for enough generality in your static analyzer.

The proof basically works by showing that if you figured out a way to say, automatically find bugs in programs and making sure the program meets the specification, or figuring out whether a program actually is platonically implementing the square function infallibly, or any other program that identifies non-trivial, semantic properties, we could convert it into a program that solves the halting problem, and thus the program must be at least be able to solve all recursively enumerable problems.

For a more in practice example of static analysis being hard, I’d say a lot of NP and Co-NP completeness results of lots of problems, or even PSPACE-completeness for problems like model checking show that unless huge assumptions are made about physics, completeness of static analysis is a pipe dream for even limited areas.

Static analysis will likely be very hard for a long time to come.

Noosphere89 4 Jul 2024 15:14 UTC
2 points
0
on: Static Analysis As A Lifestyle
I want to point out that one other big reason for static analysis being incomplete in practice is that it’s basically impossible to get completeness of static analysis for lots of IRL stuff, even in limited areas without huge discoveries in physics that would demand extraordinary evidence, and the best example of this is Rice’s theorem:

https://en.wikipedia.org/wiki/Rice’s_theorem

Which is a huge limiter to how much we can perform static analysis IRL, though a more relevant result would probably be the Co-NP completeness result for Tautology problems, which again are related to static analysis.

Noosphere89 4 Jul 2024 1:56 UTC
7 points
3
on: When Are Results from Computational Complexity Not Too Coarse?
While this is a useful result, I’d caution that lots of NP-complete problems are not like this, where the parameterized complexity is easy while the general complexity is hard, and assuming FPT != W[1], lots of NP-complete problems like the Clique problem are still basically impossible to solve in practice, so be wary of relying on parameterized complexity too much.

That also neatly solves the issue of whether P vs NP matters in practice: The answer is very likely yes, it does matter a lot in practice.

Noosphere89 4 Jul 2024 1:47 UTC
2 points
0
in reply to: Dalcy’s comment on: Darcy’s Shortform
as an aside, does the P vs NP distinction even matter in practice?

Yes, it does, for several reasons:
1. It basically is necessary to prove P != NP to get a lot of other results to work, and for some of those results, proving P != NP is sufficient.
2. If P != NP (As most people suspect), it fundamentally rules out solving lots of problems generally and quickly without exploiting structure, and in particular lets me flip the burden of proof to the algorithm maker to explain why their solution to a problem like SAT is efficient, rather than me having to disprove the existence of an efficient algorithm.
It’s either by exploiting structure, somehow having a proof that P=NP, or relying on new physics models that enable computing NP-complete problems efficiently, and the latter 2 need very, very strong evidence behind them.

This in particular applies to basically all learning problems in AI today.
1. It explains why certain problems cannot be reasonably solved optimally, without huge discoveries, and the best examples are travelling salesman problems for inability to optimally solve, as well as a whole lot of other NP-complete problems. There are also other NP problems where there isn’t a way to solve them efficiently at all, especially if FPT != W[1] holds.
Also a note that we also expect a lot of NP-complete problems to also not be solvable by fast algorithms even in the average case, which basically means it’s likely to be very relevant quite a lot of the time, so we don’t have to limit ourselves to the worst case either.

Noosphere89 2 Jun 2024 17:07 UTC
4 points
0
on: Value Claims (In Particular) Are Usually Bullshit
The big reason that value claims tend to be on the more bullshit side is that values/morality has far, far more degrees of freedom than most belief claims, primarily because there are too many right answers to the question of what is ethical.

Belief claims can also have sort of effect (I believe the Mathematical Multiverse/Simulation Hypothesis idea by Max Tegmark and others like Nick Bostrom, while true, are basically useless claims for almost any attempt at prediction because they allow basically everything to be predicted, so it’s an extremely weak predictive model, as opposed to an extremely strong generative model, which is why I hate the discourse on the Simulation/Mathematical hypotheses.), but value claims tend to be worst offenders of not being entangled and having far too many right answers.

Noosphere89 17 May 2024 23:06 UTC
2 points
0
in reply to: Thomas Kwa’s comment on: Catastrophic Goodhart in RL with KL penalty
My expectation is that error and utility are both extremely heavy tailed, and arguably in the same order of magnitude for heavy tails.

But thanks for answering, the real answer is we can predict effectively nothing without independence, and thus we can justify virtually every outcome of real-life Goodhart.

Maybe it’s catastrophic, maybe it doesn’t matter, or maybe there’s anti-goodhart, but I don’t see a way to predict what will reasonably happen.

Also, why do you think that error is heavier tailed than utility?

Noosphere89 17 May 2024 17:28 UTC
2 points
0
on: Catastrophic Goodhart in RL with KL penalty
I have a question about this post, and it has to do with the case where both utility and error are heavy tailed:

Where does the expected value converge to if both utility and errors are heavy tailed? Is it 0, infinity, some other number, or does it not converge to any number at all?

Noosphere89 2 May 2024 22:12 UTC
4 points
2
in reply to: mako yass’s comment on: Please stop publishing ideas/insights/research about AI

Privacy of communities isn’t a solvable problem in general, as soon as your community is large enough to compete with the adversary, it’s large enough and conspicuous enough that the adversary will pay attention to it and send in spies and extract leaks.

I disagree with this in theory as a long-term concern, but yes in practice the methods to have privacy of communities haven’t been implemented or tested at all, and I agree with the general sentiment that it isn’t worth the steep drawbacks of privacy to protect secrets, which does unfortunately make me dislike the post due to it’s strength of recommendations.

So while I could in theory disagree with you, in practice right now I mostly have to agree with the comment that there will not be such an infrastructure for private alignment ideas.

Also to touch on something here that isn’t too relevant and could be considered a tangent:

If your acceptable lower limit for basically anything is zero you wont be allowed to do anything, really anything.

This is why perfectionism is such a bad thing, and why you need to be able to accept that failure happens. You cannot have 0 failures IRL.

Noosphere89 2 May 2024 14:59 UTC
1 point
−1
in reply to: tlevin’s comment on: tlevin’s Shortform

Unless you’re talking about financial conflicts of interest, but there are also financial incentives for orgs pursuing a “radical” strategy to downplay boring real-world constraints, as well as social incentives (e.g. on LessWrong IMO) to downplay boring these constraints and cognitive biases against thinking your preferred strategy has big downsides.

It’s not just that problem though, they will likely be biased to think that their policy is helpful for safety of AI at all, and this is a point that sometimes gets forgotten.

But correct on the fact that Akash’s argument is fully general.

Noosphere89

Fran­cois Chol­let in­ad­ver­tently limits his claim on ARC-AGI

Francois Chollet inadvertently limits his claim on ARC-AGI