RussellThor

Karma: 307

RussellThor 17 May 2024 10:02 UTC
1 point
0
in reply to: RobertM’s comment on: Against “argument from overhang risk”
In terms of the big labs being inefficient, with hindsight perhaps. Anyway I have said that I can’t understand why they aren’t putting much more effort into Dishbrain etc. If I had ~$1B and wanted to get ahead on a 5 year timescale I would give it more probability expectation etc.
For
1. I am here for credibility. I am sufficiently highly confident they are not X-risk to not want to recommend stopping. I want the field to have credibility for later.
2. Yes, but I don’t think stopping the training runs is much of an otherwise good thing if at all. To me it seems more like inviting a fire safety expert and they recommend a smoke alarm in your toilet but not kitchen. If we can learn alignment stuff from such training runs, then stopping is an otherwise bad thing.
3. OK I’m not up with the details but some experts sure think we learnt a lot from 3.5/4.0. Also my belief about it often being a good idea to deploy the most advanced non X-risk AI as defense. (This is somewhat unclear, usually what doesn’t kill makes stronger, but I am concerned about AI companion/romantic partner etc. That could weaken society in a way to make it more likely to make bad decisions later. But that seems to have already happened and very large models being centralized could be secured against more capable/damaging versions.)

RussellThor 17 May 2024 9:15 UTC
1 point
0
in reply to: RobertM’s comment on: Against “argument from overhang risk”
If you are referring to this:
If we institute a pause, we should expect to see (counterfactually) reduced R&D investment in improving hardware capabilities, reduced investment in scaling hardware production, reduced hardware production, reduced investment in research, reduced investment in supporting infrastructure, and fewer people entering the field.
This seems an extreme claim to me (if these effects are argued to be meaningful), especially “fewer people entering the field”! Just how long do you think you would need a pause to make fewer people enter the field? I would expect that not only would the pause have to have lasted say 5+ years but there would have to be a worldwide expectation that it would go on for longer to actually put people off.
Because of flow on effects and existing commitments, reduced hardware R&D investment wouldn’t start for a few years either. Its not clear that it will meaningfully happen at all if we want to deploy existing LLM everywhere also. For example in robotics I expect there will be substantial demand for hardware even without AI advances as our current capabilities havn’t been deployed there yet.
As I have said here, and probably in other places, I am quite a bit more in favor of directly going for a hardware pause specifically for the most advanced hardware. I think it is achievable, impactful, and with clearer positive consequences (and not unintended negative ones) than targeting training runs of an architecture that already seems to be showing diminishing returns.
If you must go for after FLOPS for training, then build in large factors of safety for architectures/systems that are substantially different from what is currently done. I am not worried about unlimited FLOPS on GPT-X but could be for >100* less on something that clearly looks like it has very different scaling laws.

RussellThor 16 May 2024 22:28 UTC
5 points
0
on: Against “argument from overhang risk”
First, a pause straightforwardly buys you time in many worlds where counterfactual (no-pause) timelines were shorter than the duration of the pause.
Only if you pause everything that could bring ASI. That is hardware, training runs, basic science on learning algorithms, brain studies etc.

RussellThor 16 May 2024 21:19 UTC
3 points
0
on: Against “argument from overhang risk”
Another perspective.
If you believe like me that it is >90% that the current LLM approach is plateauing then your cost/benefit for pausing large training runs is different. I believe that the current AI lacks something like the generalization power of the human brain, this can be seen where Tesla auto-pilot has needed >10,000* the training data as a person and is still not human level. This could potentially be overcome by a better architecture, or could require different hardware as well because of the Von Neumann Bottleneck. If this is the case then a pause on large training runs can hardly be helpful. I believe that if LLM are not X-risk, then their capabilities should be fully explored and integrated fast into society to provide defense against more dangerous AI. It is a radically improved architecture or hardware that you should be worried about.
Three potential sources of danger
1. Greatly improved architecture
2. Large training run with current arch
3. Greatly improved HW
We are paying more attention to (2) when to me it is the least impactful of the three and could even hurt. There are obvious ways this can hurt the cause.
1. If such training runs are not dangerous then the AI safety group loses credibility.
2. It could give a false sense of security when a different arch requiring much less training appears and is much more dangerous than the largest LLM.
3. It removes the chance to learn alignment and safety details from such large LLM
A clear path to such a better arch is studying neurons. Whether this is Dishbrain, through progress in neural interfaces, brain scanning or something else, I believe it is very likely by 2030 we will have understood the brain/neural algorithm, characterized it pretty well and of course have the ability to attempt to implement it in our hardware.
So in terms of pauses, I think one targeted towards chip factories is better. It is achievable and it is clear to me that if you delay a large factory opening by 5 years, then you can’t make up the lost time in anything like the same way for software.
Stopping (1) seems impossible i.e. “Don’t study the human brain” seems likely to backfire. We would of course like some agreement that if a much better arch is discovered, it isn’t immediately implemented.

RussellThor 16 May 2024 2:20 UTC
4 points
0
in reply to: Seth Herd’s comment on: Instruction-following AGI is easier and more likely than value aligned AGI
I think value alignment will be expected/enforced as a negative to some extent. E.g. don’t do something obviously bad (many such things are illegal anyway) and I expect that constraint to get tighter. That could give some kind of status quo bias on what AI tools are allowed to do also as an unknown new thing could be bad or seen as bad.
Already the AI could “do what I mean and check” a lot better. for coding tasks etc it will often do the wrong thing when it could clarify. I would like to see a confidence indicator that it knows what I want before it continues. I don’t want to guess how much to clarify which what I currently have to do—this wastes time and mental effort. You are right there will be commercial pressure to do something at least somewhat similar.

RussellThor 14 May 2024 20:17 UTC
3 points
0
in reply to: jacquesthibs’s comment on: OpenAI releases GPT-4o, natively interfacing with text, voice and vision
How soon with what degree of confidence do you have? I think they have a big slower model that isn’t that much of a performance improvement and hardly economic to release.

RussellThor 14 May 2024 20:12 UTC
3 points
0
in reply to: mishka’s comment on: OpenAI releases GPT-4o, natively interfacing with text, voice and vision
ChatGPT interface like I usually do for GPT4.0. some GPT4.0 queries done by cursor AI IDE

RussellThor 14 May 2024 7:43 UTC
16 points
1
on: OpenAI releases GPT-4o, natively interfacing with text, voice and vision
I have just used it for coding for 3+ hours and found it quite frustrating. Definitely faster than GPT 4.0 but less capable. More like an improvement for 3.5. To me a seems a lot like LLM progress is plateauing.
Anyway in order to be significantly more useful a coding assistant needs to be able to see debug output, in mostly real time, have the ability to start/stop the program, automatically make changes, keep the user in the loop and read/use GUI as that is often an important part of what we are doing. I havn’t used any LLM that are even low-average ability at debugging kind of thought processes yet.

RussellThor 7 May 2024 20:50 UTC
1 point
0
in reply to: Radford Neal’s comment on: Rapid capability gain around supergenius level seems probable even without intelligence needing to improve intelligence
Not following—where could the ‘low hanging fruit’ possibly be hiding? We have many of “Other attributes conducive to breakthroughs are a …” in our world of 8 billion. The data strongly suggests we are in diminishing returns. What qualities could an AI of Einstein intelligence realistically have that would let it make such progress where no person has. It would seem you would need to appeal to other less well defined qualities such as ‘creativity’ and argue that for some reason the AI would have much more of that. But that seems similar to just arguing that it in fact has > Einstein intelligence.

RussellThor 7 May 2024 5:30 UTC
6 points
4
on: Rapid capability gain around supergenius level seems probable even without intelligence needing to improve intelligence
Capabilities are likely to cascade once you get to Einstein-level intelligence, not just because an AI will likely be able to form a good understanding of how it works and use this to optimize itself to become smarter^[4]^[5], but also because it empirically seems to be the case that when you’re slightly better than all other humans at stuff like seeing deep connections between phenomena, this can enable you to solve hard tasks like particular research problems much much faster (as the example of Einstein suggests).
1. Aka: Around Einstein-level, relatively small changes in intelligence can lead to large changes in what one is capable to accomplish.
OK but if that were true then there would have been many more Einstein like breakthroughs since then. More likely is that such low hanging fruit have been plucked and a similar intellect is well into diminishing returns. That is given our current technological society and >50 year history of smart people trying to work on everything if there are such breakthroughs to be made, then the IQ required is now higher than in Einsteins day.

RussellThor 25 Apr 2024 5:54 UTC
1 point
0
in reply to: Wei Dai’s comment on: AI Regulation is Unsafe
No I have not seen a detailed argument about this, just the claim that once centralization goes past a certain point there is no coming back. I would like to see such an argument/investigation as I think it is quite important. “Yuval Harari” does say something similar in “Sapiens”

RussellThor 24 Apr 2024 10:05 UTC
8 points
7
in reply to: Daniel Kokotajlo’s comment on: AI Regulation is Unsafe
There is a belief among some people that our current tech level will lead to totalitarianism by default. The argument is that with 1970′s tech the soviet union collapsed, however with 2020 computer tech (not needing GenAI) it would not. If a democracy goes bad, unlike before there is no coming back. For example Xinjiang—Stalin would have liked to do something like that but couldn’t. When you add LLM AI on everyone’s phone + Video/Speech recognition, organized protest is impossible.
Not sure if Rudi C is making this exact argument. Anyway if we get mass centralization/totalitarianism worldwide, then S risk is pretty reasonable. AI will be developed under such circumstances to oppress 99% of the population—then goes to 100% with extinction being better.
I find it hard to know how likely this is. Is clear to me that tech has enabled totalitarianism but hard to give odds etc.

RussellThor 19 Apr 2024 21:31 UTC
1 point
−2
in reply to: cousin_it’s comment on: When is a mind me?
Such optimizations are a reason I believe we are not in a simulation. Optimizations are essential for a large sim. I expect them not to be consciousness preserving

RussellThor 17 Apr 2024 21:31 UTC
1 point
0
on: When is a mind me?
But it could matter if its digital vs continuous. <OK longer post and some thoughts a bit off topic perhaps>
Your A,B,C,D … leads to some questions about what is conscious (C) and what isn’t.
Where exactly does the system stop being conscious
1. Biological mind with neurons
2. Very high fidelity render in silicon with neurons modelled down to chemistry rather than just firing pulses
3. Classic neural net spiking approx done in discrete maths that appears almost indistinguishable to 1,2. Producing system states A,B,C,D
4. same as (3) but states are saved/retrieved in memory not calculated.
5. States retrieved from memory many times - A,B,C,D … A,B,C,D … does this count as 1 or many experiences?
6. States retrieved in mixed order A,D,C,B....
7 States D,D,D,D,A,A,A,A,B,B,B,B,C,C,C,C .. does this count 4* or nothing.
A possible cutoff is between ³⁄₄. Retrieving instead of calculating makes it non-conscious. But what about caching, some calc, some retrieved?
As you prob know this has been gone over before, e.g. Scott Aaronson. Wonder what your position is?
https://scottaaronson.blog/?p=1951
with quote:
“Maybe my favorite thought experiment along these lines was invented by my former student Andy Drucker. In the past five years, there’s been a revolution in theoretical cryptography, around something called Fully Homomorphic Encryption (FHE), which was first discovered by Craig Gentry. What FHE lets you do is to perform arbitrary computations on encrypted data, without ever decrypting the data at any point. So, to someone with the decryption key, you could be proving theorems, simulating planetary motions, etc. But to someone without the key, it looks for all the world like you’re just shuffling random strings and producing other random strings as output.
You can probably see where this is going. What if we homomorphically encrypted a simulation of your brain? And what if we hid the only copy of the decryption key, let’s say in another galaxy? Would this computation—which looks to anyone in our galaxy like a reshuffling of gobbledygook—be silently producing your consciousness?”
and last but not least:
“But, in addition to performing complex computations, or passing the Turing Test, or other information-theoretic conditions that I don’t know (and don’t claim to know), there’s at least one crucial further thing that a chunk of matter has to do before we should consider it conscious. Namely, it has to participate fully in the Arrow of Time. ”
https://www.scottaaronson.com/papers/giqtm3.pdf

RussellThor 17 Apr 2024 21:10 UTC
1 point
0
on: Moving on from community living
Sounds interesting. Always relevant because arguably the “natural state” of humans is hunter-gatherer tribes. In my country high end retirement villages are becoming very popular because of the Pro type reasons you give. It seems some retirees, and gangs! lol are most in tune with their roots.
I had half expected the communal living thing to go more mainstream by now (similar things in fiction like https://en.wikipedia.org/wiki/Too_Like_the_Lightning) It seems it needs a lot more critical mass, e.g. specifically designed house/houses to get the right balance between space and togetherness school right nearby, gated suburb etc so its child safe.
Longer term, I expect to see some interesting social stuff to come from space colonies as there kind of experiments are forced on the inhabitants.

RussellThor 7 Mar 2024 4:22 UTC
1 point
0
in reply to: Seth Herd’s comment on: Are we so good to simulate?
OK but why would you need high res for the minds? If its an ancestor sim and chatbots can already pass the Turing test etc, doesn’t that mean you can get away with compression or lower res? The major arc of history won’t be affected unless they are pivotal minds. If its possible to compress the sims so they experience lesser consciousness than us but still are very close to the real thing (and havn’t we almost already proven that can be done with our LLM’s), then an ancestor simulator would do that.

RussellThor 6 Mar 2024 0:37 UTC
1 point
0
in reply to: Seth Herd’s comment on: Are we so good to simulate?
If thats right, and its almost always low-res sims that are sufficient then that destroys the main ancestor sim argument for our conscious experience being simulated. Low res is not conscious in the same way we are, different reference class to base reality bio-consciousness

RussellThor 4 Mar 2024 6:02 UTC
20 points
3
on: Are we so good to simulate?
If Windows95 was ever conscious (shock!) it would be very sure it was in a virtual machine (i.e like simulated) if it existed at the time when VM’s existed. It would reason about Moores law/resources going up exponentially. and be convinced it was in a VM. However I am pretty sure it would be wrong most of the time? Most Win95 instances in history were not run in VM and we have stopped bothering now? An analogy sort of but gives an interesting result.

RussellThor 2 Mar 2024 6:00 UTC
3 points
0
on: RussellThor’s Shortform
Random ideas to expand on
https://www.theguardian.com/technology/2023/jul/21/australian-dishbrain-team-wins-600000-grant-to-develop-ai-that-can-learn-throughout-its-lifetime
https://newatlas.com/computers/human-brain-chip-ai/
https://newatlas.com/computers/cortical-labs-dishbrain-ethics/
Could this be cheaper than chips in an extreme silicon shortage? How did it learn, can we map connections forming and make better learning algorithms.

Birds vs ants/bees.
A flock of birds can be dumber than the dumbest individual bird, a colony of bees/ants can be smarter than than the individual, and smarter than a flock of birds! Bird avoiding predator in geometrical pattern—no intelligence as predictability like fluid has no processing. Vs bees swarming the scout hornet or ants building a bridge etc. Even though no planning in ants, no overall plan in individual neurons?
The more complex pieces the less well they fit together. Less intelligent units can form a better collective in this instance. Not like human orgs.
Progression from simple cell to mitochondria—mito have no say anymore but fit in perfectly. Multi organism like hive are next level up—simpler creatures can have more cohesion in upper level. Humans have more effective institutions in spite of complexity b/c of consciousness, language etc.
RISC vs CISC Intel vs NVIDIA, GPU for super computers. I though about this years ago, led to prediction that Intel or other CISC max business would lose to cheaper.
Time to communicate a positive singularity/utopia
Spheres of influence, like we already have, uncontacted tribes, Amish etc. Taking that further, Super AI must leave earth, perhaps solar system, enhanced ppl to of earth eco-system, space colonies, or Mars etc.
Take the best/happy nature to expand, don’t take suffering to >million stars.
Humans can’t do interstellar faster than AI anyway even if that was the goal, it would have to prepare it first, and can travel faster. So no question majority of humanity interstellar is AI. Need to keep earth for people. What is max CEV? Well keep earth ecosystem, humans can progress, discover on their own?
Is the progression to go outwards, human, posthuman/Neuralink, WBE? it is is some sci-fi Peter Hamilton/ Culture (human to WBE)
Long term all moral systems don’t know what to say on pleasure vs self determination/achievement. Eventually we run out of inventing things—should it go asymptotically slower.
Explores should be on the edge of civilization. For astronomers, shouldn’t celebrate JWST, but complain about Starlink—that is inconsistent. Edge of civilization has expanded past low earth orbit, that is why we get JWST. Obligation then to put telescopes further out.
Go to WBE instead of super AI—know for sure it is conscious.
Is industry, tech about making stuff less conscious with time? e.g. mechanical things have zero, vs a lot when done by people. Is that a principle for AI/robots? then there are no slaves etc.
Can ppl get behind this? - implied contract with future AI? acausal bargaining.
https://www.lesswrong.com/posts/qZJBighPrnv9bSqTZ/31-laws-of-fun
Turing test for WBE—how would you know?
Intelligence processing vs time
For search, exponential processing power gives linear increate in rating, Chess, Go. However this is a small search space. For life, does the search get bigger the further out you go.
e.g. 2 steps is 2^2 but 4 steps is 4^4. This makes sense if there are more things to consider the further ahead you look. e.g. house price for 1 month, general market, + economic trend. 10+ years then demographic trends, changing govt policy, unexpected changes in transport patterns, (new rail nearby or in competing suburb etc)
If applies to tech, then regular experiments shrink the search space, need physical experimentation to get ahead.
For AI, if its like intuition/search then need search to improve intuition. Can only learn from long term.
Long pause or not?
How long should we pause? 10 years? Even in stable society there is diminishing returns—seen this with pure maths, physics, philosophy, when we reach human limits, then more time simply doesn’t help. Reasonable to assume with CEV like concept also.
Pause carries danger? Is it like the clear pond before a rapid, are we already in the rapid, then trying to stop is dangerous having baby is fatal etc. “Emmett Shear” of go fast slow, stop, pause, Singularity seems ideal, though possible? WBE better than super AI—cultural as elder?
1984 quote “If you want a vision of the future, imagine a boot stamping on a human face—forever.”
“Heaven is high and the emperor is far away” is a Chinese proverb thought to have originated from Zhejiang during the Yuan dynasty.
Not possible earlier but is possible now. If democracies go to dictatorship but not back then pause is bad. Best way to keep democracies is to leave hence space colonies. Now in Xinjiang, the emperor is in your pocket, LLM can understand anything—how far back to go before this is not possible? 20 years, if not possible, then we are in the white water, and we need to paddle forwards, can’t stop.
Deep time breaks all common ethics?
Utility monster, experience machine, moral realism tiling the universe etc. Self determination and achievement will be in the extreme minority over many years. What to do, fake it forget it and keep achieving again? Just keep options open until we actually experience it.
All our training is about intrinsic motivation and valuing achievement rather than pleasure for its own sake. Great asymmetry in common thought “meaningless pleasure” makes sense and seems bad or not good, but “meaningless pain” doesn’t make it less bad. Why should that be the case. Evolution has biased us to not value pleasure or experience it as much as we “should”? Learn to take pleasure regard thinking “meaningless pleasure” is itself a defective attitude? If you could change yourself, should you dial down the need to achieve if you lived in a solved world?
What is “should” in is-ought. Moral realism in the limit? “Should” is us not trusting our reason, as we shouldn’t. If reason says one thing, then it could be flawed as it is in most cases. Especially as we evolved, then if we always trusted it, then mistakes are bigger than benefits, so the feeling “you don’t do what you should” is two systems competing, intuition/history vs new rational.

RussellThor 22 Feb 2024 6:34 UTC
2 points
0
in reply to: Ilio’s comment on: On coincidences and Bayesian reasoning, as applied to the origins of COVID-19
“most likely story you can think of that would make it be wrong”—that can be the hard part. For investments its sometimes easy—just they fail to execute, their competitors get better, or their disruption is itself disrupted.
Before the debate I put Lab leak at say 65-80%, now more like <10%. The most likely story/reason I had for natural origin being correct (before I saw the debate) was that the host was found, and the suspicious circumstances where a result of an incompetent coverup and general noise/official lies mostly by the CCP around this.
Well I can’t say for sure that LL was wrong of course, but I changed my mind for a reason I didn’t anticipate—i.e. a high quality debate that was sufficiently to my understanding.
For some other things its hard to come up with a credible story at all, i.e. AGW being wrong I would really struggle to do.