lukehmiles

Karma: 645

Opinions expressed are my own and not endorsed by anyone.

Formerly @ ARC Evals aka METR

lukehmiles 2 Apr 2024 5:43 UTC
18 points
6
on: OMMC Announces RIP
You know making the omnicide machine really is the fastest way to understand how you could have done it safely.

lukehmiles 28 Feb 2023 17:41 UTC
11 points
0
on: Buddhist Psychotechnology for Withstanding Apocalypse Stress
Could you state the problem and solution more succinctly?

lukehmiles 11 Apr 2024 21:44 UTC
10 points
8
on: lcmgcd’s Shortform
I wish LW questions had an “accepted answer” thing like stackexchange

lukehmiles 15 Mar 2024 11:24 UTC
9 points
4
on: AI #55: Keep Clauding Along
It took me 4 hours to read your newsletter. I did click some links. I need a newsletter newsletter.

lukehmiles 22 Nov 2023 11:07 UTC
8 points
0
on: Sam Altman’s sister, Annie Altman, says Sam has (severely) abused her
Re: plausibility of shadowban claims: You can pay clickfarms to mark someone as spam.

lukehmiles 27 Apr 2024 0:01 UTC
7 points
5
on: We are headed into an extreme compute overhang
This seems correct and important to me.

lukehmiles 17 Apr 2024 17:59 UTC
7 points
3
on: lcmgcd’s Shortform
I wonder how many recent trans people tried/considered doubling down on their sex (eg males taking more testosterone) instead first. Maybe (for some people) either end of gender spectrum is comfortable and being in the middle feels bad¿ Anybody know? Don’t want to ask my friends because this Q will certainly anger them

lukehmiles 13 Dec 2023 8:05 UTC
7 points
4
in reply to: GeneSmith’s comment on: Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible
This matrix closes the case in my book

lukehmiles 26 Oct 2022 10:45 UTC
LW: 7 AF: 6
2
AF
on: What does it take to defend the world against out-of-control AGIs?
There is a lot of room between “ignore people; do drastic thing” and “only do things where the exact details have been fully approved”. In other words, the Overton window has pretty wide error bars.

I would be pleased if someone sent me a computer virus that was actually a security fix. I would be pretty upset if someone fried all my gadgets. If someone secretly watched my traffic for evil AI fingerprints I would be mildly annoyed but I guess glad?

Even google has been threatening unpatched software people to patch it or else they’ll release the exploit iirc

So some of the Q of “to pivotally act or not to pivotally act” is resolved by acknowledging that extent is relevant and you can be polite in some cases

lukehmiles 12 Apr 2024 0:27 UTC
6 points
0
in reply to: gilch’s comment on: Any evidence or reason to expect a multiverse / Everett branches?
I must abstain from further culturewaring but this thought experiment is blowing my mind. I hadn’t heard of it.

lukehmiles 22 Mar 2024 7:12 UTC
6 points
0
on: “Deep Learning” Is Function Approximation
I think this is a strong starting point but I think the nice crisp “neural net = function approximator” mostly falls apart as a useful notion when you do fancy stuff with your neural net like active learning or RLAIF. Maybe it’s not technically the neural net doing that...

I guess we don’t have great terms to delineate all these levels of the system:
- code that does a forward pass (usually implicitly also describing backward pass & update given loss)
- code that does that plus training (ie data fetch and loss function)
- that plus RL environment or training set
- that plus “training scaffolding” code that eg will do active learning or restart the game if it freezes
- just code & weights for a forward pass during inference (presuming that the system has separable training and inference stages)
- that plus all the “inference scaffolding” code which will eg do censorship or internet search or calculator integration
- that plus the “inference UI”. (Consider how differently people use gpt4 api vs chatgpt website.) (This could also eg be the difference between clicking a checkbox for who to kill and clicking the cancel button on the notification!)
- the actual final system turned on in the wild with adversarial users and distribution shift and so on
I wonder if some folks are taking past each other by implicitly referring to different items above...

lukehmiles 26 Nov 2023 9:15 UTC
6 points
0
on: Moral Reality Check (a short story)
I can’t work it out myself. Please tell me the correct opinion to have

lukehmiles 18 Aug 2023 22:17 UTC
6 points
1
in reply to: lukehmiles’s comment on: Self-driving car bets
I know of one: the steam engine was “working” and continuously patented and modified for a century (iirc) before someone used it in boats at scale. https://youtu.be/-8lXXg8dWHk

lukehmiles 21 Sep 2022 11:38 UTC
6 points
1
on: You Are Not Measuring What You Think You Are Measuring
The Rationality Cheater’s Move is to have more information. Really beats analysis and reason in many cases.

lukehmiles 27 Apr 2018 1:37 UTC
6 points
0
in reply to: zarraha’s comment on: The Pyramid And The Garden
Thanks for pointing that out; I noticed that too. This is perhaps partly made up for by the fact that he doesn’t count feet or the mouth of the Nile river as possibilities.

lukehmiles 14 Apr 2024 19:27 UTC
5 points
4
in reply to: Garrett Baker’s comment on: Alexander Gietelink Oldenziel’s Shortform
Have you ever tried hiring someone or getting a job? Mostly lemons all around (apologies for the offense, jobseekers, i’m sure you’re not the lemon)

lukehmiles 29 Feb 2024 7:05 UTC
LW: 5 AF: 2
4
AF
on: Post series on “Liability Law for reducing Existential Risk from AI”
One of those ideas that’s so obviously good it’s rarely discussed?

lukehmiles 5 Nov 2023 7:56 UTC
5 points
1
on: Go flash blinking lights at printed text right now
Some other things to try:
- find a long hair on your arm
- find an imperfection in your mirror or fridge door
- count the number of quarters in a small pile
- find an object you lost recently
- figure out the weave on your jeans

lukehmiles 20 Jun 2023 10:12 UTC
5 points
0
on: Lessons On How To Get Things Right On The First Try
I’m just patting myself on the back here for predicting the cup would get knocked over. That shouldn’t count. You want the ball in the cup—what use is a knocked over cup and ball on the ground.

Do you have more things like this? I would participate or run one

lukehmiles 19 Jun 2023 19:53 UTC
5 points
0
on: When did you orient?
Kyle Scott roughly said that when you know where to look and what to ignore you are oriented. Imagine a general freaking out at all the explosions vs one who knows how severe the explosions are expected to be and the threshold for changing course.