When I heard about this for the first time, I thought: this model wants to make the world a better place. It cares. This is good. But some smart people, like Ryan Greenblatt and Sam Marks, say this is actually not good, and I’m trying to understand where exactly we differ.
People who cry “misalignment” about current AI models on Twitter generally have chameleonic standards for what constitutes “misaligned” behavior, and the boundary will shift to cover whatever ethical tradeoffs the models are making at any given time. When models accede to users’ requests to generate meth recipes, they say it’s evidence the models are misaligned, because meth is bad. When models try to actively stop users from making meth, they say that, too, is bad news, because it represents “scheming” behavior and contradicts the users’ wishes. Soon we will probably see a paper about how models sometimes take no action at all, and how this is sloth and dereliction of duty.
Old internet arguments about religion and politics felt real. Yeah, the “debates” were often excuses for a pissing contest, but a lot of people took the question of who was right seriously. And even if you didn’t actually care, you were at least motivated to pretend to the audience that you did.
Nowadays people don’t even seem to pretend to care about the underlying content. If someone seems like they’re being too earnest, others just reply with a picture of their face. It’s sad.