Chris_Leong

Karma: 7,716

Link: Let’s Think Dot by Dot: Hidden Computation in Transformer Language Models by Jacob Pfau, William Merrill & Samuel R. Bowman

Chris_Leong · Apr 27, 2024, 1:22 PM

12 points

10 votes

Overall karma indicates overall quality.

0 comments · 1 min read · LW link

(twitter.com)

“You’re the most beautiful girl in the world” and Wittgensteinian Language Games

Chris_Leong · Apr 20, 2024, 2:54 PM

5 points

22 votes

18 comments · 1 min read · LW link

The argument for near-term human disempowerment through AI

Chris_Leong · Apr 16, 2024, 4:50 AM

22 points

7 votes

2 comments · 1 min read · LW link

(link.springer.com)

Reverse Regulatory Capture

Chris_Leong · Apr 11, 2024, 2:40 AM

12 points

10 votes

3 comments · 1 min read · LW link

On the Confusion between Inner and Outer Misalignment

Chris_Leong · Mar 25, 2024, 11:59 AM

18 points

8 votes

10 comments · 1 min read · LW link

The Best Essay (Paul Graham)

Chris_Leong · Mar 11, 2024, 7:25 PM

25 points

8 votes

2 comments · 1 min read · LW link

(paulgraham.com)

[Question] Can we get an AI to “do our alignment homework for us”?

Chris_Leong · Feb 26, 2024, 7:56 AM

55 points

27 votes

33 comments · 1 min read · LW link

[Question] What’s the theory of impact for activation vectors?

Chris_Leong · Feb 11, 2024, 7:34 AM

61 points

20 votes

12 comments · 1 min read · LW link

Notice When People Are Directionally Correct

Chris_Leong · Jan 14, 2024, 2:12 PM

137 points

69 votes

8 comments · 2 min read · LW link

Are Metaculus AI Timelines Inconsistent?

Chris_Leong · Jan 2, 2024, 6:47 AM

17 points

9 votes

7 comments · 2 min read · LW link

Random Musings on Theory of Impact for Activation Vectors

Chris_Leong · Dec 7, 2023, 1:07 PM

8 points

3 votes

0 comments · 1 min read · LW link

Goodhart’s Law Example: Training Verifiers to Solve Math Word Problems

Chris_Leong · Nov 25, 2023, 12:53 AM

27 points

10 votes

2 comments · 1 min read · LW link

(arxiv.org)

Upcoming Feedback Opportunity on Dual-Use Foundation Models

Chris_Leong · Nov 2, 2023, 4:28 AM

3 points

3 votes

0 comments · 1 min read · LW link

On Having No Clue

Chris_Leong · Nov 1, 2023, 1:36 AM

20 points

14 votes

11 comments · 1 min read · LW link

Is Yann LeCun strawmanning AI x-risks?

Chris_Leong · Oct 19, 2023, 11:35 AM

26 points

18 votes

4 comments · 1 min read · LW link

Don’t Dismiss Simple Alignment Approaches

Chris_Leong · Oct 7, 2023, 12:35 AM

138 points

63 votes

9 comments · 4 min read · LW link

[Question] What evidence is there of LLM’s containing world models?

Chris_Leong · Oct 4, 2023, 2:33 PM

17 points

10 votes

17 comments · 1 min read · LW link

The Role of Groups in the Progression of Human Understanding

Chris_Leong · Sep 27, 2023, 3:09 PM

11 points

3 votes

0 comments · 2 min read · LW link

The Flow-Through Fallacy

Chris_Leong · Sep 13, 2023, 4:28 AM

21 points

12 votes

7 comments · 1 min read · LW link

Chariots of Philosophical Fire

Chris_Leong · Aug 26, 2023, 12:52 AM

12 points

4 votes

0 comments · 1 min read · LW link

(l.facebook.com)
