James Chua

Karma: 462

https://jameschua.net/about/

Backdoor awareness and misaligned personas in reasoning models

Jun 20, 2025, 11:38 PM
34 points

12 votes

8 comments, 6 min read

Thought Crime: Backdoors & Emergent Misalignment in Reasoning Models

Jun 16, 2025, 4:43 PM
68 points

22 votes

2 comments, 8 min read

OpenAI Responses API changes models’ behavior

Apr 11, 2025, 1:27 PM
53 points

24 votes

6 comments, 2 min read

New, improved multiple-choice TruthfulQA

Jan 15, 2025, 11:32 PM
72 points

28 votes

0 comments, 3 min read

Inference-Time-Compute: More Faithful? A Research Note

Jan 15, 2025, 4:43 AM
69 points

21 votes

10 comments, 11 min read

Tips On Empirical Research Slides

Jan 8, 2025, 5:06 AM
93 points

37 votes

4 comments, 6 min read

James Chua’s Shortform

James Chua, May 23, 2024, 6:13 AM
2 points

1 vote

2 comments, 1 min read

My MATS Summer 2023 experience

James Chua, Mar 20, 2024, 11:26 AM
29 points

16 votes

0 comments, 3 min read
(jameschua.net)

A library for safety research in conditioning on RLHF tasks

James Chua, Feb 26, 2023, 2:50 PM
10 points

7 votes

2 comments, 1 min read