RSS

martinkunev

Karma: 175

Un­der­stand­ing Agent Preferences

martinkunevFeb 24, 2025, 5:46 PM
6 points
0 comments14 min readLW link

What is Ran­dom­ness?

martinkunevSep 27, 2024, 5:49 PM
11 points
2 comments10 min readLW link

[Question] Is CDT with pre­com­mit­ment enough?

martinkunevMay 25, 2024, 9:40 PM
10 points
17 comments1 min readLW link

What is On­tol­ogy?

martinkunevFeb 12, 2024, 11:01 PM
4 points
0 comments4 min readLW link

[Question] Choos­ing a book on causality

martinkunevFeb 7, 2024, 9:16 PM
4 points
3 comments1 min readLW link

[Question] Would you have a baby in 2024?

martinkunevDec 25, 2023, 1:52 AM
24 points
76 comments1 min readLW link

How use­ful is Cor­rigi­bil­ity?

martinkunevSep 12, 2023, 12:05 AM
11 points
4 comments5 min readLW link

Dis­in­cen­tiviz­ing de­cep­tion in mesa op­ti­miz­ers with Model Tampering

martinkunevJul 11, 2023, 12:44 AM
3 points
0 comments2 min readLW link