RSS

Arjun Panickssery(ModusTrollens)

Karma: 1,164

An­a­lyz­ing Deep­Mind’s Prob­a­bil­is­tic Meth­ods for Eval­u­at­ing Agent Capabilities

22 Jul 2024 16:17 UTC
54 points
0 comments16 min readLW link

Un­der­rated Proverbs

Arjun Panickssery13 Jun 2024 12:30 UTC
10 points
9 comments1 min readLW link
(arjunpanickssery.substack.com)

“Suc­cess­ful lan­guage model evals” by Ja­son Wei

Arjun Panickssery25 May 2024 9:34 UTC
10 points
0 comments1 min readLW link
(www.jasonwei.net)

“Why I Write” by Ge­orge Or­well (1946)

Arjun Panickssery25 Apr 2024 16:02 UTC
58 points
2 comments9 min readLW link
(www.orwellfoundation.com)

[Fic­tion] A Confession

Arjun Panickssery18 Apr 2024 16:28 UTC
37 points
2 comments5 min readLW link
(arjunpanickssery.substack.com)

LLM Eval­u­a­tors Rec­og­nize and Fa­vor Their Own Generations

17 Apr 2024 21:09 UTC
44 points
1 comment3 min readLW link
(tiny.cc)