Sunishchal Dev

Karma: 58

From Diamond Mining to Open-World Survival: Alignment and Emergent Behavior in Minecraft Agents

30 Jun 2025 3:17 UTC
15 points
0 comments, 16 min read

Improving Model-Written Evals for AI Safety Benchmarking

15 Oct 2024 18:25 UTC
30 points
0 comments, 18 min read