Nina Panickssery

Karma: 2,486

https://ninapanickssery.com/

Views purely my own unless clearly stated otherwise

Breastfeeding and IQ: Effects shrink as you control for more confounders

Nina Panickssery · 25 Aug 2025 18:43 UTC
39 points
2 comments · 1 min read · LW link
(blog.ninapanickssery.com)

Interiors can be more fun

Nina Panickssery · 13 Aug 2025 22:42 UTC
34 points
6 comments · 4 min read · LW link
(blog.ninapanickssery.com)

Negative utilitarianism is more intuitive than you think

Nina Panickssery · 11 Aug 2025 16:13 UTC
13 points
25 comments · 3 min read · LW link
(blog.ninapanickssery.com)

[Fiction] Our Trial

Nina Panickssery · 21 Jul 2025 3:56 UTC
68 points
1 comment · 3 min read · LW link
(ninapanickssery.substack.com)

Why do LLMs hallucinate?

Nina Panickssery · 13 Jul 2025 0:09 UTC
24 points
1 comment · 5 min read · LW link
(ninapanickssery.substack.com)

Surprises and learnings from almost two months of Leo Panickssery

Nina Panickssery · 12 Jul 2025 23:33 UTC
208 points
12 comments · 6 min read · LW link
(ninapanickssery.substack.com)

[Question] How does the LessWrong team generate the website illustrations?

Nina Panickssery · 23 Jun 2025 0:05 UTC
13 points
1 comment · 1 min read · LW link

My favorite Soviet songs

Nina Panickssery · 15 Jun 2025 2:48 UTC
22 points
1 comment · 5 min read · LW link
(ninapanickssery.substack.com)

Why I am not a successionist

Nina Panickssery · 4 May 2025 19:08 UTC
67 points
54 comments · 2 min read · LW link
(ninapanickssery.substack.com)

Role embeddings: making authorship more salient to LLMs

7 Jan 2025 20:13 UTC
50 points
0 comments · 8 min read · LW link

Nina Panickssery’s Shortform

Nina Panickssery · 7 Jan 2025 2:06 UTC
7 points
51 comments · LW link

Investigating the Ability of LLMs to Recognize Their Own Writing

30 Jul 2024 15:41 UTC
32 points
0 comments · 15 min read · LW link

Jailbreak steering generalization

20 Jun 2024 17:25 UTC
41 points
4 comments · 2 min read · LW link
(arxiv.org)

Soviet comedy film recommendations

Nina Panickssery · 9 Jun 2024 23:40 UTC
42 points
11 comments · 2 min read · LW link
(open.substack.com)

Steering Llama-2 with contrastive activation additions

2 Jan 2024 0:47 UTC
125 points
29 comments · 8 min read · LW link
(arxiv.org)

Comparing representation vectors between llama 2 base and chat

Nina Panickssery · 28 Oct 2023 22:54 UTC
36 points
5 comments · 2 min read · LW link

Investigating the learning coefficient of modular addition: hackathon project

17 Oct 2023 19:51 UTC
94 points
5 comments · 12 min read · LW link

Influence functions - why, what and how

Nina Panickssery · 15 Sep 2023 20:42 UTC
75 points
6 comments · 8 min read · LW link

Red-teaming language models via activation engineering

Nina Panickssery · 26 Aug 2023 5:52 UTC
69 points
6 comments · 9 min read · LW link

The Low-Hanging Fruit Prior and sloped valleys in the loss landscape

23 Aug 2023 21:12 UTC
82 points
1 comment · 13 min read · LW link