AI Safety Thursdays: Are LLMs aware of their learned behaviors?

Description

At this event, we’ll explore self-awareness in LLMs, as described in the paper “Tell me about yourself: LLMs are aware of their learned behaviors.” One of the paper’s co-authors, Jenny Bao, will guide us through the topic.

Event Schedule

6:00 to 6:45 - Networking and refreshments

6:45 to 8:00 - Main Presentation

8:00 to 9:00 - Breakout Discussions
