georgia_berg

Karma: 30

Hallucinating Certificates: Using Generative Language Models for Testing TLS Software Parsing

georgia_berg and Mario Gibney

27 Mar 2026 16:34 UTC

2 points

0 comments1 min readLW link

Eliciting Harmful Capabilities by Fine-Tuning on Safeguarded Outputs

georgia_berg and Mario Gibney

27 Mar 2026 16:17 UTC

2 points

0 comments1 min readLW link

“Is AI a Bubble?” Discussion

georgia_berg and Mario Gibney

2 Mar 2026 16:42 UTC

2 points

0 comments1 min readLW link

Testing LLM Cooperation in Multi-Agent Simulation

georgia_berg and Mario Gibney

2 Mar 2026 16:40 UTC

2 points

0 comments1 min readLW link

The Anthropic-Pentagon Stand-Off

georgia_berg and Mario Gibney

2 Mar 2026 16:39 UTC

2 points

0 comments1 min readLW link

Aligning Brain-Like AGI

georgia_berg and Mario Gibney

2 Mar 2026 16:37 UTC

2 points

0 comments1 min readLW link

Open-weight LLMs – Their Strategic Import to Key AI Jurisdictions and The Growing Need for Safety Research, Dialogues, and Coordination

georgia_berg and Mario Gibney

2 Mar 2026 16:34 UTC

2 points

0 comments1 min readLW link

Network Topologies for AI and the Implications for Governance

georgia_berg and Mario Gibney

2 Mar 2026 16:31 UTC

2 points

0 comments1 min readLW link

Latest Updates on AI Safety Work at Jinesis AI Lab

georgia_berg and Mario Gibney

11 Feb 2026 21:13 UTC

2 points

0 comments1 min readLW link

The Moltverse meets Reality

georgia_berg and Mario Gibney

11 Feb 2026 21:08 UTC

2 points

0 comments1 min readLW link

AI Safety Thursday: Risks Emerging from Agent Swarms

georgia_berg and Mario Gibney

30 Jan 2026 18:00 UTC

2 points

0 comments1 min readLW link

AI Policy Tuesday: AI Safety in Healthcare

georgia_berg and Mario Gibney

30 Jan 2026 17:59 UTC

2 points

0 comments1 min readLW link

AI Policy Tuesday: Regulating AI Agents—Lessons from the EU AI Act

georgia_berg and Mario Gibney

29 Jan 2026 19:44 UTC

2 points

0 comments1 min readLW link

AI Safety Thursday: Implications of Continual Learning in LLM Agents

georgia_berg and Mario Gibney

29 Jan 2026 19:13 UTC

2 points

0 comments1 min readLW link

“The World After AGI” Discussion

georgia_berg and Mario Gibney

29 Jan 2026 17:41 UTC

2 points

0 comments1 min readLW link

AI Policy Tuesday: Regulating Catastrophic AI Risk Through Liability Insurance

georgia_berg and Mario Gibney

10 Jan 2026 16:10 UTC

2 points

0 comments1 min readLW link

Beyond Adversarial Robustness—Rethinking Sociopolitical Safety in AI Systems

georgia_berg and Mario Gibney

10 Jan 2026 16:08 UTC

2 points

0 comments1 min readLW link

Emergent Misalignment from Reward Hacking

georgia_berg and Mario Gibney

10 Jan 2026 16:06 UTC

2 points

0 comments1 min readLW link

AI Safety Thursdays: Why Attackers Are Winning and What We Can Do About It

georgia_berg and Mario Gibney

5 Jan 2026 20:40 UTC

2 points

0 comments1 min readLW link

AI Safety Thursday: Agentic Bug Detection—Progress and Deployment

georgia_berg and Mario Gibney

4 Dec 2025 20:08 UTC

2 points

0 comments1 min readLW link