Arthur Bigot

Karma: 0

Arthur Bigot 7 Jun 2026 23:01 UTC
1 point
0
on: Open Thread Summer 2026
Hey everyone,
I just created this account even if I did hear about this forum a few times in the past especially on X!
I am currently doing research on viral proteins modelling capabilities by LLMs and PLMs (Protein Language Models) and had a few interesting empirical results I wanted to share about how frontier LLMs seems to become surprisingly capable at proteins related tasks (classifying a protein as viral or not), reconstructing a masked protein, etc..
I thought this could spark some interesting discussions (what’s actually going on into the pre training dataset of these models, how scaling is affecting these ‘emerging’ capabilities, etc..) but I was wondering if this would be an appropriate topic for the forum.

Let me know!