I just created this account even if I did hear about this forum a few times in the past especially on X!
I am currently doing research on viral proteins modelling capabilities by LLMs and PLMs (Protein Language Models) and had a few interesting empirical results I wanted to share about how frontier LLMs seems to become surprisingly capable at proteins related tasks (classifying a protein as viral or not), reconstructing a masked protein, etc..
I thought this could spark some interesting discussions (what’s actually going on into the pre training dataset of these models, how scaling is affecting these ‘emerging’ capabilities, etc..) but I was wondering if this would be an appropriate topic for the forum.
Hey everyone,
I just created this account even if I did hear about this forum a few times in the past especially on X!
I am currently doing research on viral proteins modelling capabilities by LLMs and PLMs (Protein Language Models) and had a few interesting empirical results I wanted to share about how frontier LLMs seems to become surprisingly capable at proteins related tasks (classifying a protein as viral or not), reconstructing a masked protein, etc..
I thought this could spark some interesting discussions (what’s actually going on into the pre training dataset of these models, how scaling is affecting these ‘emerging’ capabilities, etc..) but I was wondering if this would be an appropriate topic for the forum.
Let me know!