astle dsa

Karma: 3

astle dsa 29 Jun 2026 22:12 UTC
1 point
0
in reply to: mishka’s comment on: Agentic Frameworks: Or different ways to make LLM API calls
The process that implements the logic being a model itself feels closer to routing, but the premise is very interesting. Thanks for pointing me toward’s Sakana’s research!

astle dsa 29 Jun 2026 22:11 UTC
1 point
0
in reply to: Brendan Long’s comment on: Tree Transformers: A step towards generalizing the transformer architecture
My plan was to gather tree-shaped inputs, and observe whether tree-transformers offer any advantage over vector transformers.
I do not think the reason we perform attention on 1D vectors is because of the data’s shape, rather, as I mentioned earlier, we more often force our data to be flattened arrays since it offers a multitude of pragmatic advantages which are hard to ignore.

Tree Transformers: A step towards generalizing the transformer architecture

astle dsa24 Jun 2026 0:44 UTC

2 points

2 comments4 min readLW link

(astledsa.substack.com)

Agentic Frameworks: Or different ways to make LLM API calls

astle dsa24 Jun 2026 0:44 UTC

3 points

2 comments5 min readLW link

(astledsa.substack.com)