Does anyone have thoughts on Muse Spark that they’ve written up? Do we have any speculations on what its good at/it’s size/whether it has good/bad post-training?
I’m excited to try it on the “recommend me a book” benchmark, since it is reportedly good at that, whereas other models haven’t been. Haven’t gotten around to that yet, though. I’d be interested to learn if this replicates for anyone!
Does anyone have thoughts on Muse Spark that they’ve written up? Do we have any speculations on what its good at/it’s size/whether it has good/bad post-training?
There are a lot of comments on hacker news and maybe some are useful: https://news.ycombinator.com/item?id=47692043
I’m excited to try it on the “recommend me a book” benchmark, since it is reportedly good at that, whereas other models haven’t been. Haven’t gotten around to that yet, though. I’d be interested to learn if this replicates for anyone!
The only things which I remember on LW is Brendan Long’s comment along with a take by Rauno Arike and my expression of distrust and irritation.