“Algorithm for Concept Extrapolation”

# rokosbasilisk

I don’t see any recent publications by Paul Christiano related to this, so I guess the problem(s) is still open.

# Goal-misgeneralization is ELK-hard

parameters before L is less than ,

Should this be “after”?

AutoGPT was created by a non-coding VC

It looks like you are confusing AutoGPT with BabyAGI, which was created by Yohei Nakajima, who is a VC. The creator of AutoGPT (Toran Bruce Richards) is a game developer with decent programming (game-development) experience. Even the figure shown here is from BabyAGI (https://yoheinakajima.com/task-driven-autonomous-agent-utilizing-gpt-4-pinecone-and-langchain-for-diverse-applications/).

47 layers layer

Should this be “47 layers later”?

Really interesting idea.

Regarding the first one, I am not expecting a single prompt to generate the entirety of enwik8/enwik9. I am more interested in finding a set of prompts, with a lookup table if possible, to replicate the enwik data.
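To make the “prompts plus lookup table” idea concrete, here is a minimal, purely illustrative sketch: a deterministic predictor stands in for the language model, and a lookup table stores only the positions where the predictor is wrong, so the original data can be reconstructed exactly. The `model` function here is a hypothetical stand-in; a real attempt would replace it with an LLM conditioned on a prompt.

```python
# Sketch of prompt-based compression with a correction lookup table.
# The "model" is a hypothetical stand-in predictor, not a real LLM.

def model(prefix):
    """Stand-in predictor: always guesses 'e', a common letter in English."""
    return "e"

def compress(data):
    """Return a patch table {position: correct_char} wherever the model errs."""
    patches = {}
    for i, ch in enumerate(data):
        if model(data[:i]) != ch:
            patches[i] = ch
    return patches

def decompress(length, patches):
    """Replay the model, applying patches, to reconstruct the data exactly."""
    out = []
    for i in range(length):
        out.append(patches.get(i, model("".join(out))))
    return "".join(out)

text = "see the tree"
patches = compress(text)
assert decompress(len(text), patches) == text
print(len(patches), "patches for", len(text), "chars")
```

The scheme only compresses if the predictor is right often enough that the patch table is smaller than the data; that is exactly the question of whether a good set of prompts exists for enwik-style data.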

Thanks for the pointer to the Chinchilla post, will look into it.

not yet

# Hutter-Prize for Prompts

This requires hitting a window—our data needs to be good enough that the system can tell it should use human values as a proxy, but bad enough that the system can’t figure out the specifics of the data-collection process enough to model it directly. This window may not even exist.

Are there any real-world examples of this? Not necessarily in a human-values setting.

From a complexity-theoretic viewpoint, how hard could ELK be? Is there any evidence that ELK is decidable?

Is there a separate post for the “train a reporter that is useful for another AI” proposal?

Silly idea: instead of thought-annotating AI Dungeon playthroughs, we could start by annotating thoughts for Akinator game runs.

Pros: a much easier and faster way to build a dataset, with less ambiguity.

Cons: somewhat more restricted than the original idea.

The proof need not be bogus: it can be a long, valid proof. But since you are describing the problem in natural language, the proof generated by the AGI need not be for the problem that you described.

Also, the AGI can generate a long, valid proof that is nonetheless not for the question you asked, since the assumption is that problems are described in natural language and it is the AGI’s job to understand them, convert them to a formal language, and then prove them.

I think, instead of recursively asking for higher-level proofs, shouldn’t it be a machine-checkable proof of the correctness of the AGI itself?

Verifying a proof may run in polynomial time, compared to the exponential cost of finding one, but that doesn’t rule out the possibility that there exists a proof large enough that even checking it is hard.

There are many algorithms that are polynomial in time but far worse to run in practice.
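The point above can be made concrete with a toy proof checker: checking is linear in the number of steps, yet a sufficiently long proof is still infeasible to verify. The proof format here (chained modus ponens over an invented encoding) is made up for illustration and is not any real proof system.

```python
# Toy proof checker: verification is O(len(proof)), one cheap check per step.

def check_proof(axioms, proof):
    """Verify a chain of modus-ponens steps.

    Each step ("mp", i, j) takes statement i ("A") and statement j
    ("A -> B", encoded as ("->", A, B)) and concludes "B".
    Returns the final derived statement.
    """
    known = list(axioms)  # statements established so far
    for op, i, j in proof:
        assert op == "mp"
        a, implication = known[i], known[j]
        assert implication[0] == "->" and implication[1] == a
        known.append(implication[2])  # conclude the consequent B
    return known[-1]

# Axioms: p, p -> q, q -> r
axioms = ["p", ("->", "p", "q"), ("->", "q", "r")]
proof = [("mp", 0, 1),  # derive q (becomes statement 3)
         ("mp", 3, 2)]  # derive r
print(check_proof(axioms, proof))  # -> r

# Linear still isn't cheap in absolute terms: a proof with 2**60 steps,
# checked at 10**9 steps per second, takes decades to verify.
print(2**60 / 1e9 / (3600 * 24 * 365), "years")
```

So "verification is polynomial" only helps if the proof object the AGI hands you is of reasonable length to begin with.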

Language models, if they are AGI, would surely surpass human-level understanding of language. Humans need language for communication and book-keeping, and “words” in any language are mostly interesting abstractions from a human point of view. A language model need not have any language at all, since it doesn’t have an internal dialogue like humans do.

As it reaches a certain level of intelligence, it starts forming increasingly complex abstractions that don’t (and won’t) have any vocabulary. It would be impossible to interpret its reasoning, and the only option left is to accept it.

Verifying a proof of the Riemann hypothesis is not harder than generating one. But say you have access to an alleged proof of the Riemann hypothesis that is so long that verification itself is super-hard; then you have no evidence that the proof is correct, unless you can show that the AGI generating it is indeed capable of producing such a proof and is being honest with you.

Not much information is given regarding that so far; I was curious about that too.