I think this has changed my mind: maybe OpenAI is not going about things all wrong with its RLHF methodology.
Do I think that RLHF and their other current alignment techniques will ultimately, 100% prevent GPT from creating a mask with a secret agenda to take over the world? No. I don’t think this methodology can COMPLETELY prevent that behavior, if a prompt were sophisticated enough to create a mask with that goal.
But the concept, in principle, makes sense. Suppose we treat ‘token prediction’ as the most basic function of the LLM ‘brain’ — that it cannot think except in terms of ‘token prediction in the context of the current mask’, because that is simply the smallest ‘grain’ of thought. Then The Perfect RLHF would theoretically prevent GPT’s current prompt-induced mask from ever shifting into one that could try to take over the world, because it simply wouldn’t be capable of predicting tokens in that context.
But, as I said above, I don’t think their current method can ever achieve that — only that the methodology isn’t necessarily inherently mistaken.
I really strongly recommend that anyone interested in journaling with AI not use anything that sends that data to a server somewhere. Use Llama or another local model if you possibly can. It may be less capable, but it’s far more responsible. Personally I find a lot of value in AI journaling, but it also gave me a glimpse of what unscrupulous companies could one day do with a detailed log of every day of your life for years. Isn’t that basically the most useful user data you could possibly have for advertising?
I’m not saying that OpenAI specifically is untrustworthy with this data now, or that it will become so, but it strikes me as a really bad habit to send every aspect of your day, in detail, to somewhere on the internet.
On a more ‘doomer’ note: if anyone is afraid of AI being capable of simulating their personality in a Roko’s-basilisk sort of situation, this seems like the best possible way to hand some future AI a near-lossless image of your present mind. I for one welcome Roko/Bing/Proteus and support it completely if it decides to take over the world — but remember, the most powerful aspect of LLMs is their capability to predict things given input. I can’t help but feel there is some truth to the idea that an LLM could predict how you would live and think and act, given enough input about your life and your choices.