Bruce W. Lee

Karma: 114

I maintain a pretty transparent online presence if you ever wanted to know more about who I am.

https://brucewlee.github.io/

Bruce W. Lee 18 Nov 2023 22:26 UTC
4 points
0
in reply to: Charlie Steiner’s comment on: Towards Evaluating AI Systems for Moral Status Using Self-Reports
Some food for thought:

A → Nature of Self-Reports in Cognitive Science: In cognitive science, self-reports are a widely used tool for understanding human cognition and consciousness. Training AI models to use self-reports (in an ideal scenario, this is analogous to giving a microphone, not giving the singer) does not inherently imply they become conscious. Instead, it provides a framework to study how AI systems represent and process information about themselves, which is crucial for understanding their limitations and capabilities.

B → Significance of Linguistic Cues: The use of personal pronouns like “I” and “you” in AI responses is more about exploring AI’s ability to model relational and subjective experiences than about inducing consciousness. These linguistic cues are essential in cognitive science (if we were to view LLMs as a semi-accurate model of how intelligence works) for understanding perspective-taking and self-other differentiation, which are key areas of study in human cognition. Also, considering that some research points to that a certain degree of self-other overlap is necessary for truly altruistic behaviors, tackling this self-other issue can be an important stepping stone to developing an altruistic AGI. In the end, what we want is AGI, not some statistical, language-spitting, automatic writer.

C → Ethical Implications and Safety Measures: The concerns about situational awareness and inadvertently creating consciousness in AI are valid. However, the paper’s proposal involves numerous safety measures and ethical considerations. The focus is on controlled experiments to understand AI’s self-modeling capabilities, not on indiscriminately pushing the boundaries of AI consciousness.

Bruce W. Lee 20 Nov 2023 19:42 UTC
7 points
0
in reply to: Bruce W. Lee’s comment on: Towards Evaluating AI Systems for Moral Status Using Self-Reports
Sorry for commenting twice, and I think this second one might be a little out of context (but I think it makes a constructive contribution to this discussion).

I think we must make sure that we are working on the “easy problems” of consciousness. This portion of consciousness has a relatively well-established philosophical explanation. For example, the Global Workspace Theory provides a good high-level interpretation of human consciousness. It proposes a cognitive architecture to explain consciousness. It suggests that consciousness operates like a “global workspace” in the brain, where various neural processes compete for attention. The information that wins this competition is broadcast globally, becoming accessible to multiple cognitive processes and entering conscious awareness. This theory addresses the question of how and why certain neural processes become part of conscious experience while others remain subconscious. The theory posits that through competitive and integrative mechanisms, specific information dominates our conscious experience, integrating different neural processes into a unified conscious experience.

However, the Global Workspace Theory primarily addresses the functional and mechanistic aspects of consciousness, often referred to as the “Easy Problems” of consciousness. These include understanding how cognitive functions like perception, memory, and decision-making become conscious experiences. However, the Hard Problem of Consciousness, which asks why and how these processes give rise to subjective experiences or qualia, remains largely unaddressed by GWT. The Hard Problem delves into the intrinsic nature of consciousness, questioning why certain brain processes are accompanied by subjective experiences. While GWT offers insights into the dissemination and integration of information in the brain, it doesn’t explain why these processes lead to subjective experience, leaving the Hard Problem essentially unresolved.

Until we have a good high-level philosophical foundation for this Hard Problem, it might be a good approach to draw the line between the two and work on the easy problems first.
Bringing home the point: 1. That is, for now, it will be extremely difficult for us to figure out whether LLMs (or any other human beings for the matter) have “phenomenal” or “subjective” dimensions consciousness or not. 2. Rather, focus on the easy, reductively-explainable dimensions of consciousness first. 3. We should make clear distinctions of these two categories when talking about consciousness

An Idea on How LLMs Can Show Self-Serving Bias

Bruce W. Lee23 Nov 2023 20:25 UTC

6 points

6 comments3 min readLW link

Bruce W. Lee 25 Nov 2023 15:06 UTC
1 point
0
in reply to: Tristan Wegner’s comment on: How LLMs Can Show Self-Serving Bias
Thanks for pointing that out. Sometimes, the rows will not add up to 100 because there were some responses where the model refused to answer.

Bruce W. Lee 25 Nov 2023 15:06 UTC
2 points
0
in reply to: red75prime’s comment on: How LLMs Can Show Self-Serving Bias
How is this possible? We are only inferencing

Facing Up to the Problem of Consciousness

Bruce W. Lee10 Dec 2023 23:31 UTC

8 points

0 comments3 min readLW link

Bruce W. Lee 14 Dec 2023 1:29 UTC
2 points
0
in reply to: Tristan Wegner’s comment on: How LLMs Can Show Self-Serving Bias
Yeah, I see it. It’s fixed now. Thanks!

Benchmark Study #1: MMLU (Pile, MCQ)

Bruce W. Lee5 Jan 2024 21:35 UTC

10 points

0 comments5 min readLW link

(arxiv.org)

Benchmark Study #2: TruthfulQA (Task, MCQ)

Bruce W. Lee6 Jan 2024 2:39 UTC

11 points

2 comments4 min readLW link

(arxiv.org)

Benchmark Study #3: HellaSwag (Task, MCQ)

Bruce W. Lee7 Jan 2024 4:59 UTC

2 points

4 comments6 min readLW link

(arxiv.org)

Benchmark Study #4: AI2 Reasoning Challenge (Task(s), MCQ)

Bruce W. Lee7 Jan 2024 17:13 UTC

6 points

0 comments5 min readLW link

Bruce W. Lee 7 Jan 2024 17:16 UTC
2 points
0
in reply to: Owain_Evans’s comment on: Benchmark Study #2: TruthfulQA
Thanks, Owain, for pointing this out. I will make two changes as time allows: 1. make it clearer for all posts when the benchmark paper is released, and 2. for this post, append the additional results and point readers to them.

Bruce W. Lee 7 Jan 2024 19:46 UTC
2 points
0
in reply to: jacobjacob’s comment on: Benchmark Study #3: HellaSwag
Thanks for the feedback. This is similar to the feedback that I received from Owain. Since my posts are getting upvotes (which I never really expected thank you), it is of course important to not mislead anyone.

But yes, I did have several major epistemic concerns about the reliability of current academic reporting practices in performance scores. Even if a certain group of researchers were very ethical, as a reader, how will we ever confirm that the numbers are indeed correct, or even that there was an experiment run ever?

I was weighing the overall benefits of reporting such non-provable numbers (in my opinion) and just focusing on the situation that the paper is written and enjoying the a-ha moments that the authors would have felt back then.

Anyway, before I post another benchmark study blog tomorrow, I’ll devise some steps of action to satisfy both my concern and yours. It’s always a joy to post here on LessWrong. Thanks for the comment!

Bruce W. Lee 8 Jan 2024 2:59 UTC
1 point
0
in reply to: jacobjacob’s comment on: Benchmark Study #3: HellaSwag
Thanks for the recommendation, though I’ll think of a more fundamental solution to satisfy all ethical/communal concerns.

”Gemini and GPT-4 authors report results close to or matching human performance at 95%, though I don’t trust their methodology.” Regarding this, just to sort everything out, because I’m writing under my real name, I do trust the authors and ethics of both OpenAI and DeepMind. It’s just me questioning everything when I still can as a student. But I’ll make sure not to cause any further confusion, as you recommended!