RSS

Bruce W. Lee

Karma: 114

I maintain a pretty transparent online presence if you ever wanted to know more about who I am.

https://​​brucewlee.github.io/​​

Re­search Post: Tasks That Lan­guage Models Don’t Learn

22 Feb 2024 18:52 UTC
39 points
23 comments2 min readLW link
(arxiv.org)

In­fants ask for help to avoid er­rors.

Bruce W. Lee2 Apr 2024 18:10 UTC
12 points
0 comments1 min readLW link
(www.pnas.org)

Bench­mark Study #2: Truth­fulQA (Task, MCQ)

Bruce W. Lee6 Jan 2024 2:39 UTC
11 points
2 comments4 min readLW link
(arxiv.org)

Bench­mark Study #1: MMLU (Pile, MCQ)

Bruce W. Lee5 Jan 2024 21:35 UTC
10 points
0 comments5 min readLW link
(arxiv.org)

Fac­ing Up to the Prob­lem of Consciousness

Bruce W. Lee10 Dec 2023 23:31 UTC
8 points
0 comments3 min readLW link

In­fants’ un­der­stand­ing of the causal power of agents and tools

Bruce W. Lee27 Feb 2024 18:36 UTC
8 points
0 comments4 min readLW link
(www.pnas.org)

Bench­mark Study #5: So­cial In­tel­li­gence QA (Task, MCQ)

Bruce W. Lee7 Feb 2024 4:41 UTC
6 points
0 comments5 min readLW link
(arxiv.org)

An Idea on How LLMs Can Show Self-Serv­ing Bias

Bruce W. Lee23 Nov 2023 20:25 UTC
6 points
6 comments3 min readLW link

Bench­mark Study #4: AI2 Rea­son­ing Challenge (Task(s), MCQ)

Bruce W. Lee7 Jan 2024 17:13 UTC
6 points
0 comments5 min readLW link

Shared sys­tem for or­der­ing small and large num­bers in mon­keys and humans

Bruce W. Lee9 Feb 2024 4:45 UTC
6 points
0 comments1 min readLW link
(pubmed.ncbi.nlm.nih.gov)

Num­ber Trumps Area for 7-Month-Old Infants

Bruce W. Lee9 Feb 2024 4:58 UTC
5 points
0 comments2 min readLW link
(pubmed.ncbi.nlm.nih.gov)

Re­la­tional Think­ing in An­i­mals and Humans

Bruce W. Lee19 Feb 2024 18:34 UTC
4 points
0 comments4 min readLW link
(psycnet.apa.org)

Core sys­tems of number

Bruce W. Lee9 Feb 2024 2:19 UTC
3 points
0 comments3 min readLW link
(www.sciencedirect.com)

Rep­re­sen­ta­tions of Ab­stract Re­la­tions in Infancy

Bruce W. Lee20 Feb 2024 17:40 UTC
2 points
0 comments3 min readLW link
(direct.mit.edu)

Bench­mark Study #3: Hel­laSwag (Task, MCQ)

Bruce W. Lee7 Jan 2024 4:59 UTC
2 points
4 comments6 min readLW link
(arxiv.org)