Bogdan Ionut Cirstea

Karma: 1,517

Automated / strongly-augmented safety research.

LLMs Do Not Think Step-by-step In Implicit Reasoning

Bogdan Ionut Cirstea · 28 Nov 2024 9:16 UTC
11 points
0 comments · 1 min read · LW link
(arxiv.org)

Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?

Bogdan Ionut Cirstea · 26 Nov 2024 9:58 UTC
9 points
0 comments · 1 min read · LW link
(arxiv.org)

Disentangling Representations through Multi-task Learning

Bogdan Ionut Cirstea · 24 Nov 2024 13:10 UTC
14 points
1 comment · 1 min read · LW link
(arxiv.org)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types

Bogdan Ionut Cirstea · 23 Nov 2024 12:45 UTC
11 points
0 comments · 1 min read · LW link

A Little Depth Goes a Long Way: the Expressive Power of Log-Depth Transformers

Bogdan Ionut Cirstea · 20 Nov 2024 11:48 UTC
16 points
0 comments · 1 min read · LW link
(openreview.net)