mic comments on ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks

mic Aug 4, 2023, 4:00 AM
Fine-tuning will be generally available for GPT-4 and GPT-3.5 later this year. Do you think this could create more opportunities for misuse and lead to stronger performance on dangerous capability evaluations?