/future-agi/get-started/evaluation/builtin-evals/task-completion/
/docs/evaluation/builtin/task-completion