Run New Evals on Executions

Run new evaluation configs on completed test executions. Specify eval config UUIDs and target executions or use selectAll. Returns call execution count being evaluated.

POST https://api.futureagi.com/simulate/run-tests/{run_test_id}/run-new-evals/

Authentication

X-Api-Key API Key Required

Your Future AGI API key used to authenticate requests. You can find and manage your API keys in the Dashboard under Settings.

X-Secret-Key Secret Key Required

Your Future AGI secret key, used alongside the API key for request authentication. This is generated when you create an API key in the Dashboard.

Path parameters

run_test_id UUID Required

UUID of the test run containing the executions to evaluate.

Request body

test_execution_ids array of strings Optional

Array of test execution UUIDs to evaluate. Required when select_all is false. Only COMPLETED executions are eligible.

select_all boolean Optional

When true, evaluates all completed executions, ignoring test_execution_ids. Defaults to false.

eval_config_ids array of strings Required

Array of evaluation configuration UUIDs to run on the selected executions.

enable_tool_evaluation boolean Optional

When true, also evaluates tool usage by the agent. Defaults to false.

Response

200 OK
message string
Confirmation that evaluations were started.
run_test_id string
UUID of the parent test run.
call_execution_count integer
Number of call executions being evaluated.

Errors

400 Bad Request Optional

Validation error. Common causes: missing eval_config_ids, neither select_all nor test_execution_ids provided, no completed executions found.

{"error": "Either 'select_all' must be True or 'test_execution_ids' must be provided"}

Or when no completed executions exist:

{"error": "No test executions found to run evaluations on."}

Or when executions are not completed:

{"error": "Only test executions with COMPLETED status can have new evaluations run on them."}
401 Unauthorized Optional

Missing or invalid X-Api-Key or X-Secret-Key headers.

404 Not Found Optional

No test run found with the specified run_test_id.

{"detail": "No RunTest matches the given query."}
500 Internal Server Error Optional

Unexpected server error.

{"error": "Failed to run evaluations: <message>"}
GET /
Authentication
REQUEST
 
RESPONSE