FutureAGI
Deterministic Evaluation
Ensures that model outputs are restricted to a specific set of valid responses.
Deterministic evaluation is an evaluation method that outputs a fixed set of predefined choices. It ensures that model outputs are restricted to a specific set of valid responses, making the evaluation process fully deterministic and predictable.
Client Setup
Initialize the evaluation client with your API credentials:
Configuration
The evaluation accepts the following configuration parameters:
Parameter | Description | Required | Default |
---|---|---|---|
multi_choice | Whether to allow multiple choices in output | Yes | false |
choices | List of valid choices/outputs | Yes | [] |
rule_prompt | Custom prompt for evaluation rules | Yes | "" |
input | Input strings to test | Yes | {} |
Test Case Setup
The evaluation requires test cases with inputs and outputs: