Evaluation Using Interface
Input:
- Required Inputs:
  - output: The output column generated by the model.
- Optional Inputs:
  - context: The context column provided to the model.
  - input: The input column provided to the model.
- Configuration Parameters:
  - criteria: Text description of the evaluation criteria (e.g., “Evaluate if the output directly answers the question in the input, considering the provided context for background information.”).
  - check_internet: Boolean - Whether to check external sources during evaluation based on the criteria.

Output:
- Score: Percentage score between 0 and 100
  - Higher scores: Indicate strong alignment between the input, output, and context according to the specified criteria.
  - Lower scores: Suggest that the output does not meet the defined criteria in relation to the input and context.
Evaluation Using Python SDK
To learn how to set up evaluation using the Python SDK, see the SDK setup guide.
Input Type | Parameter | Type | Description |
---|---|---|---|
Required Inputs | output | string | The output generated by the model. |
Optional Inputs | context | string | The context provided to the model. |
Optional Inputs | input | string | The input provided to the model. |
Configuration Parameters | criteria | string | The evaluation criteria. |
Configuration Parameters | check_internet | bool | Whether to check the internet during evaluation based on the criteria. |

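
The parameters listed above can be collected and validated before they are handed to an evaluator. The following is a minimal sketch only: `EvalInputs` and `to_payload` are hypothetical names used for illustration and are not part of the SDK. Treating `criteria` as mandatory and defaulting `check_internet` to `False` are also assumptions.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class EvalInputs:
    """Hypothetical container for the documented parameters (not an SDK class)."""

    output: str                      # required input: text generated by the model
    criteria: str                    # configuration: evaluation criteria (assumed mandatory)
    context: Optional[str] = None    # optional input
    input: Optional[str] = None      # optional input
    check_internet: bool = False     # configuration; the False default is an assumption

    def __post_init__(self) -> None:
        # Reject blank values for the fields this sketch treats as mandatory.
        if not self.output.strip():
            raise ValueError("output is required and must not be empty")
        if not self.criteria.strip():
            raise ValueError("criteria must not be empty")

    def to_payload(self) -> dict:
        """Return a plain dict, omitting optional inputs that were not supplied."""
        return {key: value for key, value in asdict(self).items() if value is not None}


example = EvalInputs(
    output="Paris is the capital of France.",
    input="What is the capital of France?",
    criteria="Evaluate if the output directly answers the question in the input.",
)
payload = example.to_payload()
```

Here `payload` contains `output`, `input`, `criteria`, and `check_internet`, but no `context` key, because no context was supplied.
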
Output | Type | Description |
---|---|---|
Score | float | Returns a score between 0 and 1, where higher values indicate better alignment based on criteria. |
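
The interface reports the score as a percentage between 0 and 100, while the SDK returns a float between 0 and 1. A small helper can put SDK results on the same scale as the interface. This sketch assumes the two scales differ only by a factor of 100, which the text above does not state explicitly; `to_percentage` is a hypothetical helper, not an SDK function.

```python
def to_percentage(score: float) -> float:
    """Convert a 0-1 score to a 0-100 percentage, rejecting out-of-range values."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be between 0 and 1, got {score}")
    # Assumption: the interface percentage is the SDK score scaled by 100.
    return round(score * 100.0, 2)
```

For example, `to_percentage(0.5)` returns `50.0`.
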