Measures how closely an output follows given prompt instructions, checking for completion of requested tasks and adherence to specified constraints or formats. This evaluation is crucial for ensuring that generated content meets the intended requirements and follows given instructions accurately.
Click here to learn how to create prompt column.Output:
Click here to learn how to setup evaluation using the Python SDK.Input:
string
- The output column generated by the model.float
- Returns score between 0 and 1