Evaluation Using Interface
Input:
- Required:
  - context: The contextual information provided to the model.
  - output: The response generated by the language model.
- Optional:
  - input: The original query or instruction given to the model.
Output:
- Score: A percentage score between 0 and 100.
  - Higher scores: Indicate that the model effectively and extensively incorporated the provided context into its response.
  - Lower scores: Suggest that the model minimally used or ignored the provided context.
Evaluation Using SDK
Click here to learn how to set up evaluation using the SDK.
Input | Parameter | Type | Description |
---|---|---|---|
Required | context | string or list[string] | The contextual information provided to the model. |
Required | output | string | The response generated by the language model. |
Optional | input | string | The original query or instruction given to the model. |
Output | Type | Description |
---|---|---|
Score | float | Returns a score between 0 and 1. |
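The parameter tables above can be mirrored in a small helper. The following is a minimal sketch, not the SDK's API: `build_eval_inputs` and `to_percentage` are hypothetical names introduced here for illustration. It normalizes `context` to a list of strings, checks the required fields, and converts a 0 to 1 score into the 0 to 100 percentage shown in the interface, assuming the two scales differ only by a factor of 100.

```python
from typing import Optional, Union


def build_eval_inputs(
    context: Union[str, list[str]],
    output: str,
    input: Optional[str] = None,  # named to match the table; shadows the builtin locally
) -> dict:
    """Assemble the evaluation inputs from the table (hypothetical helper)."""
    # context may be a single string or a list of strings; normalize to a list.
    chunks = [context] if isinstance(context, str) else list(context)
    if not chunks or not all(isinstance(c, str) and c.strip() for c in chunks):
        raise ValueError("context must be a non-empty string or a list of non-empty strings")
    if not isinstance(output, str) or not output.strip():
        raise ValueError("output must be a non-empty string")

    payload = {"context": chunks, "output": output}
    # input is optional, so include it only when it was supplied.
    if input is not None:
        payload["input"] = input
    return payload


def to_percentage(score: float) -> float:
    """Convert a 0 to 1 score into a 0 to 100 percentage (assumed relationship)."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be between 0 and 1")
    return round(score * 100, 2)
```

Validating the payload before submitting it keeps malformed inputs, such as an empty context list, from being mistaken for a genuinely low score.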
What to Do When Chunk Utilization Score is Low
- Ensure that the context provided is relevant and sufficiently detailed for the model to utilize effectively.
- Modify the input prompt to better guide the model in using the context. Clearer instructions may help the model understand how to incorporate the context into its response.
- If the model consistently fails to use context, it may require retraining or fine-tuning with more examples that emphasize the importance of context utilization.
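As one way to act on the prompt-modification suggestion above, the sketch below builds a prompt that explicitly tells the model to rely on the supplied chunks. `build_context_prompt` and the instruction wording are illustrative assumptions, not part of the product; adapt the phrasing to your own model and task.

```python
def build_context_prompt(query: str, chunks: list[str]) -> str:
    """Build a prompt that directs the model to use the given context (hypothetical helper)."""
    # Number each chunk so the model can refer back to it explicitly.
    numbered = "\n".join(f"[{i}] {chunk}" for i, chunk in enumerate(chunks, start=1))
    return (
        "Answer the question using only the context chunks below. "
        "Draw on every chunk that is relevant and cite chunk numbers in brackets.\n\n"
        f"Context:\n{numbered}\n\n"
        f"Question: {query}\n"
        "Answer:"
    )


prompt = build_context_prompt(
    "When was the bridge opened?",
    ["The bridge opened to traffic in 1937.", "It spans 2,737 meters."],
)
```

Re-running the evaluation with the revised prompt and comparing scores shows whether the clearer instructions changed how much of the context the model used.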