Checks whether the input or output is semantically similar or dissimilar to a target value, so you can catch sentences you don’t want present without having to match on the exact text.
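As a rough illustration of the idea, the sketch below scores a text against a target value with cosine similarity and applies a threshold. It is not the evaluator's actual implementation: `toy_embed`, `check_similarity`, the `threshold` default, and the `must_be_similar` flag are all hypothetical names chosen for this example, and the word-count "embedding" is only a runnable stand-in for a real embedding model, so it captures word overlap rather than true semantic similarity.

```python
import math
from collections import Counter


def toy_embed(text: str) -> Counter:
    """Toy stand-in for an embedding model (assumption): lowercase word counts."""
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def check_similarity(text, target, threshold=0.5, must_be_similar=False):
    """Return (score, passed).

    With must_be_similar=False (hypothetical default), the check passes
    only when the text is NOT similar to the target, which matches the
    use case of avoiding unwanted sentences.
    """
    score = cosine_similarity(toy_embed(text), toy_embed(target))
    is_similar = score >= threshold
    passed = is_similar if must_be_similar else not is_similar
    return score, passed
```

For example, `check_similarity("The weather is sunny", "I cannot help with that")` passes because the two texts share no words, while passing the target sentence itself as `text` fails the check. With a real embedding model in place of `toy_embed`, paraphrases of the target would also score high.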
Successful evaluation
Status of the evaluation: processed, skipped, or error
Evaluation score
Whether the evaluation passed
Evaluation label
Additional details about the evaluation
Raw response from the evaluator
Type of error if status is 'error'
Error traceback if status is 'error'
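The fields listed above could be modeled on the client side roughly as follows. This is a sketch under assumptions: only `status` and its three values come from the descriptions above; every other attribute name (`score`, `passed`, `label`, `details`, `raw_response`, `error_type`, `traceback`), the types, and the `summarize` helper are hypothetical and may not match the actual response schema.

```python
from dataclasses import dataclass
from typing import Any, Literal, Optional


@dataclass
class EvaluationResult:
    # Field names other than `status` are hypothetical (assumption).
    status: Literal["processed", "skipped", "error"]
    score: Optional[float] = None        # evaluation score
    passed: Optional[bool] = None        # whether the evaluation passed
    label: Optional[str] = None          # evaluation label
    details: Optional[str] = None        # additional details about the evaluation
    raw_response: Any = None             # raw response from the evaluator
    error_type: Optional[str] = None     # set only when status is "error"
    traceback: Optional[str] = None      # set only when status is "error"


def summarize(result: EvaluationResult) -> str:
    """Build a one-line summary, branching on status first."""
    if result.status == "error":
        return f"error: {result.error_type}"
    if result.status == "skipped":
        return "skipped"
    return f"passed={result.passed} score={result.score}"
```

Branching on the status before reading the score or pass flag keeps error and skipped results from being mistaken for a failed evaluation.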