Ragas Response Context Precision

import langwatch

df = langwatch.dataset.get_dataset("dataset-id").to_pandas()

evaluation = langwatch.evaluation.init("my-incredible-evaluation")

for index, row in evaluation.loop(df.iterrows()):
    # your execution code here       
    evaluation.run(
        "ragas/response_context_precision",
        index=index,
        data={
            "input": row["input"],
            "contexts": row["contexts"],
            "output": output,
            "expected_output": row["expected_output"],
        },
        settings={}
    )

[
  {
    "status": "processed",
    "score": 123,
    "passed": true,
    "label": "<string>",
    "details": "<string>",
    "cost": {
      "currency": "<string>",
      "amount": 123
    },
    "raw_response": {},
    "error_type": "<string>",
    "traceback": [
      "<string>"
    ]
  }
]

POST

ragas

response_context_precision

evaluate

import langwatch

df = langwatch.dataset.get_dataset("dataset-id").to_pandas()

evaluation = langwatch.evaluation.init("my-incredible-evaluation")

for index, row in evaluation.loop(df.iterrows()):
    # your execution code here       
    evaluation.run(
        "ragas/response_context_precision",
        index=index,
        data={
            "input": row["input"],
            "contexts": row["contexts"],
            "output": output,
            "expected_output": row["expected_output"],
        },
        settings={}
    )

[
  {
    "status": "processed",
    "score": 123,
    "passed": true,
    "label": "<string>",
    "details": "<string>",
    "cost": {
      "currency": "<string>",
      "amount": 123
    },
    "raw_response": {},
    "error_type": "<string>",
    "traceback": [
      "<string>"
    ]
  }
]

Authorizations

X-Auth-Token

string

header

required

Body

application/json

data

object

required

Show child attributes

data.input

string

required

The input text to evaluate

data.contexts

string[]

required

Context information for evaluation

data.output

string

The output text to evaluate

data.expected_output

string

Expected output for comparison

settings

object

Evaluator settings

Show child attributes

settings.model

string

default:openai/gpt-5

The model to use for evaluation.

settings.max_tokens

number

default:2048

The maximum number of tokens allowed for evaluation, a too high number can be costly. Entries above this amount will be skipped.

Response

Successful evaluation

status

enum<string>

required

Available options:

processed,

skipped,

error

score

number

Evaluation score

passed

boolean

Whether the evaluation passed

label

string

Evaluation label

details

string

Additional details about the evaluation

cost

object

Show child attributes

cost.currency

string

required

cost.amount

number

required

raw_response

object

Raw response from the evaluator

error_type

string

Type of error if status is 'error'

traceback

string[]

Error traceback if status is 'error'

Ragas Faithfulness Ragas Response Context Recall

⌘I

Get Started

Observability

Agent Simulations

Evaluation

Prompt Management

Platform

Examples & Cookbooks

Ragas Response Context Precision

Authorizations

Body

Response