
ContextPositionMetric

New Scorer API

We just released a new evals API called Scorers, with a more ergonomic API, more metadata stored for error analysis, and more flexibility in the data structures you can evaluate. Migration is straightforward, and we will continue to support the existing Evals API.

The ContextPositionMetric class evaluates how well context nodes are ordered based on their relevance to the query and output. It uses position-weighted scoring to reward placing the most relevant context pieces earlier in the sequence.

Basic Usage

```typescript
import { openai } from "@ai-sdk/openai";
import { ContextPositionMetric } from "@mastra/evals/llm";

// Configure the model for evaluation
const model = openai("gpt-4o-mini");

const metric = new ContextPositionMetric(model, {
  context: [
    "Photosynthesis is a biological process used by plants to create energy from sunlight.",
    "The process of photosynthesis produces oxygen as a byproduct.",
    "Plants need water and nutrients from the soil to grow.",
  ],
});

const result = await metric.measure(
  "What is photosynthesis?",
  "Photosynthesis is the process by which plants convert sunlight into energy.",
);

console.log(result.score); // Position score from 0-1
console.log(result.info.reason); // Explanation of the score
```

Constructor Parameters

  • model (ModelConfig): Configuration for the model used to evaluate context positioning
  • options (ContextPositionMetricOptions): Configuration options for the metric

ContextPositionMetricOptions

  • scale? (number, default: 1): Maximum score value
  • context (string[]): Array of context pieces in their retrieval order

measure() Parameters

  • input (string): The original query or prompt
  • output (string): The generated response to evaluate

Returns

  • score (number): Position score (0 to scale, default 0-1)
  • info (object): Object containing the reason for the score
    • reason (string): Detailed explanation of the score

Scoring Details

The metric evaluates context positioning through binary relevance assessment and position-based weighting.

Scoring Process

  1. Evaluates context relevance:

    • Assigns binary verdict (yes/no) to each piece
    • Records position in sequence
    • Documents relevance reasoning
  2. Applies position weights:

    • Earlier positions weighted more heavily (weight = 1/(position + 1))
    • Sums weights of relevant pieces
    • Normalizes by maximum possible score

Final score: (weighted_sum / max_possible_sum) * scale
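The weighting and normalization steps above can be sketched as a small helper. This is an illustrative re-implementation, not the library's actual code: the function name, signature, and the assumption that binary verdicts have already been assigned are all ours.

```typescript
// Illustrative sketch of position-weighted scoring.
// `verdicts[i]` is the binary relevance verdict for the context
// piece at position i (in retrieval order).
function positionScore(verdicts: boolean[], scale = 1): number {
  // Earlier positions weigh more: weight = 1 / (position + 1)
  const weights = verdicts.map((_, i) => 1 / (i + 1));

  // Sum the weights of the relevant pieces only
  const weightedSum = verdicts.reduce(
    (sum, relevant, i) => sum + (relevant ? weights[i] : 0),
    0,
  );

  // Maximum possible sum: the same number of relevant pieces,
  // but packed into the earliest positions
  const relevantCount = verdicts.filter(Boolean).length;
  const maxPossibleSum = weights
    .slice(0, relevantCount)
    .reduce((a, b) => a + b, 0);

  return maxPossibleSum === 0 ? 0 : (weightedSum / maxPossibleSum) * scale;
}
```

Under this scheme, relevant context placed first scores higher than the same context placed last, since the front positions carry the largest weights.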

Score interpretation

(0 to scale, default 0-1)

  • 1.0: Optimal - most relevant context first
  • 0.7-0.9: Good - relevant context mostly early
  • 0.4-0.6: Mixed - relevant context scattered
  • 0.1-0.3: Suboptimal - relevant context mostly later
  • 0.0: Poor ordering - relevant context at end or missing
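As a rough illustration, the bands above amount to a simple threshold check. The `interpretScore` helper below is hypothetical and not part of the API:

```typescript
// Hypothetical helper: map a 0-1 position score to the
// interpretation bands listed above.
function interpretScore(score: number): string {
  if (score >= 1.0) return "Optimal";
  if (score >= 0.7) return "Good";
  if (score >= 0.4) return "Mixed";
  if (score >= 0.1) return "Suboptimal";
  return "Poor ordering";
}
```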

Example with Analysis

```typescript
import { openai } from "@ai-sdk/openai";
import { ContextPositionMetric } from "@mastra/evals/llm";

// Configure the model for evaluation
const model = openai("gpt-4o-mini");

const metric = new ContextPositionMetric(model, {
  context: [
    "A balanced diet is important for health.",
    "Exercise strengthens the heart and improves blood circulation.",
    "Regular physical activity reduces stress and anxiety.",
    "Exercise equipment can be expensive.",
  ],
});

const result = await metric.measure(
  "What are the benefits of exercise?",
  "Regular exercise improves cardiovascular health and mental wellbeing.",
);

// Example output:
// {
//   score: 0.5,
//   info: {
//     reason: "The score is 0.5 because while the second and third contexts are highly
//       relevant to the benefits of exercise, they are not optimally positioned at
//       the beginning of the sequence. The first and last contexts are not relevant
//       to the query, which impacts the position-weighted scoring."
//   }
// }
```