Skip to main content

Pairwise Embedding Distance

One way to measure the similarity (or dissimilarity) between two predictions on a shared or similar input is to embed the predictions and compute a vector distance between the two embeddings.

You can load the pairwise_embedding_distance evaluator to do this.

Note: This returns a distance score, meaning that the lower the number, the more similar the outputs are, according to their embedded representation.

import { OpenAIEmbeddings } from "langchain/embeddings/openai";
import { loadEvaluator } from "langchain/evaluation";

const embedding = new OpenAIEmbeddings();

const chain = await loadEvaluator("pairwise_embedding_distance", { embedding });

const res = await chain.evaluateStringPairs({
prediction: "Seattle is hot in June",
predictionB: "Seattle is cool in June.",
});

console.log({ res });

/*
{ res: { score: 0.03633645503883243 } }
*/

const res1 = await chain.evaluateStringPairs({
prediction: "Seattle is warm in June",
predictionB: "Seattle is cool in June.",
});

console.log({ res1 });

/*
{ res1: { score: 0.03657957473761331 } }
*/

API Reference: