List per-sample comparisons for an evaluation run.

Returns the per-sample comparisons for a completed run, paginated and ordered by each sample's original position in the dataset. This is the row-level detail behind the run's aggregate statistics — use it to inspect which individual cases passed or regressed.

Authorization

Bearer

AuthorizationBearer <token>

JWT Authorization header using the Bearer scheme. Enter 'Bearer' [space] and then your token.

In: header

Path Parameters

agent*string

The unique identifier of the agent

evaluationRun*string

The evaluation run ID

Formatuuid

tenant*string

The tenant identifier

Query Parameters

PageSize?integer

The maximum number of items to return per page

Formatint32

PageToken.Skip?integer

Formatint32

PageToken.Cursor?string

Opaque cursor token from an external system (e.g. RAG API). When set, M:Fruxon.Common.Collections.PageToken.AsString returns this value directly instead of encoding Fruxon.Common.Collections.PageToken.Skip.

Response Body

curl -X GET "https://api.fruxon.com/v1/tenants/string/agents/string/evaluationRuns/497f6eca-6276-4993-bfeb-53cbbbba6f08/samples"

curl -X GET "https://api.fruxon.com/v1/tenants/string/agents/string/evaluationRuns/497f6eca-6276-4993-bfeb-53cbbbba6f08/samples"

{
  "items": [
    {
      "sampleId": "string",
      "sampleIndex": 0,
      "parameters": {},
      "baseRevisionResult": {
        "response": "string",
        "trace": null,
        "sessionId": "string",
        "links": [],
        "executionRecordId": null,
        "status": "UNSPECIFIED",
        "humanApprovalRequestId": null,
        "suppressOutbound": false,
        "messageBlocks": null
      },
      "candidateRevisionResult": {
        "response": "string",
        "trace": null,
        "sessionId": "string",
        "links": [],
        "executionRecordId": null,
        "status": "UNSPECIFIED",
        "humanApprovalRequestId": null,
        "suppressOutbound": false,
        "messageBlocks": null
      },
      "llmEvaluation": {
        "score": 0,
        "overallAssessment": "string",
        "summary": "string",
        "analysis": {},
        "keyImprovements": [],
        "concernsRisks": [],
        "deploymentRecommendations": [],
        "monitoringPriorities": [],
        "errorCategory": null,
        "errorTitle": "string",
        "errorRecommendation": "string"
      },
      "systemEvaluation": {
        "baseRuntimeMs": 0,
        "candidateRuntimeMs": 0,
        "baseCost": 0,
        "candidateCost": 0,
        "runtimeAnalysis": "string",
        "costAnalysis": "string"
      },
      "baseRuntimeMs": 0,
      "candidateRuntimeMs": 0
    }
  ],
  "nextPageToken": "string",
  "totalCount": 0
}

Empty

{
  "type": "string",
  "title": "string",
  "status": 0,
  "detail": "string",
  "instance": "string",
  "property1": null,
  "property2": null
}

List per-sample comparisons for an evaluation run.

Authorization

Path Parameters

Query Parameters

Response Body

200

401

403

404