Create a new evaluation metric for this tenant.
Adds an LLM-judge metric definition to the tenant library and returns its ID. The metric is not applied to any agent until you list it in `PUT AgentEvaluationMetrics` for that agent. Rate-limited per tenant; see `RateLimitingExtensions.SlidingWindowPolicy` for the window.
Authorization
Bearer JWT Authorization header using the Bearer scheme. Enter 'Bearer' [space] and then your token.
In: header
Path Parameters
The tenant identifier
camelCase key used as the criterion ID in LLM judge prompts and responses. Must be unique within the tenant. Example: "responseQuality".
1 <= lengthHuman-readable display name. Example: "Response Quality".
1 <= lengthWhat the LLM judge should evaluate for this metric. Sent verbatim as part of the evaluation criteria JSON.
1 <= lengthSuggested weight (1–100) shown by the frontend when an agent opts into this metric. Used as the seeding hint when auto-normalizing a new agent's metric set.
int321 <= value <= 100Response Body
curl -X POST "https://api.fruxon.com/v1/tenants/string/evaluationMetrics" \ -H "Content-Type: application/json" \ -d '{ "key": "string", "name": "string", "description": "string" }'{
"id": "00000000-0000-0000-0000-000000000000",
"key": "string",
"name": "string",
"description": "string",
"defaultWeight": 0,
"isDeprecated": false,
"createdAt": 0,
"modifiedAt": 0
}{
"type": "string",
"title": "string",
"status": 0,
"detail": "string",
"instance": "string",
"property1": null,
"property2": null
}Deprecate an evaluation metric
Soft-delete: the row stays so historical eval scores remain joinable and agents still pinned to the metric can render a deprecation banner. Deprecated metrics are excluded from `GET EvaluationMetrics` unless `includeDeprecated=true`. To fully detach an agent, replace its set via `PUT AgentEvaluationMetrics`.
Update an existing evaluation metric.
Mutates the metric in place — every agent currently opted-in picks up the new prompt/weight on the next eval run. Use `POST EvaluationMetricDeprecate` instead of editing when you want to retire a metric without breaking historical comparability. System (non-LLM-judge) metrics cannot be edited and will return 400.