Skip to main content

Guides

Gateway Plugins & Routing

Advanced request features on the gateway — file parsing, response healing, context compression, provider routing, and generation lookup.

Overview

When a request targets a frontier model (any model id with a slash, e.g. openai/gpt-5.4), you can opt into extra processing via the plugins array on a chat completion request.

Plugins

json
{
  "model": "anthropic/claude-sonnet-4.6",
  "messages": [{ "role": "user", "content": "Summarize this PDF" }],
  "plugins": [
    { "id": "web" },
    "file-parser",
    "response-healing",
    "context-compression"
  ]
}
PluginWhat it does
webAdds live web search results as grounding
file-parserExtracts text from attached files before the model sees them
response-healingRepairs malformed/truncated structured output
context-compressionCompresses long context to fit the model window

You can also influence which upstream serves a frontier model via routing preferences in the request body.

Generation Lookup

After a completion, fetch detailed generation metadata (token counts, upstream provider, native cost) by id:

bash
curl https://api.callmissed.com/v1/generation?id=gen_abc123 \
  -H "Authorization: Bearer cm_your_key"
GET/v1/generation Auth

Fetch generation metadata (tokens, provider, cost) for a completion by id

Query Parameters

idstring (generation id)
Was this page helpful?