Anthropic-Compatible API — CallMissed Docs

ANTHROPIC-COMPATIBLE API

Use the Anthropic SDK with CallMissed — just change the base URL. Full Messages API compatibility.

Overview

CallMissed provides an Anthropic Messages API-compatible endpoint alongside the OpenAI-compatible API. If you're already using the Anthropic SDK, you can switch to CallMissed by changing only the base_url.

Endpoints:

POST /v1/messages — chat completions (streaming + non-streaming)
POST /v1/messages/count_tokens — token count estimation (real BPE, not char-based)
GET /anthropic/v1/models — list models in Anthropic shape with capability metadata
GET /anthropic/v1/models/{model_id} — single model detail
POST /anthropic/v1/messages — alternate path for the chat endpoint

Authentication: Use either header style:

x-api-key: cm_your_key (Anthropic SDK default)
Authorization: Bearer cm_your_key (OpenAI style)
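
As a minimal sketch of the two header styles, the helper below builds (but does not send) a raw request with Python's standard library. The names build_request and style are illustrative only and are not part of any CallMissed SDK.

```python
import json
import urllib.request

BASE_URL = "https://api.callmissed.com"


def build_request(api_key: str, body: dict, style: str = "x-api-key") -> urllib.request.Request:
    """Build a POST /v1/messages request using one of the two auth header styles.

    `style` is a hypothetical switch for this example: "x-api-key" or "bearer".
    """
    headers = {"Content-Type": "application/json"}
    if style == "x-api-key":
        # Anthropic SDK default
        headers["x-api-key"] = api_key
    elif style == "bearer":
        # OpenAI style
        headers["Authorization"] = f"Bearer {api_key}"
    else:
        raise ValueError(f"unknown auth style: {style}")
    return urllib.request.Request(
        f"{BASE_URL}/v1/messages",
        data=json.dumps(body).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen would send it; either style should authenticate the same key.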

Basic Usage

import anthropic

client = anthropic.Anthropic(
    api_key="cm_your_key",
    base_url="https://api.callmissed.com"
)

message = client.messages.create(
    model="claude-sonnet-4.6",
    max_tokens=1024,
    system="You are a helpful assistant.",
    messages=[
        {"role": "user", "content": "What is the capital of India?"}
    ]
)

print(message.content[0].text)

Response:

json

{
  "id": "msg-abc123def456",
  "type": "message",
  "role": "assistant",
  "content": [
    {"type": "text", "text": "The capital of India is New Delhi."}
  ],
  "model": "claude-sonnet-4.6",
  "stop_reason": "end_turn",
  "stop_sequence": null,
  "usage": {
    "input_tokens": 25,
    "output_tokens": 12
  }
}

Streaming

Set stream: true in the request body to receive Server-Sent Events with the full Anthropic streaming lifecycle. The SDK's messages.stream() helper sets this for you:

import anthropic

client = anthropic.Anthropic(
    api_key="cm_your_key",
    base_url="https://api.callmissed.com"
)

with client.messages.stream(
    model="claude-sonnet-4.6",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Tell me a short story."}]
) as stream:
    for text in stream.text_stream:
        print(text, end="", flush=True)

SSE event lifecycle:

event: message_start        → message metadata + input token count
event: content_block_start  → new content block begins
event: content_block_delta  → text chunks (repeats)
event: content_block_stop   → content block complete
event: message_delta        → stop_reason + output token count
event: message_stop          → stream complete
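
If you read the stream without the SDK, the lifecycle above can be folded into a final result roughly as follows. This is a sketch that assumes the data payloads match the shapes Anthropic documents for each event (for example, a text_delta inside content_block_delta); collect_stream is a hypothetical helper name.

```python
import json


def collect_stream(lines):
    """Fold an iterable of SSE lines into text, stop reason, and token counts."""
    text_parts = []
    result = {"text": "", "stop_reason": None, "input_tokens": None, "output_tokens": None}
    event = None
    for raw in lines:
        line = raw.strip()
        if line.startswith("event:"):
            # Remember the event name; its payload arrives on the next data: line.
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            data = json.loads(line[len("data:"):].strip())
            if event == "message_start":
                result["input_tokens"] = data["message"]["usage"]["input_tokens"]
            elif event == "content_block_delta" and data["delta"].get("type") == "text_delta":
                text_parts.append(data["delta"]["text"])
            elif event == "message_delta":
                result["stop_reason"] = data["delta"].get("stop_reason")
                result["output_tokens"] = data.get("usage", {}).get("output_tokens")
            elif event == "message_stop":
                break
    result["text"] = "".join(text_parts)
    return result
```

Events the sketch does not care about (content_block_start, content_block_stop) are simply skipped.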

Parameters

Parameter	Type	Required	Description
`model`	string	Yes	Model ID (e.g. `claude-sonnet-4.6`, `sarvam-30b`, `openai/gpt-5.4`)
`max_tokens`	integer	Yes	Maximum tokens to generate
`messages`	array	Yes	List of `{role, content}` objects
`system`	string	No	System prompt (top-level, not in messages)
`stream`	boolean	No	Enable streaming (default: false)
`temperature`	number	No	Sampling temperature (0–1)
`top_p`	number	No	Nucleus sampling (0–1)
`top_k`	integer	No	Top-K sampling
`stop_sequences`	array	No	Stop sequences
`metadata`	object	No	Request metadata (e.g. `{"user_id": "u123"}`)

> Note: Unlike the OpenAI API, max_tokens is required and system is a top-level parameter (not a message with role: "system").
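
When porting an OpenAI-style request, the two differences in the note can be handled mechanically. The sketch below is illustrative only: to_anthropic_request and default_max_tokens are made-up names, and it assumes every message content is a plain string.

```python
def to_anthropic_request(openai_body: dict, default_max_tokens: int = 1024) -> dict:
    """Convert an OpenAI-style chat body into the Messages API shape.

    - role "system" messages are hoisted into the top-level `system` field
    - `max_tokens` is always set, because the Messages API requires it
    """
    system_parts = []
    messages = []
    for msg in openai_body["messages"]:
        if msg["role"] == "system":
            system_parts.append(msg["content"])
        else:
            messages.append({"role": msg["role"], "content": msg["content"]})

    body = {
        "model": openai_body["model"],
        "max_tokens": openai_body.get("max_tokens", default_max_tokens),
        "messages": messages,
    }
    if system_parts:
        body["system"] = "\n\n".join(system_parts)
    return body
```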

Model Aliasing

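On the Anthropic endpoint, bare claude-* model names are automatically prefixed with anthropic/ for routing through OpenRouter. Names that already carry a provider prefix, and native models such as sarvam-30b, pass through unchanged: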

You send	Routed as
`claude-sonnet-4.6`	`anthropic/claude-sonnet-4.6`
`claude-opus-4.6`	`anthropic/claude-opus-4.6`
`sarvam-30b`	`sarvam-30b` (Sarvam native, no prefix)
`openai/gpt-5.4`	`openai/gpt-5.4` (already has prefix)

You can use any model from our Models catalog — not just Anthropic models.
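
The routing table can be approximated client-side, for example to predict which ID will appear in logs. This is a guess at the rule based only on the rows above, not the server's actual implementation; routed_model is a hypothetical name.

```python
def routed_model(name: str) -> str:
    """Approximate the aliasing rule shown in the table above."""
    if "/" in name:
        # Already has a provider prefix, e.g. "openai/gpt-5.4"
        return name
    if name.startswith("claude-"):
        # Bare Claude names gain the anthropic/ prefix
        return f"anthropic/{name}"
    # Other bare names (e.g. "sarvam-30b") are assumed to pass through
    return name
```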

Token Counting

Estimate input token count before sending a request. The endpoint uses a BPE tokenizer (tiktoken cl100k_base), which is close to Claude's real tokenizer on typical English prompts and noticeably more accurate than char-length heuristics.

bash

curl -X POST https://api.callmissed.com/v1/messages/count_tokens \
  -H "x-api-key: cm_your_key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4.6",
    "messages": [{"role": "user", "content": "Hello, how are you?"}],
    "system": "You are a helpful assistant."
  }'

Response:

json

{"input_tokens": 15}

Image content blocks contribute a fixed estimate (~258 tokens per image) rather than a fetch-and-resize pass. Tool definitions are counted against the total by serializing each to JSON and tokenizing the schema.
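
To show how these pieces might add up, here is a rough client-side sketch. It is not the server's algorithm: any per-message overhead the endpoint applies is unknown and is ignored here, so totals will not match the endpoint exactly. estimate_input_tokens and the count callback are hypothetical names; count stands in for whatever tokenizer you plug in.

```python
import json
from typing import Callable

IMAGE_TOKEN_ESTIMATE = 258  # fixed per-image figure quoted above


def estimate_input_tokens(body: dict, count: Callable[[str], int]) -> int:
    """Sum text, image, and tool-schema estimates for a Messages request body."""
    total = 0

    system = body.get("system")
    if isinstance(system, str):
        total += count(system)

    for msg in body.get("messages", []):
        content = msg["content"]
        if isinstance(content, str):
            total += count(content)
            continue
        for block in content:
            if block.get("type") == "text":
                total += count(block["text"])
            elif block.get("type") == "image":
                # Images use a flat estimate instead of being decoded
                total += IMAGE_TOKEN_ESTIMATE

    for tool in body.get("tools", []):
        # Tool definitions are serialized to JSON, then tokenized
        total += count(json.dumps(tool))

    return total
```

With the tiktoken package installed, a closer count function would be `lambda s: len(tiktoken.get_encoding("cl100k_base").encode(s))`; a whitespace split is enough to exercise the sketch.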

Listing Models

List all available models via the Anthropic-shape endpoint:

bash

curl https://api.callmissed.com/anthropic/v1/models \
  -H "x-api-key: cm_your_key"

Response:

json

{
  "data": [
    {
      "type": "model",
      "id": "anthropic/claude-sonnet-4.6",
      "display_name": "Claude Sonnet 4.6",
      "created_at": "2023-11-14T22:13:20+00:00",
      "description": "Fast and intelligent. Best balance of speed and capability.",
      "category": "llm",
      "context_window": 200000,
      "context_length": 200000,
      "pricing": {"input": 4.00, "output": 20.00, "unit": "per_million_tokens", "currency": "USD"},
      "supports_streaming": true,
      "supports_tools": true,
      "supports_reasoning": true,
      "supports_vision": true
    }
  ],
  "has_more": false,
  "first_id": "...",
  "last_id": "..."
}

Fetch a single model at GET /anthropic/v1/models/{model_id}.
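
The capability flags in the listing make it easy to pick models programmatically. A small sketch, assuming the response has already been parsed into a dict with the shape shown above (models_with is an illustrative name):

```python
def models_with(listing: dict, *flags: str) -> list[str]:
    """Return IDs of models for which every named capability flag is true."""
    return [
        model["id"]
        for model in listing.get("data", [])
        if all(model.get(flag) for flag in flags)
    ]
```

For example, `models_with(listing, "supports_vision", "supports_tools")` would keep only models that report both flags as true.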

Vision (Image Input)

You can send images on any model whose supports_vision flag is true in the model listing. Vision-capable models currently include the OpenAI GPT-5.4 family, Anthropic Claude 4.5 / 4.6 (Opus, Sonnet, Haiku), Google Gemini 3 / 3.1, xAI Grok 4.20, Qwen 3.5 (Plus, Flash), Moonshot Kimi K2.5 / K2.6, Google Gemma 4 26B, Mistral Small 3.1 / Small 4, and both auto-routers. Models without vision support reject image content with a 400 invalid_request_error before the upstream call, so you won't be charged.

bash

curl -X POST https://api.callmissed.com/v1/messages \
  -H "x-api-key: cm_your_key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4.6",
    "max_tokens": 1024,
    "messages": [{
      "role": "user",
      "content": [
        {"type": "text", "text": "What is in this image?"},
        {"type": "image", "source": {"type": "base64", "media_type": "image/png", "data": "<base64>"}}
      ]
    }]
  }'
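
From Python, the same content array can be assembled from raw image bytes. The helpers below are a sketch with illustrative names (image_block, vision_message); the resulting dict can be passed as one entry of messages in client.messages.create.

```python
import base64


def image_block(data: bytes, media_type: str = "image/png") -> dict:
    """Wrap raw image bytes in a base64 image content block."""
    return {
        "type": "image",
        "source": {
            "type": "base64",
            "media_type": media_type,
            "data": base64.b64encode(data).decode("ascii"),
        },
    }


def vision_message(prompt: str, data: bytes, media_type: str = "image/png") -> dict:
    """Build a user message with a text block followed by an image block."""
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            image_block(data, media_type),
        ],
    }
```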

Error Format

Errors return the Anthropic format (different from the OpenAI endpoints):

json

{
  "type": "error",
  "error": {
    "type": "authentication_error",
    "message": "Invalid API key"
  }
}

Error type	HTTP Status	When
`authentication_error`	401	Bad or missing API key
`permission_error`	403	Account inactive, domain blocked, or free-tier model restriction
`invalid_request_error`	400/402	Bad request or insufficient credits
`rate_limit_error`	429	Plan limit or API key rate limit exceeded
`not_found_error`	404	Model not found
`api_error`	502	Provider failure
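
One way to act on the table is to classify a failed response before deciding what to do next. In this sketch the choice of which error types count as retryable is an assumption of the example, not something CallMissed specifies, and describe_error is a hypothetical name.

```python
# Assumption for this example: only rate limits and provider failures are worth retrying.
RETRYABLE_TYPES = {"rate_limit_error", "api_error"}


def describe_error(status: int, body: dict) -> dict:
    """Summarize an Anthropic-format error body together with its HTTP status."""
    err = body.get("error", {})
    err_type = err.get("type", "unknown")
    return {
        "type": err_type,
        "message": err.get("message", ""),
        "retryable": err_type in RETRYABLE_TYPES,
        # invalid_request_error covers both 400 and 402; 402 means insufficient credits
        "out_of_credits": err_type == "invalid_request_error" and status == 402,
    }
```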

Rate limit headers are returned in Anthropic format:

anthropic-ratelimit-requests-limit: 60
anthropic-ratelimit-requests-remaining: 45
anthropic-ratelimit-requests-reset: 2026-05-01T00:00:00+00:00
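
These headers are enough to compute a simple back-off. The sketch assumes the headers are available as a dict with lower-case keys and that the reset value is an ISO 8601 timestamp as in the sample; seconds_until_reset is an illustrative name.

```python
from datetime import datetime, timezone


def seconds_until_reset(headers: dict, now: datetime) -> float:
    """Return how long to wait before the next request (0.0 if quota remains)."""
    remaining = int(headers["anthropic-ratelimit-requests-remaining"])
    if remaining > 0:
        return 0.0
    reset = datetime.fromisoformat(headers["anthropic-ratelimit-requests-reset"])
    # Never return a negative wait if the reset time has already passed
    return max(0.0, (reset - now).total_seconds())
```

In real use, pass `datetime.now(timezone.utc)` as now; it is a parameter here so the calculation stays deterministic.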

Differences from Anthropic

This endpoint is designed to work with the Anthropic SDK out of the box. Key differences from the official Anthropic API:

`anthropic-version` header is accepted but not required
Model routing — requests can target *any* model in the CallMissed catalogue, not just Anthropic models. Bare claude-* names are auto-prefixed with anthropic/.
Token counting uses a BPE tokenizer approximation (tiktoken cl100k_base). Expect ~5-10% variance from Anthropic's native counts on English prompts; larger on CJK and heavy-punctuation text.
Tools are supported — tools and tool_choice work as documented, and tool_use/tool_result content blocks are preserved.
Vision is supported on models whose supports_vision flag is true. Image content sent to text-only models is rejected with a 400 invalid_request_error before the upstream call, so your credits are safe.
Message Batches API (/v1/messages/batches) is not implemented — use the regular /v1/messages endpoint.
Billing uses CallMissed credits, not Anthropic billing.
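
Because tool_use and tool_result blocks are preserved, a standard tool loop should carry over. The sketch below assumes the block shapes Anthropic documents (a tool_use block with id, name, and input; a tool_result block with tool_use_id and content). tool_results_message and the handlers mapping are hypothetical names for this example.

```python
def tool_results_message(assistant_content: list, handlers: dict) -> dict:
    """Run each requested tool and build the follow-up user message.

    `handlers` maps a tool name to a Python callable that accepts the
    tool's input fields as keyword arguments.
    """
    results = []
    for block in assistant_content:
        if block.get("type") != "tool_use":
            continue  # skip text and other block types
        output = handlers[block["name"]](**block["input"])
        results.append({
            "type": "tool_result",
            "tool_use_id": block["id"],  # ties the result back to the request
            "content": str(output),
        })
    return {"role": "user", "content": results}
```

Appending the assistant turn and this message to messages, then calling the endpoint again, lets the model continue with the tool output.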
