Speech to Text
Translate audio in any supported language to English text. OpenAI-compatible endpoint.
Overview
The Audio Translation API translates speech in any of 24 supported languages to English text. This is the OpenAI-compatible /v1/audio/translations endpoint.
Unlike Speech to Text (which transcribes in the original language), this endpoint always outputs English.
Endpoint: POST /v1/audio/translations
Supported input languages: Hindi, Bengali, Tamil, Telugu, Kannada, Malayalam, Marathi, Gujarati, Punjabi, Odia, Assamese, Urdu, Nepali, Konkani, Kashmiri, Sindhi, Sanskrit, Santali, Manipuri, Brij, Maithili, Dogri, English — plus auto-detection.
Basic Usage
from openai import OpenAI
client = OpenAI(
api_key="cm_your_key",
base_url="https://api.callmissed.com/v1"
)
# Translate Hindi audio to English text
with open("hindi_audio.wav", "rb") as f:
translation = client.audio.translations.create(
model="saaras:v3",
file=f,
)
print(translation.text)Response:
{"text": "Hello, how are you? I wanted to discuss the project."}Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
file | file | Yes | Audio file (WAV, MP3, AAC, OGG, FLAC, WebM, M4A) |
model | string | No | Model ID (default: saaras:v3) |
response_format | string | No | json (default), text, or verbose_json |
temperature | float | No | Sampling temperature |
prompt | string | No | Prompt to guide transcription style |
Response Formats
json (default)
{"text": "Hello, how are you?"}text
Returns plain text with no JSON wrapping.
verbose_json
{
"task": "translate",
"language": "en",
"duration": 4.52,
"text": "Hello, how are you?",
"segments": [],
"words": []
}> Tip: For transcription in the original language (not translated), use Speech to Text instead. For output modes like transliteration or code-mixing, use the mode parameter on the transcription endpoint.