mirror of
https://github.com/ksyasuda/dotfiles.git
synced 2026-03-21 06:11:27 -07:00
32 lines
902 B
Markdown
32 lines
902 B
Markdown
# Audio Speech API quick reference
|
|
|
|
## Endpoint
|
|
- Create speech: `POST /v1/audio/speech`
|
|
|
|
## Default model
|
|
- `gpt-4o-mini-tts-2025-12-15`
|
|
|
|
## Other speech models (if requested)
|
|
- `gpt-4o-mini-tts`
|
|
- `tts-1`
|
|
- `tts-1-hd`
|
|
|
|
## Core parameters
|
|
- `model`: speech model
|
|
- `input`: text to synthesize (max 4096 characters)
|
|
- `voice`: built-in voice name
|
|
- `instructions`: optional style directions (not supported for `tts-1` or `tts-1-hd`)
|
|
- `response_format`: `mp3`, `opus`, `aac`, `flac`, `wav`, or `pcm`
|
|
- `speed`: 0.25 to 4.0
|
|
|
|
## Built-in voices
|
|
- `alloy`, `ash`, `ballad`, `cedar`, `coral`, `echo`, `fable`, `marin`, `nova`, `onyx`, `sage`, `shimmer`, `verse`
|
|
|
|
## Output notes
|
|
- Default format is `mp3`.
|
|
- `pcm` is raw 24 kHz 16-bit little-endian samples (no header).
|
|
- `wav` includes a header (better for quick playback).
|
|
|
|
## Compliance note
|
|
- Provide a clear disclosure that the voice is AI-generated.
|