mirror of
https://github.com/ksyasuda/dotfiles.git
synced 2026-03-20 18:11:27 -07:00
902 B
902 B
Audio Speech API quick reference
Endpoint
- Create speech:
POST /v1/audio/speech
Default model
gpt-4o-mini-tts-2025-12-15
Other speech models (if requested)
gpt-4o-mini-ttstts-1tts-1-hd
Core parameters
model: speech modelinput: text to synthesize (max 4096 characters)voice: built-in voice nameinstructions: optional style directions (not supported fortts-1ortts-1-hd)response_format:mp3,opus,aac,flac,wav, orpcmspeed: 0.25 to 4.0
Built-in voices
alloy,ash,ballad,cedar,coral,echo,fable,marin,nova,onyx,sage,shimmer,verse
Output notes
- Default format is
mp3. pcmis raw 24 kHz 16-bit little-endian samples (no header).wavincludes a header (better for quick playback).
Compliance note
- Provide a clear disclosure that the voice is AI-generated.