mirror of
https://github.com/ksyasuda/dotfiles.git
synced 2026-03-21 18:11:27 -07:00
update skills
31
.agents/skills/speech/references/audio-api.md
Normal file
@@ -0,0 +1,31 @@
# Audio Speech API quick reference
## Endpoint
- Create speech: `POST /v1/audio/speech`
## Default model
- `gpt-4o-mini-tts-2025-12-15`
## Other speech models (if requested)
- `gpt-4o-mini-tts`
- `tts-1`
- `tts-1-hd`
## Core parameters
- `model`: speech model
- `input`: text to synthesize (max 4096 characters)
- `voice`: built-in voice name
- `instructions`: optional style directions (not supported for `tts-1` or `tts-1-hd`)
- `response_format`: `mp3`, `opus`, `aac`, `flac`, `wav`, or `pcm`
- `speed`: 0.25 to 4.0
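
The parameter limits above can be checked before a request is sent. The following is a minimal sketch, not the official client: the helper names `build_speech_payload` and `build_speech_request` are hypothetical, and the base URL `https://api.openai.com` is an assumption that is not stated in this reference. It uses only the Python standard library and builds the request without sending it.

```python
import json
import urllib.request

# Limits copied from this reference.
MAX_INPUT_CHARS = 4096
SPEED_MIN, SPEED_MAX = 0.25, 4.0
FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
NO_INSTRUCTIONS_MODELS = {"tts-1", "tts-1-hd"}

# Assumption: the standard base URL; change it for a proxy or compatible server.
BASE_URL = "https://api.openai.com"


def build_speech_payload(text, voice, model="gpt-4o-mini-tts-2025-12-15",
                         instructions=None, response_format="mp3", speed=None):
    """Validate arguments against the documented limits and return the JSON body."""
    if not text or len(text) > MAX_INPUT_CHARS:
        raise ValueError(f"input must be 1 to {MAX_INPUT_CHARS} characters")
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format}")
    if speed is not None and not SPEED_MIN <= speed <= SPEED_MAX:
        raise ValueError(f"speed must be between {SPEED_MIN} and {SPEED_MAX}")
    if instructions and model in NO_INSTRUCTIONS_MODELS:
        raise ValueError(f"instructions is not supported for {model}")

    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    # Optional fields are only included when the caller sets them.
    if instructions:
        payload["instructions"] = instructions
    if speed is not None:
        payload["speed"] = speed
    return payload


def build_speech_request(payload, api_key):
    """Return an unsent urllib Request for POST /v1/audio/speech."""
    return urllib.request.Request(
        BASE_URL + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send the request, pass the result to `urllib.request.urlopen` and write the response body to a file whose extension matches `response_format`.
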
## Built-in voices
- `alloy`, `ash`, `ballad`, `cedar`, `coral`, `echo`, `fable`, `marin`, `nova`, `onyx`, `sage`, `shimmer`, `verse`
## Output notes
- Default format is `mp3`.
- `pcm` is raw 24 kHz, 16-bit signed little-endian samples with no header.
- `wav` includes a header (better for quick playback).
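
Because `pcm` output has no header, most players cannot open it directly. The sketch below shows one way to wrap such bytes in a WAV container with the standard-library `wave` module. The function name `pcm_to_wav_bytes` is hypothetical, and mono output is an assumption: this reference states the sample rate and bit depth but not the channel count.

```python
import io
import wave


def pcm_to_wav_bytes(pcm, sample_rate=24000, channels=1):
    """Wrap raw 16-bit little-endian PCM bytes in a WAV container.

    Assumption: one channel (mono); pass channels=2 if the data is stereo.
    """
    bytes_per_frame = 2 * channels  # 16-bit samples are 2 bytes each
    if len(pcm) % bytes_per_frame != 0:
        raise ValueError("PCM length is not a whole number of frames")

    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(2)          # 2 bytes = 16 bits per sample
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm)
    return buffer.getvalue()
```

Writing the returned bytes to a `.wav` file gives a result that ordinary audio players can open, which is the same convenience the `wav` response format provides without the extra step.
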
## Compliance note
- Provide a clear disclosure that the voice is AI-generated.