Files
dotfiles/.agents/skills/transcribe/references/api.md
2026-03-17 16:53:22 -07:00

457 B

gpt-4o-transcribe-diarize quick reference

  • Input formats: mp3, mp4, mpeg, mpga, m4a, wav, webm.
  • Max file size: 25 MB per request.
  • response_format options: text, json, diarized_json.
  • For audio longer than ~30 seconds, pass chunking_strategy (use "auto" to split into chunks).
  • Known speakers: up to 4 references via extra_body known_speaker_names + known_speaker_references (data URLs).
  • Prompting is not supported for gpt-4o-transcribe-diarize.