Back to skills

speech

ai-tools

Generates spoken audio from text using the OpenAI Audio API. Supports single clips and batch jobs, with built-in voices and optional delivery instructions for tone, pacing, and emphasis. Uses a bundle

Setup & Installation

npx skills add https://github.com/openai/speech --skill speech
or paste the link and ask your coding assistant to install it
https://github.com/openai/speech
View on GitHub

What This Skill Does

Generates spoken audio from text using the OpenAI Audio API. Supports single clips and batch jobs, with built-in voices and optional delivery instructions for tone, pacing, and emphasis. Uses a bundled CLI for reproducible runs. The bundled CLI handles batching, rate limiting, and output organization so you don't have to wire up the API manually each time.

When to use it

  • Working with speech functionality
  • Implementing speech features
  • Debugging speech related issues