fal-audio

ai-tools

Setup & Installation

npx skills add https://github.com/fal-ai-community/fal-audio --skill fal-audio
or paste the link and ask your coding assistant to install it
https://github.com/fal-ai-community/fal-audio

What This Skill Does

Converts text to speech and transcribes audio to text using fal.ai's audio models. Supports multiple TTS engines like MiniMax and ElevenLabs, plus Whisper-based transcription with optional speaker diarization and multi-language support. Handles both TTS and STT through one interface with access to multiple model providers, so you don't need separate accounts or scripts for each.
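
As an illustration of the "one interface" idea, here is a minimal Python sketch that builds TTS and STT requests in the same shape and sends them through a single function. It is not the skill's own code. The endpoint IDs and the argument names (`text`, `audio_url`, `language`, `diarize`) are assumptions based on fal.ai's public model pages and should be checked against the current catalog. The helpers `tts_request`, `stt_request`, and `run` are hypothetical names introduced here. The real call goes through `fal_client.subscribe` from the `fal-client` package, which expects a `FAL_KEY` environment variable.

```python
from typing import Any, Callable, Dict, Optional, Tuple

# Assumed endpoint IDs; verify them in fal.ai's model catalog before use.
TTS_ENDPOINT = "fal-ai/minimax/speech-02-hd"
STT_ENDPOINT = "fal-ai/whisper"

# A request is just (endpoint ID, arguments dict), whatever the model does.
Request = Tuple[str, Dict[str, Any]]


def tts_request(text: str, endpoint: str = TTS_ENDPOINT) -> Request:
    """Build a text-to-speech request (argument name `text` is an assumption)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    return endpoint, {"text": text}


def stt_request(
    audio_url: str,
    language: Optional[str] = None,
    diarize: bool = False,
    endpoint: str = STT_ENDPOINT,
) -> Request:
    """Build a transcription request; optional fields are sent only when set."""
    arguments: Dict[str, Any] = {"audio_url": audio_url}
    if language:
        arguments["language"] = language
    if diarize:
        arguments["diarize"] = True
    return endpoint, arguments


def run(request: Request, submit: Optional[Callable[..., Any]] = None) -> Any:
    """Send a request. `submit` defaults to fal_client.subscribe and can be
    replaced with a stub for offline testing."""
    endpoint, arguments = request
    if submit is None:
        import fal_client  # pip install fal-client; reads FAL_KEY from the environment
        submit = fal_client.subscribe
    return submit(endpoint, arguments=arguments)
```

With a key configured, `run(tts_request("Hello"))` and `run(stt_request("https://example.com/clip.wav", diarize=True))` would take the same path. Switching providers (for example to an ElevenLabs endpoint) would mean passing a different `endpoint` string, assuming that model accepts the same arguments.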

When to use it

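  • Generating speech from text with fal.ai TTS models such as MiniMax or ElevenLabs
  • Transcribing audio with Whisper, optionally with speaker diarization or in multiple languages
  • Debugging fal.ai audio requests and their results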