Setup & Installation
What This Skill Does
Converts text to speech (TTS) and transcribes audio to text (STT) using fal.ai's audio models. It supports multiple TTS engines, such as MiniMax and ElevenLabs, and Whisper-based transcription with optional speaker diarization and multi-language support. Both directions go through one interface, so you don't need separate accounts or scripts for each model provider.
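As a rough illustration of the single-interface idea, the sketch below builds request arguments for both directions and submits them through one helper. It is not this skill's actual implementation. The helper names (`build_tts_request`, `build_stt_request`, `run`) are hypothetical, and the argument field names (`text`, `audio_url`, `diarize`, `language`) are assumptions that should be checked against the schema of whichever fal.ai model you call. It also assumes the `fal-client` Python package is installed and a `FAL_KEY` environment variable is set.

```python
from typing import Any, Dict, Optional


def build_tts_request(text: str) -> Dict[str, Any]:
    """Build arguments for a text-to-speech call.

    The "text" field name is an assumption; individual TTS models
    may expect additional or differently named fields.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {"text": text}


def build_stt_request(
    audio_url: str,
    diarize: bool = False,
    language: Optional[str] = None,
) -> Dict[str, Any]:
    """Build arguments for a Whisper-style transcription call.

    Field names are assumptions. `language` is omitted when None so
    the model can fall back to its own default behaviour.
    """
    args: Dict[str, Any] = {"audio_url": audio_url, "diarize": diarize}
    if language is not None:
        args["language"] = language
    return args


def run(model_id: str, arguments: Dict[str, Any]) -> Any:
    """Submit a request to a fal.ai model and wait for the result.

    `model_id` is supplied by the caller; this sketch does not
    hard-code any model name.
    """
    # Imported lazily so the builders above work without the package.
    import fal_client

    return fal_client.subscribe(model_id, arguments=arguments)
```

A transcription with diarization would then look like `run(stt_model_id, build_stt_request(audio_url, diarize=True))`, and speech generation like `run(tts_model_id, build_tts_request("Hello"))`, where the model IDs come from the fal.ai model listing. Keeping the builders separate from the network call lets the payloads be checked without an API key.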
When to Use It
- Generating speech from text with a fal.ai TTS engine such as MiniMax or ElevenLabs
- Transcribing audio with Whisper, with or without speaker diarization
- Debugging fal audio-related issues
