1.1 KiB
1.1 KiB
Groq Whisper API (free)
Transcribe audio files using Groq's free Whisper inference API.
Quick start
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
Defaults:
- Model:
whisper-large-v3(Groq's fastest whisper model) - Output:
<input>.txt
Useful flags
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-large-v3 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
API key
Uses the GROQ_API_KEY environment variable (already configured on the gateway).
Models available
whisper-large-v3- Latest and fastest on Groq (recommended)whisper-large-v2- Slightly older but still fastwhisper-base- Faster but less accurate
Why Groq?
- Free — no per-minute charges
- Fast — Groq's LPU delivers near-real-time transcription
- No quota limits — generous free tier