Back to Skills Hub
OpenRouter Audio Transcription

OpenRouter Audio Transcription

@obviyus
developmentaudio transcriptionspeech-to-textOpenRouter

Transcribe audio files using OpenRouter's chat completions API with input_audio content type. Supports any audio-capable model including Google Gemini and OpenAI GPT-4o. Automatically converts audio to WAV format, handles base64 encoding, and extracts transcripts with customizable prompts.

🚀 Instantly convert audio files into text using OpenRouter's audio-capable models. Simply point the script at your audio file (M4A, OGG, WAV, etc.), and it handles the rest—converting to the right format, sending to the API, and returning a clean transcript. Works with any audio model available on OpenRouter.

💡 Perfect for transcribing interviews, meetings, voice notes, and podcasts without manual typing. Customize the output with speaker labels, save directly to files, and track usage through your OpenRouter dashboard. Great for content creators, researchers, and anyone drowning in audio files.

✨ No complex setup needed—just set your API key and run. The script intelligently handles large files and format conversions behind the scenes, so you focus on what matters.

GitHub

Requirements

OpenRouter API Key

Valid OpenRouter API key for accessing audio-capable models