Back to Skills Hub
MLX Whisper

MLX Whisper

@kevin37li
developmentspeech-to-textApple Siliconlocal-processing

Local speech-to-text using Apple MLX, optimized for Apple Silicon Macs. Transcribe audio and video files, generate subtitles, and translate speech with multiple model options ranging from lightweight to high-accuracy variants.

🚀 MLX Whisper converts speech to text directly on your Apple Silicon Mac with blazing-fast performance. No cloud uploads needed—your audio stays private and local. Simply point it at an audio or video file, and get accurate transcriptions in seconds.

💡 Perfect for transcribing meetings, podcasts, interviews, and videos. Generate subtitles in SRT format, translate foreign audio to English, or save transcripts as text files. Choose from lightweight models for quick results or powerful models for studio-quality accuracy.

✨ Optimized for M1/M2/M3/M4 Macs, it's faster and more private than cloud solutions. Models auto-download on first use, and the recommended whisper-large-v3-turbo balances speed and excellence.

GitHub

Requirements

MLX Framework

Apple MLX machine learning framework