NVIDIA NeMo Parakeet TDT 0.6B v3 is an automatic speech recognition (ASR) model from NVIDIA's NeMo toolkit. Parakeet models are state-of-the-art ASR models trained on large-scale English audio data.
Links
Tags
Moonshine Tiny is a lightweight speech-to-text model optimized for fast transcription. It is designed for efficient on-device ASR with high accuracy relative to its size.
Links
Tags
WhisperX Tiny is a fast and accurate speech recognition model with speaker diarization capabilities. Built on OpenAI's Whisper with additional features for alignment and speaker segmentation.
Links
Tags
Repository: localaiLicense: apache-2.0
Omnilingual ASR CTC 300M (int8) is a multilingual automatic speech recognition model supporting 1,600+ languages. Based on Meta's omniASR_CTC_300M architecture (Wav2Vec2 with CTC head), quantized to int8 for efficient inference. Uses the sherpa-onnx backend with ONNX Runtime.
Links
Tags
Repository: localaiLicense: apache-2.0
Streaming English ASR: sherpa-onnx zipformer transducer (int8, chunk-16 left-128). Low-latency real-time transcription with endpoint detection via sherpa-onnx's online recognizer. English-only; for multilingual offline ASR see omnilingual-0.3b-ctc-q8-sherpa.
Links
Tags

VibeVoice Realtime 0.5B (C++ / GGML, Q8_0) - native C++ port of Microsoft VibeVoice via the vibevoice-cpp backend. 24kHz mono TTS with voice cloning from a single reference voice prompt. Default voice prompt: en-Carter_man.
Links
Tags

VibeVoice ASR 7B (C++ / GGML, Q4_K) - long-form speech-to-text with speaker diarization. Returns per-speaker JSON segments with start/end timestamps. English-only. ~10 GB download.
Links
Tags
Qwen3-ASR is an automatic speech recognition model supporting multiple languages and batch inference.
Links
Tags
Qwen3-ASR is an automatic speech recognition model supporting multiple languages and batch inference.
Links
Tags
Qwen3-ASR is an automatic speech recognition model supporting multiple languages and batch inference.
Links
Tags
Qwen3-ASR is an automatic speech recognition model supporting multiple languages and batch inference.
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags
Port of OpenAI's Whisper model in C/C++
Links
Tags