Model Gallery

51 models from 1 repositories

Filter by type:

Filter by tags:

omnilingual-0.3b-ctc-q8-sherpa

Omnilingual ASR CTC 300M (int8) is a multilingual automatic speech recognition model supporting 1,600+ languages. Based on Meta's omniASR_CTC_300M architecture (Wav2Vec2 with CTC head), quantized to int8 for efficient inference. Uses the sherpa-onnx backend with ONNX Runtime.

Repository: localaiLicense: apache-2.0

streaming-zipformer-en-sherpa

Streaming English ASR: sherpa-onnx zipformer transducer (int8, chunk-16 left-128). Low-latency real-time transcription with endpoint detection via sherpa-onnx's online recognizer. English-only; for multilingual offline ASR see omnilingual-0.3b-ctc-q8-sherpa.

Repository: localaiLicense: apache-2.0

silero-vad-sherpa

Silero VAD served through the sherpa-onnx backend. Uses the same ONNX weights as the dedicated silero-vad backend, loaded through sherpa-onnx's C VAD API. Pairs with the sherpa-onnx ASR entries for round-trip audio pipelines.

Repository: localaiLicense: mit

vits-ljs-sherpa

VITS-LJS English single-speaker TTS served through the sherpa-onnx backend. Trained on the LJSpeech corpus at 22.05 kHz. Pairs with the sherpa-onnx ASR entries for round-trip audio pipelines.

Repository: localaiLicense: apache-2.0

vits-piper-it_IT-paola-sherpa

Italian (it_IT) single-speaker Piper VITS voice "paola" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data, so it works for Italian out of the box.

Repository: localaiLicense: other

vits-piper-it_IT-dii-high-sherpa

Italian (it_IT) single-speaker Piper VITS voice "dii" (high quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Non-commercial use only (CC BY-NC-SA 4.0).

Repository: localaiLicense: cc-by-nc-sa-4.0

vits-piper-it_IT-miro-high-sherpa

Italian (it_IT) single-speaker Piper VITS voice "miro" (high quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Non-commercial use only (CC BY-NC-SA 4.0).

Repository: localaiLicense: cc-by-nc-sa-4.0

vits-piper-it_IT-riccardo-x_low-sherpa

Italian (it_IT) single-speaker Piper VITS voice "riccardo" (x-low quality, 16 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: other

vits-piper-en_US-amy-sherpa

English (en_US) single-speaker Piper VITS voice "amy" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: other

vits-piper-es_ES-davefx-sherpa

Spanish (es_ES) single-speaker Piper VITS voice "davefx" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: cc0-1.0

vits-piper-fr_FR-siwis-sherpa

French (fr_FR) single-speaker Piper VITS voice "siwis" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: cc-by-4.0

vits-piper-de_DE-thorsten-sherpa

German (de_DE) single-speaker Piper VITS voice "thorsten" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: cc0-1.0

vits-piper-en_GB-alan-low-sherpa

English (en_GB) single-speaker Piper VITS voice "alan" (low quality, 16 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: other

vits-piper-en_GB-alan-medium-sherpa

English (en_GB) single-speaker Piper VITS voice "alan" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: other

vits-piper-en_GB-alba-medium-sherpa

English (en_GB) single-speaker Piper VITS voice "alba" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: cc-by-4.0

vits-piper-en_GB-aru-medium-sherpa

English (en_GB) multi-speaker (12 voices) Piper VITS voice "aru" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Pick a speaker with the numeric voice/speaker id.

Repository: localaiLicense: cc-by-4.0

vits-piper-en_GB-cori-high-sherpa

English (en_GB) single-speaker Piper VITS voice "cori" (high quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: cc0-1.0

vits-piper-en_GB-cori-medium-sherpa

English (en_GB) single-speaker Piper VITS voice "cori" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: cc0-1.0

vits-piper-en_GB-dii-high-sherpa

English (en_GB) single-speaker Piper VITS voice "dii" (high quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Non-commercial use only (CC BY-NC-SA 4.0).

Repository: localaiLicense: cc-by-nc-sa-4.0

vits-piper-en_GB-jenny_dioco-medium-sherpa

English (en_GB) single-speaker Piper VITS voice "jenny_dioco" (medium quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data.

Repository: localaiLicense: other

vits-piper-en_GB-miro-high-sherpa

English (en_GB) single-speaker Piper VITS voice "miro" (high quality, 22.05 kHz), served through the sherpa-onnx backend with native streaming TTS. Ships espeak-ng phonemization data. Non-commercial use only (CC BY-NC-SA 4.0).

Repository: localaiLicense: cc-by-nc-sa-4.0

Page 1