Drops silence-only buffers
Silero VAD helps Vocalinux ignore chunks that do not contain speech before they reach the recognition engine.
v0.12.0 Silero VAD
Vocalinux now uses Silero neural voice activity detection when ONNX Runtime support is available. It identifies speech before transcription, drops silence-only buffers, and falls back safely when the neural backend is not installed.
Silero VAD helps Vocalinux ignore chunks that do not contain speech before they reach the recognition engine.
Better speech boundaries reduce empty transcriptions and lower the chance that silence becomes stray text.
The existing 1-5 VAD sensitivity setting works for both Silero and the amplitude fallback.
The official installer attempts to install neural VAD support. If ONNX Runtime is not available, Vocalinux logs the fallback and continues with amplitude-based VAD.
pip install "vocalinux[vad]"The Recognition tab shows which backend is active. Higher VAD sensitivity values are more responsive to quiet speech for both backends.
Silero runs on your machine. Voice activity detection does not require a cloud service.
The VAD step is a lightweight pre-filter before whisper.cpp, Whisper, VOSK, or Remote API recognition.
Cleaner speech boundaries also help short recordings and stop-on-release flows avoid silence-only output.