Release History

Vocalinux Changelog

Track every release and see how Vocalinux has evolved. From initial alpha to stable releases, follow the journey of Linux voice dictation.

v0.13.0-beta
BetaLatest2026-06-30
  • Guided whisper.cpp model selection: pick a size plus a specialization (English-only, quantized Q5/Q8, or Large v3 Turbo) through split Model Size and Specialization dropdowns with in-app guidance; the --model flag also accepts exact IDs like medium.en-q5_0 and large-v3-turbo (PR #465)
  • Dictation now keeps a space between segments spoken with a pause in between, so words no longer run together after a silence (PR #464)
  • Keyboard shortcuts now work on keyboards hotplugged after Vocalinux starts, with the evdev backend rescanning for new devices and recovering from disconnects (PR #467)
  • Vocalinux now detects KDE Plasma Wayland sessions and points you to enable IBus Wayland for reliable text injection, surfaced during install and when wtype injection fails (PR #466)
  • Wayland: fixed garbled text on non-US keyboard layouts (AZERTY/QWERTZ/Dvorak) and a clipboard-copy hang; ydotool now pastes through the clipboard, which is layout-independent (PR #480)
  • Wayland: use wtype/ydotool instead of IBus on compositors that don't bridge it to native apps like COSMIC, Sway, and Hyprland, fixing silent text drops (PR #486)
  • Wayland/IBus: require a real IM engine before using IBus, so a bare xkb layout no longer causes silent text drops on GNOME/Mutter and other compositors (#478)
  • Wayland: keep the keyboard layout intact by not running setxkbmap, which was flipping XWayland apps to us after dictation (#474)
  • Faster ydotool text injection via an explicit --key-delay (PR #488)
  • Settings dialog height capped on high-resolution displays (PR #465)
  • Refreshed website docs with new feature pages for Remote API, Silero VAD, advanced whisper.cpp settings, and desktop reliability, plus responsive layout polish (PR #470)
v0.12.0-beta
Beta2026-06-07
  • Remote API speech recognition engine with installation and configuration support (PR #335)
  • Silero VAD drops silence-only buffers for cleaner dictation when ONNX Runtime support is available (PR #447)
  • Thread safety hardening for Remote API, IBus, and text injection paths (PR #452)
  • IBus preserves user engines for dead keys and captures the current engine during scoped activation (PR #457, #458)
  • Remote Server settings now respect the Advanced toggle and the settings dialog fits lower-resolution screens (PR #454, #456)
  • CUDA diagnostics now include auto-remediation and behavioral tests (PR #451)
  • Corrected whisper.cpp and VOSK model download size metadata (PR #453)
  • Startup now works without the pynput backend (PR #448)
  • Remote API developer test server documentation (PR #455)
  • Website speech demo browser support clarification (PR #449) and GitHub Sponsors funding configuration
v0.11.0-beta
Beta2026-05-30
  • New Advanced Settings tab with whisper.cpp anti-hallucination parameters - temperature, no_speech_threshold, max segment length, and more (PR #415)
  • IBus engine readiness probe at startup with hardened retries (PR #391)
  • IBus runtime failure recovery without app restart (PR #411)
  • IBus engine instance destruction handled on keyboard layout switch (fixes #388, closes #389)
  • Preserve final speech on stop - no more truncated transcriptions (fixes #401)
  • Play stop sound immediately on release and after audio thread joins (PR #426, #436)
  • Repair pywhispercpp library loading in installer (PR #433)
  • Reduce whisper.cpp CPU threads and ensure GPU backend builds in dev mode (PR #439)
  • Correct openSUSE Tumbleweed dependencies with fallback handling (PR #418, #420)
  • Harden Debian compatibility layer in installer (PR #437)
  • Add Python 3.14 support and bump lxml>=6.1.0 (fixes #404)
  • Validate pyproject.toml/setup.py content before entering local repo mode (fixes #396)
  • Reuse existing whispercpp builds during install (PR #421)
  • Refresh ldconfig after openSUSE typelib install, clarify python3XY placeholder convention (PR #438)
  • Clean up runtime log noise and cache hardware detection
  • Test coverage: recognition internals, IBus edge cases, CI notification suppression (PR #410, #414)
  • Clarify PyPI installation requirements (PR #423)
  • Dependency bumps: Next.js security updates (PR #399, #429), PostCSS
v0.10.2-beta
Beta2026-04-08
  • Handle non-ASCII characters (á, é, ñ, etc.) with ydotool via clipboard paste fallback (fixes #362, PR #376)
  • Detect IBus on Wayland without legacy env vars and fix text injection (PR #381)
  • Start IBus engine process before checking registration to fix startup on some systems (fixes #360, PR #361)
  • Add missing dependencies for Pop!_OS and Ubuntu 24.04+ including cmake, libcairo2-dev, libgirepository (PR #379)
  • Systematic code quality refactor across 20 dimensions (PR #377)
  • Clarify missing GNOME AppIndicator support on Debian (PR #385)
  • Redesigned OG image for vocalinux.com - cleaner, professional, text-based layout (PR #392)
  • Test coverage improvements: mock Notify module, tray degraded-startup, IBus socket-readiness branches (PR #384, #386, #390)
v0.10.1-beta
Beta2026-03-30
  • Bundled package resources to prevent missing system tray icons (fixes #349, PR #354)
  • Stopped recognition before engine switches to prevent segfaults (fixes #350, PR #355)
  • Added a dedicated Close button in Settings for better WM compatibility (fixes #323, PR #356)
  • Preserved XKB layout state during Vocalinux IBus activation (fixes #292, PR #343)
  • Auto-recover speech recognition after system suspend/resume via new D-Bus handler (fixes #367, PR #369)
  • Restart keyboard shortcut backend after resume to keep shortcuts working (PR #371)
  • Delayed keyboard restart to allow USB re-enumeration after resume (PR #372)
  • Fixed premature transcription during push-to-talk silence (fixes #358, PR #359)
  • Disabled copy-to-clipboard by default in Settings (PR #370)
  • Maintenance updates: npm/yarn dependency refresh and brace-expansion dev dependency bump (PR #346, #357)
v0.10.0-beta
Beta2026-03-25
  • Generalized keyboard modifier alias matching across layouts for more reliable shortcuts
  • Audio channel probing now validates device-supported sample rates before selection
  • evdev now handles SYN_DROPPED to prevent stale modifier state
  • IBus engine activation now uses register_component for stronger text-injection startup
  • Settings dialog forces window decorations to prevent missing-titlebar behavior
  • Tray icon refresh now uses icon names for better AppIndicator compatibility
  • Coverage increased to 80%+ with additional IBus launch/main-entry tests
  • Installer and CI polish: latest-tag fallback via GitHub API, Node 24 deploy, and path-filtered workflows
v0.9.0-beta
Beta2026-03-14
  • Left/right modifier key distinction - choose Left Ctrl vs Right Ctrl for your shortcut
  • Sound effects toggle - enable or disable audio feedback from Settings
  • Wayland clipboard fallback - auto-copies text when virtual keyboard injection isn't available
  • Display availability check - graceful error when running in headless environments
  • Fixed unwanted leading space at the start of each new transcription session
  • Fixed shortcut mode (toggle/push-to-talk) not applying on startup
  • Improved Debian/pipx installation guidance and cross-distro error messages
  • Grouped shortcut selector UI - shortcuts organised by Either/Left/Right side
v0.8.0-beta
Beta2026-03-01
  • Push-to-talk shortcut mode (hold to speak, release to stop)
  • Optional voice commands with VOSK auto-enable behavior
  • Improved shortcut mode switching and callback reliability
  • IBus active-method detection before text injection
  • Audio hardware compatibility fixes (sample rate and channel count)
  • Fedora startup dialog stability fix
  • Web SEO expansion and homepage visual refresh
v0.7.0-beta
Beta2026-02-22
  • Autostart on login support (XDG autostart)
  • Tabbed settings dialog (Speech Engine, Recognition, Text Injection, Audio Feedback, General)
  • Intel GPU compatibility detection - auto fallback to CPU for incompatible GPUs
  • Single instance prevention - prevents multiple Vocalinux running
  • Evdev device management - removes disconnected devices to prevent CPU spin
  • Improved GPU detection - avoids false positives on systems without dev libraries
  • IBus fallback - skip setup when daemon not running
  • Fedora dnf check-update fix
  • Web SEO - 7 new optimized pages
v0.6.3-beta
Beta2026-02-19
  • Fixed installer default tag pointing to correct version
  • Added missing psutil dependency for fresh installs
  • Process check and interactive prompts in install/uninstall
  • Removed leading space from first speech transcription
v0.6.2-beta
Beta2026-02-18
  • Interactive backend selection (GPU/CPU)
  • Enhanced welcome message
  • Simplified install commands
  • Better GPU support and Vulkan detection
v0.6.0-beta
Beta2026-02-12
  • whisper.cpp as default engine
  • Multi-language support with auto-detection
  • System tray indicator
  • Full Wayland support
  • IBus text injection engine
v0.5.0-beta
Beta2026-02-06
  • First beta release
  • Stable core functionality
  • Multiple speech engine support
  • Improved text injection
v0.4.1-alpha
Alpha2026-01-29
  • Language selector UI
  • App drawer launch fix
  • Better commit handling
  • Improved update mechanism
v0.4.0-alpha
Alpha2026-01-29
  • Multi-language support (French, German, Russian)
  • Debian 13+ compatibility
  • Python 3.12+ support
  • Tag-based version selection in installer
v0.3.0-alpha
Alpha2026-01-21
  • Initial public alpha
  • Basic speech recognition
  • X11 text injection
  • VOSK engine support

Stay Updated

  • Watch the repository on GitHub for release notifications
  • Re-run the installer to update to the latest version
  • Check the install guide for update instructions