Mastodawn

Google `speech transcription linux`:
1. "What's a good STT tool for Linux?" with a few handwavey "you might try..." comments
2. "I couldn't find a good STT tool for Linux, so I made..." posts

[This](https://github.com/OpenWhispr/openwhispr/issues/310) is a page on how to get started with #OpenWhispr (afaict the most popular tool) on Ubuntu (the most popular #Linux distro).

Will it even run on my distro (POP_OS)? Who knows? I only see guides for getting it to run with Gnome, KDE, or Hyprland(?). When I tried:
1. the model selector stalled before even starting to download (so I restarted the app),
2. Then wouldn't let me paste an API key in
3. then stalled after downloading a model (so I restarted the app),
4. then wouldn't capture my keyboard shortcut,
5. then did capture my keyboard shortcut (but didn't transcribe anything),
6. then did transcribe but left a tiny permanent window

Contrast to Win10's built-in tool: Meta+H start/stops #transcription in whatever app you have open.

FWIW https://handy.computer is the best I've found.

Setup Guide for Ubuntu 24.04 LTS · Issue #310 · OpenWhispr/openwhispr

OpenWhispr on Ubuntu 24.04 (GNOME Wayland) — Setup Guide Problem OpenWhispr's auto-paste doesn't work out of the box on Ubuntu 24.04 due to two issues: ydotoold (the daemon for ydotool) is not runn...

GitHub

Habr Apr 17

Голосовой ввод на русско-английском в 2026: WisprFlow, Handy, OpenWhispr, GigaAM v3 — для диктовки нейросетям и кода

Голосом мы говорим в 2-3 раза быстрее, чем печатаем — это давно известно. Вопрос только в том, умеет ли голосовой ввод разбираться с русско-английской смесью, на которой мы общаемся с LLM и пишем код: «объясни на русском», «открой в Cursor», «проверь, что deploy прошёл». За полгода я перепробовал 5+ приложений и 5 моделей, чтобы найти те, что умеют. Приложения : WisprFlow, SpeakFlow, Handy, OpenWhispr, SuperWhisper — облачные и локальные, платные и open source. Модели : Whisper Large v3, Turbo, GigaAM v3 от Сбера, Canary 1B v2 от NVIDIA, Parakeet V3. Внутри: — Замена облачного WisprFlow на бесплатный open source без потери качества. — Один текстовый промпт, починивший пропадающую пунктуацию в 99% случаев — без LLM-постпроцессоров и задержек. — Мой бенчмарк Whisper Turbo vs Large v3 на RTX 5070 Ti (Vulkan на Blackwell внезапно быстрее CUDA на 50%). — GigaAM v3 и Canary 1B v2 — где конкурируют с Whisper, а где ломают английские слова в кириллицу («Gemini» → «Jemni»). — Первый в моей жизни принятый в main pull request в open source. Актуально на апрель 2026.

https://habr.com/ru/articles/1024634/

#whisper #голосовой_ввод #транскрибация #gigaam #распознавание_речи #openwhispr #cuda #vulkan #superwhisper #нейросети

Голосовой ввод на русско-английском в 2026: WisprFlow, Handy, OpenWhispr, GigaAM v3 — для диктовки нейросетям и кода

Эта статья будет особенно в кассу тем, кто много общается с нейросетями или кодит с ассистентами — Claude, ChatGPT, Cursor, Claude Code. В этих сценариях голосовой набор экономит реально много...

Хабр