Local voice input: Whisper + Ollama in Python

I needed voice input. Not dictation in Google Docs, not a cloud API, but something simple: hold down a key, speak, release, and the text appears in the active window. Locally, without sending audio anywhere. I couldn't find a ready-made solution that suited me right away, so I built my own. Maybe someone will find it useful.

https://habr.com/ru/articles/1009538/

#whisper #ollama #speechtotext #voicetotext #pushtotalk #голосовой_ввод #python #localfirst #privacy

People have been raving to me about how good voice-to-text has become, so I'm trying Wispr Flow. But the user experience feels a bit off: I have to stop recording to get it to type out the text? Not really "hands-free", is it?
Is that just how it works, or is there a continuous dictation mode I'm missing? Are there other LLM-based tools that behave more like classic dictation?
Looking forward to your recommendations!

#VoiceToText #AI #Productivity

SoundVibes 0.2.0 is out!

- It turns out I'm not the only one who needs to handle multiple languages without the delay of autodetection.
- You can now toggle specific languages to transcribe.

Still a modest number of users, but ⭐ and downloads are both rising.

#voicetotext made easy on #linux!

https://github.com/kejne/soundvibes/releases/tag/v0.2.0

Release v0.2.0 · kejne/soundvibes

Breaking changes: the model and language loading has been overhauled, separating model loading in the daemon from language triggering by the client. Check the docs for the updated conf...

GitHub

🎤✨ Wispr Flow is coming to Android and I'm HYPED! 🚀

Say goodbye to typing and hello to pure voice magic ✍️➡️🎤 This game-changing AI tool is launching Feb 12 and honestly, I can't wait!

Move up the waitlist and get early access: wisprflow.ai/waitlist?ADONTAI1

Drop your referral link with mine and we'll both climb the ranks together! 💪

#WisperFlow #Android #VoiceToText #AI #Tech #Innovation #Productivity #MustHave #EarlyAccess #JoinTheWaitlist #androidapp #aitools #productivity #VoiceTech

This voice application gives Google less access to your data

https://peertube.gravitywell.xyz/w/j7WbKGUYzBJvMkKxrF6hWG

How FUTO projects loosen Google's grip on your life!

https://peertube.gravitywell.xyz/w/phdodopg3KuJTWtwp6S3Vw

Happy to see some issues filed already! 😀

I managed to publish #soundvibes just before coming down with a horrible cold, only to "get back" to two useful feature requests and a bug report (which someone already seems to be drafting a PR for).

Grateful to see #opensource contributions so fast and happy that my tool fills a need! 🙌

https://github.com/kejne/soundvibes

#voicetotext #linux

Remember me posting about running my agent while marking footballs during the weekend?

Well, now it's time to share the results!

An open source voice-to-text application for Linux that lets you trigger speech capture with a hotkey and input the transcribed text wherever your cursor is!

I was annoyed by the complexity of the tools that were available so I created one which comes as a single binary, written in Rust.

Check it out:
https://soundvibes.teashaped.dev/

Wrote a blog post about the creation of it:
https://www.teashaped.dev/blog/soundvibes-vibe-coding/post/

#linuxvtt #voicetotext #vibecoding #opensource #FAAFO

DeepFlo: a tool for writing and editing text by voice that works on every platform (email, Slack, VS Code, ...). Looking for macOS beta testers to "roast" it and give candid feedback. Questions to answer: Is the value immediately clear? How does it differ from basic dictation? Who is the real target user? If you'd like to try it, leave a comment!

#BetaTest #VoiceToText #SaaS #DeepFlo #CôngNghệ #KiểmThử #AI #ỨngDụng #TríTuệNhânTạo

https://www.reddit.com/r/SaaS/comments/1qo4p08/roast_my_web_app_looking_for_early_beta_te

🗣️ New tool: "Unchained Vibes for Claude", a Chrome extension that lets you talk to Claude by voice and automatically converts speech to text. Features: pausing to think, taking a screenshot with a spoken command, and activation by a special phrase. Handy for anyone tired of typing! #AI #Claude #VoiceToText #ChromeExtension #CôngNghệ #TiếngNói #TríTuệNhânTạo

https://www.reddit.com/r/SideProject/comments/1qnel0w/claude_a_different_take_on_voice_to_text/