I have found voice input to be super helpful. I use it entirely for prose: notes, tasks, and reminders. I **don't** use it to make the machine "do things" or for coding (but that's just me ... you do you; and also, I don't use it for coding **yet**).

I currently use two tools:

* Spokenly (phone + laptop)
* Just Press Record (watch + phone)

Each has advantages. Both produce transcriptions which I then "route" to the appropriate place (Things, Obsidian, Fantastical) with Sharing.

Just Press Record also runs on my watch, so I can use it while driving or other situations where the phone is inconvenient, illegal, or unavailable. I can later look at a list and handle them one at a time or in bulk.

Spokenly is significantly more accurate. You pick the underlying model, so you can decide if you want local-only (as I do, so no fees of any kind), size (controls memory used and speed of translation, at the cost of accuracy), and what spoken languages it knows. You can switch at will; and you don't have to have the same model on your phone as on your laptop. You typically handle results in Spokenly immediately (for a while I thought this was the only choice), but they are saved and you can look at them all in History and deal with them from there.

I'm still playing with which models are the best balance of speed and accuracy for my use case. On my phone I'm using "Distil-Whisper Small (English Only)". On my Mac, the same but "Medium".

This doesn't **sound** like a huge win. I certainly didn't expect much when I decided to try it. But it turns out to punch far above its weight.

#VoiceInput #Spokenly #JustPressRecord #Things #Obsidian #Fantastical #Productivity

TestingCatalog News (@testingcatalog)

Google이 Stitch용 'Deep Design' 모드와 잠재적 'Agent Manager', 음성 입력 기능을 개발 중이라는 소식입니다. 이 기능들은 UI 개편의 일부로 아주 초기 단계에 있으며, 디자인 워크플로우 강화와 에이전트 관리·음성 인터랙션 통합을 목표로 하는 변화로 보입니다.

https://x.com/testingcatalog/status/2016941342522212798

#google #stitch #agentmanager #voiceinput #ui

TestingCatalog News 🗞 (@testingcatalog) on X

BREAKING 🚨: Google is working on "Deep Design" mode for Stitch, along with a potential "Agent Manager" and voice input. These features are a part of the upcoming UI overhaul and are in the very early stages. What else? 👀

X (formerly Twitter)

🚀 Giới thiệu Variables – app theo dõi cá nhân trên di động, kết hợp bảng tính + CSDL. Ghi nhật ký mood, giấc ngủ, thói quen… với 6 kiểu dữ liệu (số, văn bản, thời gian, ngày, boolean, enum). Đặc điểm: truy vấn SQL thực, AI tạo câu lệnh từ ngôn ngữ tự nhiên, nhập liệu bằng giọng nói, lưu “dataclips” để xem nhanh. Thử ngay! #VariablesApp #SQL #AI #VoiceInput #Mobile #ứngdụng #phântích #dữliệu

https://www.reddit.com/r/SideProject/comments/1qkqv8d/i_didnt_like_using_spreadsheets_on_my_iphone_so_i/

Friday Front-End (@[email protected])

Today's lunch video is "What if you suddenly couldn't type anymore?" - "Salma recently developed excruciating pain in her hands, and she was kind enough to join me so we could learn what it's been like, and some of the tools she uses to continue writing code." #a11y https://www.youtube.com/watch?v=QYkjgd6_s4o

Hachyderm.io
Google Patents AR Glasses Assistant That Adapts Suggestions Based on Gaze

Google has been granted a patent that describes an automated assistant for smart glasses that adapts suggestions based on the user's gaze or verbal instructions. Although Google abandoned its 'Project Iris' AR smart glasses, it may develop similar hardware for OEM partners, it is uncertain whether Google will launch a pair of AR glasses based on this patent.

Gadgets 360

App tipp for Android:
*Futo Voice Input* - "the Voice Input app for Android that respects your privacy."

https://voiceinput.futo.org/

It's a voice input app that does voice recognition completely on your phone, so your spoken words never leave it.
Depending on chosen recognition quality it might be slower than GAFAM solutions. But well...

Install the APK from their site or add their repo to your F-Droid client (Neo Store recommended here).

#Android #App #Tipp #VoiceInput #Privacy #Futo #FDroid

FUTO Voice Input

FUTO Voice, The Voice Input app that respects your privacy

Тестирование `Voice Input` с `OpenBoard`

#android #speechtotext #VoiceInput

Начальная заметка о `Voice Input` по ссылке ниже.

https://mastodon.ml/@ashed/111487118420651009

Who Let The Dogs Out 🐾 (@[email protected])

Вложение: 2 изображения **Voice Input** #android #texttospeech #VoiceInput Движок голосового ввода speech-to-text с несколькими моделями. Движок неплохо обучен, дружит (правда не всегда) со знаками препинания и поддерживает мультиязычный ввод. Работает с любой клавой, имеющей функцию голосового ввода, я тестировал с OpenBoard. Доступ в сеть нужен только для загрузки языковых пакетов, после этого сеть можно отрубить любым файрволом. Код проекта: https://gitlab.futo.org/alex/voiceinput Репозиторий F-Droid https://app.futo.org/fdroid/repo?fingerprint=39D47869D29CBFCE4691D9F7E6946A7B6D7E6FF4883497E6E675744ECDFA6D6D

Mastodon.ml

**Voice Input**

#android #speechtotext #VoiceInput

Движок голосового ввода `speech-to-text` с несколькими моделями.
Движок неплохо обучен, дружит (правда не всегда) со знаками препинания и поддерживает многоязычный ввод.
Работает с любой клавиатурой, имеющей функцию голосового ввода, тестировался с OpenBoard.
Доступ в сеть нужен только для загрузки языковых пакетов, после этого доступ в сеть для `Voice Input` можно отключить любым файрволом, например AFWall (должен быть заранее установлен).

Код проекта:
https://gitlab.futo.org/alex/voiceinput

Репозиторий F-Droid
https://app.futo.org/fdroid/repo?fingerprint=39D47869D29CBFCE4691D9F7E6946A7B6D7E6FF4883497E6E675744ECDFA6D6D

Продолжение заметки о `Voice Input ` по ссылке ниже.

https://mastodon.ml/@ashed/111487128209285726

Aleksandras Kostarevas / VoiceInput · GitLab

FUTO Voice Input application for Android

GitLab

Ever wanted an offline, open-source voice input solution for de-googled android phones? Well now you can:
https://gitlab.futo.org/alex/voiceinput
https://voiceinput.futo.org/

The results are surprisingly good. Awesome to see that usable language models that aren't controlled by big tech companies are finally popping up!

#FUTO #voiceinput #voicerecognition #speechtotext #Android #opensource

PS: It's open source, but it's still considered paid software. So please honor that and pay the devs if you use the tool. :)

Aleksandras Kostarevas / VoiceInput · GitLab

FUTO Voice Input application for Android

GitLab