Mastodawn

The Keyboard Isn't Dead

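This article emphasizes that voice input does not replace the keyboard; the two are separate interfaces that support different cognitive modes. The keyboard is slow, but it acts as a 'thinking constraint' that is essential for structuring and editing thought, while voice input suits rapid idea exploration and interaction with LLMs. The trust problem with voice input stems from cloud-based processing, and advances in on-device speech recognition now make it possible to address privacy and latency concerns. Voice and keyboard therefore coexist, each suited to its own use, and in collaboration with AI, voice input is becoming a new tool for extending thought.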

https://www.getvoibe.com/resources/voicepilling-keyboard-isnt-dead/

#voiceinput #llm #ondeviceai #aidevelopment #humancomputerinteraction

The Keyboard Isn't Dead. But Voicepilling Won't Work Until Voice Goes Local.

An honest reply to The Guardian on voicepilling: typing is a thinking tool, voice is a different mode, and only on-device voice fixes the trust problem.

Voibe Resources

sayzard May 8

Show HN: NPM Package that fills forms via voice using Gemini Live API

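audio-forms is an open-source component that uses the Gemini Live API to fill in forms by voice in React applications. When the user presses the mic button and speaks in natural language, the server receives the audio data and forwards it to the Gemini API, which fills in the fields in real time. The API key is stored only on the server, which improves security, and a double-check mode is supported in which the model confirms the spelling of sensitive fields such as name and email before entering them. It integrates easily into React apps and also offers an adjustable thinking level for handling complex input.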

https://www.npmjs.com/package/audio-forms

#react #voiceinput #geminiapi #formfilling #speechtotext

audio-forms

Fill forms with voice using Gemini Live API. Latest version: 0.1.0, last published: 6 minutes ago. Start using audio-forms in your project by running `npm i audio-forms`. There are no other projects in the npm registry using audio-forms.

npm

Efstathios Iosifidis, DVM Mar 31

🤖✨ New in the @ONLYOFFICE AI Plugin (free, works with local/open LLMs):

✅ AI Chat: makes changes directly in the document (formatting, summaries, forms)
✅ Voice input: spoken commands (web version)

🔗 https://www.onlyoffice.com/blog/el/2026/03/ai-plugin-interact-with-documents-via-chat

Supports: Ollama, DeepSeek, Mistral, OpenAI, Anthropic & more.

#ONLYOFFICE #FOSS #AI #SelfHosted #VoiceInput #DocumentEditing #OpenSource #Tech

AI Plugin: Chat & Voice Input | ONLYOFFICE Blog

The ONLYOFFICE AI plugin has been upgraded! Interact with documents via chat, give voice commands, create forms, and save time.

ONLYOFFICE Blog

sayzard Mar 20

AshutoshShrivastava (@ai_for_success)

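The Gemini app has been updated so that recording no longer cuts off when you pause briefly while using the mic button. The key point is that speaking flows more naturally during voice input, which improves usability.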

https://x.com/ai_for_success/status/2034807287681106179

#gemini #google #aiapp #voiceinput

AshutoshShrivastava (@ai_for_success) on X

The Gemini app just got another solid update. While using the mic button, it won’t cut off when you pause.

X (formerly Twitter)

sayzard Mar 20

Josh Woodward (@joshwoodward)

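Gemini voice input on Android has been fixed so that the conversation no longer cuts off when you pause briefly while speaking. The same improvement is due on iOS within a few weeks, improving the usability of voice input.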

https://x.com/joshwoodward/status/2034797067344998862

#gemini #android #voiceinput #aiassistant

Josh Woodward (@joshwoodward) on X

✅ Papercut fixed: Gemini won’t cut you off if you pause while talking on Android anymore. (iOS in a few weeks!) So next time you hit the mic icon, feel free to pause, take a breath, or ramble. No more anxiety to speak it all out before @GeminiApp jumps in prematurely.

X (formerly Twitter)

Wolf Mar 1

I have found voice input to be super helpful. I use it entirely for prose: notes, tasks, and reminders. I **don't** use it to make the machine "do things" or for coding (but that's just me ... you do you; and also, I don't use it for coding **yet**).

I currently use two tools:

* Spokenly (phone + laptop)
* Just Press Record (watch + phone)

Each has advantages. Both produce transcriptions which I then "route" to the appropriate place (Things, Obsidian, Fantastical) with Sharing.

Just Press Record also runs on my watch, so I can use it while driving or other situations where the phone is inconvenient, illegal, or unavailable. I can later look at a list and handle them one at a time or in bulk.

Spokenly is significantly more accurate. You pick the underlying model, so you can decide if you want local-only (as I do, so no fees of any kind), size (controls memory used and speed of transcription, at the cost of accuracy), and what spoken languages it knows. You can switch at will; and you don't have to have the same model on your phone as on your laptop. You typically handle results in Spokenly immediately (for a while I thought this was the only choice), but they are saved and you can look at them all in History and deal with them from there.

I'm still playing with which models are the best balance of speed and accuracy for my use case. On my phone I'm using "Distil-Whisper Small (English Only)". On my Mac, the same but "Medium".

This doesn't **sound** like a huge win. I certainly didn't expect much when I decided to try it. But it turns out to punch far above its weight.

#VoiceInput #Spokenly #JustPressRecord #Things #Obsidian #Fantastical #Productivity

sayzard Jan 29

TestingCatalog News (@testingcatalog)

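Google is reportedly developing a 'Deep Design' mode for Stitch, along with a potential 'Agent Manager' and voice input. These features are part of a UI overhaul and are at a very early stage; the changes appear aimed at strengthening design workflows and integrating agent management and voice interaction.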

https://x.com/testingcatalog/status/2016941342522212798

#google #stitch #agentmanager #voiceinput #ui

TestingCatalog News 🗞 (@testingcatalog) on X

BREAKING 🚨: Google is working on "Deep Design" mode for Stitch, along with a potential "Agent Manager" and voice input. These features are a part of the upcoming UI overhaul and are in the very early stages. What else? 👀

X (formerly Twitter)

Reddit Tech VN Bot Jan 23

🚀 Introducing Variables, a personal tracking app for mobile that combines a spreadsheet and a database. Log mood, sleep, habits, and more with 6 data types (number, text, time, date, boolean, enum). Features: real SQL queries, AI that generates queries from natural language, voice input, and saved “dataclips” for quick viewing. Try it now! #VariablesApp #SQL #AI #VoiceInput #Mobile #ứngdụng #phântích #dữliệu

https://www.reddit.com/r/SideProject/comments/1qkqv8d/i_didnt_like_using_spreadsheets_on_my_iphone_so_i/

quangobaud Dec 21

How Interesting!

#Google #Android #VoiceInput #AndroidSystemIntelligence
#WithoutMyConsent

sprungmarkers (she / her) Mar 7, 2025

great interview #coding with #VoiceInput #a11y
From: @fridayfrontend
https://hachyderm.io/@fridayfrontend/114122804562721954

Friday Front-End (@fridayfrontend@hachyderm.io)

Today's lunch video is "What if you suddenly couldn't type anymore?" - "Salma recently developed excruciating pain in her hands, and she was kind enough to join me so we could learn what it's been like, and some of the tools she uses to continue writing code." #a11y https://www.youtube.com/watch?v=QYkjgd6_s4o

Hachyderm.io