A really nice tool, especially if you want English and Chinese voice transcription. I need to wait for the downloads to finish to get a real impression, but the first impression is good. #speechRecognition #STT #SpeechToText #voicetyping

GitHub - wealotwang/voice-inpu...

RE: https://bsky.app/profile/did:plc:daexpe52ebb4bwh3ybzyvmkz/post/3mofaimbuqv2k


Why type your prompts when you can just say them out loud? In this recipe, we'll add voice input to our chat loop by recording audio from the microphone and transcribing it into text with Spring AI.

https://medium.com/@thetalkingapp/spring-ai-recipe-talking-to-ai-837795d7bf9e

#SpringAI #Java #SpeechToText #VoiceAI #GenAI
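
A rough idea of what the recording half of such a recipe could look like, using only the JDK's `javax.sound.sampled` API. This is a sketch under assumptions, not the article's code: the class name `VoiceInputSketch`, the `Transcriber` interface, and the 16 kHz mono format are all made up for illustration, and the actual Spring AI transcription call is left behind that hypothetical interface.

```java
import javax.sound.sampled.AudioFileFormat;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;
import javax.sound.sampled.LineUnavailableException;
import javax.sound.sampled.TargetDataLine;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

final class VoiceInputSketch {

    // Assumption: 16 kHz, 16-bit, mono, signed, little-endian PCM.
    static final AudioFormat FORMAT = new AudioFormat(16_000f, 16, 1, true, false);

    /** Hypothetical stand-in for the transcription step (Spring AI in the recipe). */
    interface Transcriber {
        String transcribe(byte[] wavBytes);
    }

    /** Records raw PCM from the default microphone for roughly the given duration. */
    static byte[] recordPcm(long millis) throws LineUnavailableException {
        try (TargetDataLine line = AudioSystem.getTargetDataLine(FORMAT)) {
            line.open(FORMAT);
            line.start();
            ByteArrayOutputStream pcm = new ByteArrayOutputStream();
            byte[] buffer = new byte[4096]; // a whole number of 2-byte frames
            long deadline = System.currentTimeMillis() + millis;
            while (System.currentTimeMillis() < deadline) {
                int n = line.read(buffer, 0, buffer.length);
                pcm.write(buffer, 0, n);
            }
            line.stop();
            return pcm.toByteArray();
        }
    }

    /** Wraps raw PCM bytes in a WAV container, entirely in memory. */
    static byte[] toWav(byte[] pcm) {
        long frames = pcm.length / FORMAT.getFrameSize();
        try (AudioInputStream in =
                 new AudioInputStream(new ByteArrayInputStream(pcm), FORMAT, frames)) {
            ByteArrayOutputStream wav = new ByteArrayOutputStream();
            AudioSystem.write(in, AudioFileFormat.Type.WAVE, wav);
            return wav.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Converts PCM to WAV and hands it to whichever transcriber is plugged in. */
    static String transcribePcm(byte[] pcm, Transcriber transcriber) {
        return transcriber.transcribe(toWav(pcm));
    }
}
```

In a chat loop, something like `VoiceInputSketch.transcribePcm(VoiceInputSketch.recordPcm(5000), transcriber)` would take the place of reading a typed line, with `transcriber` delegating to whatever speech-to-text model the application has configured.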

dictée: 100% local voice dictation for Linux. Press a key, speak, and the text appears right at your cursor, in any app.

· Wayland (and X11)
· 25+ languages, 4 speech engines (CPU or NVIDIA GPU)
· Speaker diarization for meetings, no length limit on files
· KDE Plasma 6 widget + system tray (GNOME/XFCE too)
· No audio leaves your machine. GPL-3.0.

Built for Plasma 6. cc @kde

https://github.com/rcspam/dictee

#Linux #KDE #FOSS #OpenSource #Wayland #SpeechToText #Privacy #Rust #Gnome #a11y

On the nights I can't get to sleep it is usually because a thought gets in my head and won't shut up.
Tonight I turned my browser back on to look up how soon I can expect my cataract operation.

Having been told it would be within 18 months, I had been thinking about plans to cope next year. But having moved to noai duckduckgo, I just found some real research based on several studies.
It could be 3 months, and about 50% get cataracts that need surgery within 6 months.
I need to go from thinking about planning to doing it, especially finding apps that actually work for dictation.

I've tried a few in the past, but too many are built for American male voices. The last one refused to type 'that' for me even though I have a standard English accent.

My first experience with voice-activated cameras for video meetings was a good example. The camera moved to focus on the last man who spoke and completely ignored every woman in the room. And while Dragon worked OK, they no longer do personal licences.

#SpeechToText

Some people with Australian accents are reporting problems, which they solve by faking UK or US accents. 😄

And this speech to text (offline) application seems cool.  

👉 https://handy.computer 👈

Thread : https://lemmy.world/post/47731678

#speech2text #opensource #language #speechtotext

Handy

Handy is a cross platform, open-source, speech-to-text application for your computer

GitHub - Melvynx/Parler: A free, open source, and extensible speech-to-text application that works completely offline.

I've just added a very early preview of “meetings” to Dictator on the Mac (dictator.robgough.net). Right now, the recommendation is 32GB RAM, but with that you can get fully offline meeting summaries. It’s still going to need tuning, but it works quite well! #buildinpublic #macos #speechtotext

How we turn a realtor's call into a lead card in 15 seconds: inside the AI auto-lead

A realtor is at the wheel. The owner of a three-room apartment in Sokol calls: "I saw your listing and want to discuss a sale." Twenty minutes of live conversation: the neighborhood, the layout changes, the urgency, the price range. The call ends, the realtor drives to a showing, and by evening there are five more calls. In the morning they remember that "there was someone about a three-room apartment," but not the name, the price, or the phone number. The lead is lost not because it was handled badly, but because between the call and the CRM stands a person with a memory and hands that, at that moment, are holding the steering wheel.

https://habr.com/ru/articles/1044520/

#распознавание_речи #deepgram #llm #speechtotext #crm #автоматизация #пайплайн #недвижимость

UGREEN NAS: AI Voice Memos (AI Sprachmemos), an AI tutorial for the iDX series - gadgetChecks - Apple & Smart Home!

UGREEN NAS AI Voice Memos on the iDX 6011 Pro explained: automatically transcribe voice recordings, summarize the content, and use AI to structure it as a mind map.