GitHub - wealotwang/voice-inpu...
RE: https://bsky.app/profile/did:plc:daexpe52ebb4bwh3ybzyvmkz/post/3mofaimbuqv2k
RE: https://bsky.app/profile/did:plc:daexpe52ebb4bwh3ybzyvmkz/post/3mofaimbuqv2k
Why type your prompts when you can just say them out loud? In this recipe, we'll add voice input to our chat loop by recording audio from the microphone and transcribing it into text with Spring AI.
https://medium.com/@thetalkingapp/spring-ai-recipe-talking-to-ai-837795d7bf9e
dictée: 100% local voice dictation for Linux. Press a key, speak, and the text appears right at your cursor, in any app.
· Wayland (and X11)
· 25+ languages, 4 speech engines (CPU or NVIDIA GPU)
· Speaker diarization for meetings, no length limit on files
· KDE Plasma 6 widget + system tray (GNOME/XFCE too)
· No audio leaves your machine. GPL-3.0.
Built for Plasma 6. cc @kde
https://github.com/rcspam/dictee
#Linux #KDE #FOSS #OpenSource #Wayland #SpeechToText #Privacy #Rust #Gnome #a11y
On the nights I can't get to sleep it is usually because a thought gets in my head and won't shut up.
Tonight I turned my browser back on to check the expected future cataract operation.
Having been told within 18 months, I was thinking about plans to cope for next year. But having moved to noai duckduckgo I just found some real research based on several studies.
It could be 3 months, and about 50% get cataracts that need surgery with 6 months.
I need to go from thinking about planning to doing it, especially finding apps that actually work for dictation.
I've tried a few in the past, but too many are built for American male voices. The last one refused to type 'that' for me even though I have a standard English accent.
My first experience with voice activated cameras for video meeting was a good example. The camera moved to focus on the last man who spoke and completely ignored every woman in the room. And while Dragon worked OK, they no longer do personal licences.
Some people with Australian accents reporting problems which are solved by faking UK or USA accents. 😄
And this speech to text (offline) application seems cool.
Thread : https://lemmy.world/post/47731678
Как мы превращаем звонок риэлтора в карточку лида за 15 секунд: ИИ-автолид изнутри
Риэлтор за рулём. Звонит собственник трёшки на Соколе: “Видел ваше объявление, хочу обсудить продажу”. Двадцать минут живого разговора - район, перепланировка, срочность, вилка по цене. Разговор кончается, риэлтор едет на показ, к вечеру у него ещё пять звонков. Утром он помнит, что “был кто-то по трёшке”, но не помнит ни имени, ни цены, ни телефона. Лид потерян не потому, что плохо отработали, а потому, что между звонком и CRM стоит человек с памятью и руками, которые в этот момент держат руль.
https://habr.com/ru/articles/1044520/
#распознавание_речи #deepgram #llm #speechtotext #crm #автоматизация #пайплайн #недвижимость
UGREEN NAS: AI SPRACHMEMOS – KI-TUTORIAL ZUR IDX-REIHE
https://gadgetchecks.de/ugreen-nas-ai-sprachmemos-ki-tutorial-zur-idx-reihe/
.
.
.
#ugreen #ugreennas #nasync #idx6011pro #aisprachmemos #ki #ai #nas #homelab #transkription #speechtotext #mindmap #audiototext #produktivität #tech #tutorial #aitools #smarthome #netzwerkspeicher #contentcreator