I have zero knowledge of machine learning.

Today I managed to create the speech recognition model for an under-resourced language Nias with the help of Gemini.

I was amazed that the model manages WER rate of 30%!

This is encouraging. I'm thinking to create an Android app with built in Nias speech recognition!

#tech #ai #machinelearning #speechrecognition #nias #whisperai #android

right hand: vibe coding a new tool in gradio to download, convert , and output most streaming media podcasts video audio etc to txt so they can be sent straight into ollama for distillation / left hand : giving my 14 year old 4lb yorkie a neck massage in a triple fleeced blanket on my lap

#VibeCoding #Gradio #OpenSource #Ollama #LLMTools #MediaToText #WhisperAI #AIWorkflow #LocalAI #KnowledgeDistillation #Automation

@borgnetzwerk @Kotoshi @c3voc @C3_LightningTLK @saerdnaer Vielleicht wäre das der Space, wo ich nochmal mit #whisperai die ganze media.ccc nach noch nicht transcribierten videos durchstöbere und automagisch verschriftliche...

Thinking about what you're trying to say is much easier and faster when you don't have to think about how to write it at the same time.
I put that in a transcription tool based off WhisperX to use as a base for what I'm writing, so I'm starting with thousands of words rather than a blank page.

#WhisperAI #CreativeWriting #SpeechToText

Một ứng dụng Voice-to-AI mới đang trong giai đoạn beta sớm, tích hợp công nghệ chuyển giọng nói thành văn bản Whisper, AI của Ollama và chuyển văn bản thành giọng nói (TTS). Ứng dụng này đã có sẵn cho Mac và Windows, phiên bản Linux đang được phát triển.
#AI #VoiceToAI #WhisperAI #Ollama #TTS #Technology #ỨngDụngAI #CôngNghệ #ChuyểnGiọngNóiThànhVănBản

https://www.reddit.com/r/ollama/comments/1on99fw/voicetoai_app_with_whisper_transcription_ollama/

Tin công nghệ: Whisper Large v3 có thể chạy thời gian thực trên MacBook Pro M2 với độ trễ chỉ 350-600ms. Trên iPhone 14 Pro, độ trễ là 650-850ms. Các tối ưu hóa này cũng hoạt động với tất cả các mô hình Whisper. #AI #MachineLearning #AppleNeuralEngine #WhisperAI #CôngNghệAI #AIỨngDụng

https://www.reddit.com/r/LocalLLaMA/comments/1nm0mzw/whisper_large_v3_running_in_realtime_on_a_m2/

@thelinuxEXP I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by @mkiol

It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.

I primarily use #WhisperAI for transcription and Piper for voice, but many other models are available as well.

It is available as flatpak and https://github.com/mkiol/dsnote

#TTS #transcription #TextToSpeech #translator translation #offline #machinetranslation #sailfishos #SpeechSynthesis #SpeechRecognition #speechtotext #nmt #linux-desktop #stt #asr #flatpak-applications #SpeechNote

Bei der heutigen Tooltime ging es um noScribe. Automatisierte Interviewtranskription, datenschutzfreundlich & #opensource , mit sehr viel Potential in Forschung und Lehre. Dementsprechend große Resonanz unter den Kolleg*innen an der Fakultät.
Jetzt bich mal gespannt, von welchen Erfahrungen sie mir in 1/2 Jahr berichten 🙂.
#HigherEducation #qualitativeresearch #whisperai
#PotPlayer is the only video player I know that is powered by #AI. It can use #WhisperAI from #OpenAI to generate subtitles from audio.
I really need like 3 #whisperAI installs, one for my podcasts, one for my writing, and a fast one for dictating messages. I know this can't happen, but... I would really like it to.