ElevenLabs: Audio to Text. New Version

ElevenLabs: Audio to Text. New Version

#AudioToText #FreeTranscription #NoLimits #NoCost #StepByStepGuide #TranscriptionHack #AudioConverter #FreeTools #ContentCreation #ProductivityHacks #TranscriptionTips #FreeSoftware #TimeSavingTips #TechTips #ContentCreators #YouTubeTips #TechHacks #AudioEditing #TranscriptionTutorial #FreeForever
Hey folks
We've actually done an unwritten, off-the-cusp trans voice Friday recording today
We've not listened back to it, because voice dysphoria, but we've added full alt text.
In case you're wondering how we've done that without listening back to it, we've once against used an amazing tool called Subtitle Edit, which has audio to text functionality via the Whisper speech recognition engine.
We used the large-v3 model, which is about 3.1 GB, but gives incredibly accurate transcription.
In case anyone can't access the alt text, we've added the full transcript below too.
#TransVoiceFriday #TransVoice #voice #VoiceFeminisation #VoiceFeminization #VoiceTraining #trans #transgender #TransFem #VoiceDysphoria #SubtitleEdit #PurfviewWhisper #AudioToText #SpeechToText #SpeechRecognition
Hey folks, I know that we haven't done a voice note in forever, and that's been for a multitude of reasons, some of which are related to mental health, some of which are related to work, stress, anxiety, depression, etc, things like that, which comes under mental health anyway, yeah, partly due to poor time management, yay for being AuDHD! But not gonna lie, some of it does come down to underlying voice dysphoria, because this is the best we've managed to get since December 2021. And just for anyone who hasn't heard roughly what we sounded like beforehand, we haven't exactly moved our voice up a lot. I mean, the base level would just be down here. So I can move my voice back up here easily now, and this is the comfortable, this is the default voice. But, um... It's not where I want it to be, it's not in the female range, and I can't easily push the pitch up higher without it sounding wrong. But yeah, there's been a lot of stuff going on recently, um, a lot of bad stuff for everyone, don't want to talk about all of that. But, um, let's just focus on supporting each other, helping each other, um, being kind to ourselves and others right now, and being compassionate and empathetic. That's all I've really got to say. I'm trying to do the same thing with ourselves, but yeah, it's hard sometimes. Anyway, ta-ta for now.
Still amazed that I can convert audio to text on my PC for free with a local version of #OpenAI Whisper. Handles multiple audio & video formats and generates txt, srt & 3 more result files.
GitHub repo: https://github.com/openai/whisper - open-source MIT License!
Python library: https://pypi.org/project/openai-whisper/
Easy-to-follow installation video by Kevin Stratvert: https://www.youtube.com/watch?v=ABFqbY_rmEk
Even easier video for free cloud use via a Colab notebook https://www.youtube.com/watch?v=8SQV-B83tPU
Anyone on Masto have recommendations for uploading audio to somewhere that will then transcribe it for you?
Open Source or Free options preferred.
I created a #Python #automation for transcribing any #YouTube video to text files with language detection. You will get accurate, customizable results while saving time and free of cost. It is very easy to use and can be useful for #content #creators, #researchers, and #educators!
Learn more in my latest blogpost: https://www.javedali.net/post/2023-04-audio-to-text/
With the increasing popularity of online video content, there’s a growing need for transcription services. Transcribing audio from YouTube videos is a common task for content creators, researchers, and educators. It can be useful for generating subtitles, creating transcripts for accessibility, or analyzing spoken content. However, manual transcription is a laborious and time-consuming process, especially when dealing with lengthy or numerous videos. Professional transcription services can be expensive, and automated transcription tools may have limitations in terms of accuracy and language support.