Want to run speech‑AI locally? Learn step‑by‑step how to generate a Hugging Face read token, set up PersonaPlex with NVIDIA models, and export it for offline use. We cover token creation, audio codec (Opus) handling, and quick testing. Boost your open‑source projects with secure access! #HuggingFace #AccessToken #SpeechAI #Opus

🔗 https://aidailypost.com/news/how-create-export-hugging-face-read-token-local-speechai

🚀 Demo mới: hệ thống lồng tiếng video‑2‑video chất lượng cao, hiện hỗ trợ dubbing tiếng Anh → Pháp. Pipeline: TIGER (tách âm), WhisperX (diarization & STT), Mistral_Tower (dịch), CosyVoice3 (TTS). Tiếp tục cải thiện giữ nguyên tone & prosody. Mọi ý kiến, đề xuất đều chào đón!

#AI #SpeechAI #VoiceCloning #Dubbing #CôngNghệ #TríTuệNhânTạo #TinCôngNghệ #AIVietnam #VoiceAI #NhậnDạngGiọng #DịchTựĐộng

https://www.reddit.com/r/LocalLLaMA/comments/1qinq1x/we_have_come_a_long_way_in_voice_prosody_cl

Demo hệ thống lồng tiếng video AI chất lượng cao, hiện hỗ trợ dịch tiếng Anh → Pháp. Pipeline: TIGER (tách âm thanh), WhisperX (diarization + STT), Mistral_Tower (dịch), CosyVoice3 (TTS). Tiếng nói chưa giữ được ngữ điệu sau dịch, sẽ cải thiện. Mong nhận ý kiến! #AI #VoiceCloning #SpeechAI #Dubbing #CôngNghệ #AIÂmThanh

https://www.reddit.com/r/LocalLLaMA/comments/1qinq1x/we_have_come_a_long_way_in_voice_prosody_cloning/

Die Stimme kann biometrisch sein:

Aus dem Sprachsignal lässt sich mehr ableiten als Worte – bis hin zu Gesundheit, Bildung und politischen Präferenzen. Und betroffen sind auch Unbeteiligte, wenn ihre Stimme als Hintergrund in Aufnahmen landet.

Konsequenz: Kommunikation konsequent Ende-zu-Ende verschlüsseln – und keine proprietären Sprachassistenten oder Cloud-Transkription nutzen.

https://www.telepolis.de/article/Privatsphaere-endet-wo-das-Sprechen-beginnt-11145260.html

#Datenschutz #Privatsphäre #SpeechAI #Spracherkennung #Überwachung #KI #Biometrie #Datenminimierung #OnDevice #EUAIAct

Privatsphäre endet, wo das Sprechen beginnt

Computer lesen aus der Stimme bald Gesundheit, Bildung und politische Haltung heraus – selbst wenn man gar nicht direkt mitredet.

heise online
FOSS Advent Calendar - Door 14: Bring Text to Life with Coqui TTS

Meet Coqui TTS, a powerful, open-source deep learning toolkit for cutting-edge Text-to-Speech. It turns written words into natural, expressive audio using state-of-the-art neural models, all while running completely offline on your own machine.

Coqui TTS supports a wide range of languages and voices, and its real strength lies in flexibility: you can use pre-trained models for instant results or train custom voices with your own datasets. Everything happens locally, your data stays private, no APIs or subscriptions required. Whether for accessibility tools, narration, creative projects, or research, Coqui gives you full control over synthetic speech, from tone and pace to emotional delivery.

Pro tip: Experiment with voice cloning or fine-tune a model for a unique vocal character. With Coqui, you’re not just generating speech you’re crafting it.

Link: https://github.com/coqui-ai/TTS

What would you create with open-source, local TTS-audiobooks, game dialogue, or your own custom assistant voice?

#AdventCalendar #AI #OpenSource #TTS #Python #MachineLearning #CoquiTTS #AIVoices #VoiceSynthesis #LocalAI #FOSS #Privacy #Accessibility #TextToSpeech #CreativeTech #VoiceTech #DeepLearning #ArtificialIntelligence #TechNerds #Innovation #FOSSAdvent #ContentCreation #EthicalAI #VoiceCloning #DevTools #FutureTech #AITools #SpeechAI #linux #ki #adventskalender

Mô hình giọng nói AI của Sesame gây ấn tượng với khả năng biểu cảm, đối thoại tự nhiên và thông minh vượt trội so với Moshi, dù cả hai dùng công nghệ nền tảng tương tự (Mimi, Llama). Cộng đồng đang tìm hiểu điều gì đã tạo nên bước nhảy vọt này: dữ liệu huấn luyện, hàm mất mát, kiến trúc, tích hợp LLM hay quy trình tổng thể?

#AI #SpeechAI #TextToSpeech #SesameAI #MoshiAI #LLM #Technology #TríTuệNhânTạo #GiọngNóiAI #CôngNghệ #MôHìnhNgônNgữ

https://www.reddit.com/r/LocalLLaMA/comments/1paj990/why

Mô hình giọng nói của Sesame được đánh giá là cảm xúc, tự nhiên và thông minh vượt trội so với Moshi, dù cả hai đều dựa trên công nghệ tương tự (Mimi, Llama). Cộng đồng đang tìm hiểu lý do cho sự khác biệt lớn này: liệu có phải do dữ liệu huấn luyện, hàm mục tiêu, kiến trúc, tích hợp LLM hay kỹ thuật hệ thống?

#AI #SpeechAI #Sesame #Moshi #LLM #MachineLearning
#TríTuệNhânTạo #MôHìnhGiọngNói #HọcMáy #XửLýNgônNgữTựNhiên

https://www.reddit.com/r/LocalLLaMA/comments/1paj990/why_does_sesames_speech

Meta’s new Omnilingual ASR model drops character error rates below 10 % for 78 % of the 1,600 languages it was tested on – a huge leap for low‑resource, under‑represented tongues. The system leverages in‑context learning and is released under Creative Commons, inviting the community to build on it. Read the full benchmark details! #OmnilingualASR #SpeechAI #LowResource #UnderrepresentedLangs

🔗 https://aidailypost.com/news/metas-omnilingual-asr-hits-sub10-error-78-1600-languages

As of today, my computer can __nicely__ read aloud for me !

I'm lazy, i read slowly, so i don't like reading, i skip a lot of articles

I have been looking for a solution for several months

#Accessibility #A11y #Orca #WebBrowser #ZenBrowser #Firefox #Piper #Pied #SpeechAI #AI #Nix #NixOS

Yesterday, I ordered food online. However it went a little off. And I contacted Support. They called me and for one moment, I thought it's a bot or recorded voice or something. And I hated it. Then I realized it's a human on the line.

I was planning to do an LLM+TTS+Speech Recognition and deploy it on A311D. To see if I can practice british accent with it. Now I'm rethinking about what I want to do. This way we are going, it doesn't lead to a good destination. I would hate it if I would have to talk to a voice enabled chatbot as support agent rather than a human.

And don't get me wrong. Voice enabled chatbots can have tons of good uses. But replacing humans with LLMs, not a good one. I don't think so.

#LLM #AI #TTS #ASR #speechrecognition #speechai #ML #MachineLearning #chatbot #chatbots #artificialintelligence