With its Speech Engine, ElevenLabs adds a new audio layer to text-based chatbots.

The solution is implemented through an SDK and uses the WebSocket protocol, leaving existing LLM and RAG architectures unchanged. It offers transcription in 90 languages, speech-pause detection, and automatic interruption handling.

#ElevenLabs #SpeechEngine #Voicebots #WebRTC #AIGeneratedImage

https://www.all-ai.de/news/news26top/elevenlabs-chatbot-voicebot

ElevenLabs turns chatbots into voicebots with a single prompt

The new Speech Engine equips existing chat agents with natural-sounding speech output with little effort.

All-AI.de

"Voice assistants fail not because the AI is bad, but because they are designed like an FAQ. People want interaction, not interrogation. #TrợLýGiọngNói #VoiceBots #AI #ThiếtKế #Interaction #TươngTác"

https://www.reddit.com/r/SideProject/comments/1ovt0ct/voice_bots_fail_not_because_of_bad_ai_but_because/

💸 #OpenAI is going after one of the major pain points of its audio-native models: price. The newest audio model, `gpt-4o-realtime-preview-2024-12-17`, will cost 60% less than its predecessor. `gpt-4o-mini` also becomes available through the Realtime API, at one-tenth the per-token cost of the old gpt-4o-realtime ($10/M input tokens, $20/M output tokens) [3].

[3] https://openai.com/api/pricing/

#GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI
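
To make the pricing figures above concrete, here is a minimal Python sketch of the arithmetic. It only uses the numbers quoted in the post (60% reduction, $10/M input and $20/M output tokens); the function names and the session token counts are hypothetical, chosen purely for illustration, and current rates should be checked against [3].

```python
def apply_discount(price: float, discount: float) -> float:
    """Return the price after a fractional discount (0.60 means 60% less)."""
    return price * (1.0 - discount)


def session_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Cost in dollars for one session, given per-million-token rates."""
    return (input_tokens / 1_000_000) * input_per_m \
        + (output_tokens / 1_000_000) * output_per_m


# Rates quoted in the post, in dollars per million tokens.
QUOTED_INPUT_PER_M = 10.0
QUOTED_OUTPUT_PER_M = 20.0

# Hypothetical session size, not taken from the post.
cost = session_cost(50_000, 20_000, QUOTED_INPUT_PER_M, QUOTED_OUTPUT_PER_M)
print(f"Session cost at quoted rates: ${cost:.2f}")

# A 60% price cut leaves 40% of the original price.
print(f"$100 after a 60% cut: ${apply_discount(100.0, 0.60):.2f}")
```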

🚦 #LiveKit released a transformer-based, semantic end-of-turn detector, #opensource on #HuggingFace [1]! This model complements voice activity detectors (#VAD) by predicting whether the user's sentence is complete. According to their own testing, this helps reduce false starts by up to 85%. The model is text-based and runs with very low latency (~50 ms). Find all the details in their post [2].

[1] https://huggingface.co/livekit/turn-detector

[2] https://blog.livekit.io/using-a-transformer-to-improve-end-of-turn-detection

#GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI
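
As a rough illustration of how a text-based end-of-turn score might complement a VAD, here is a minimal Python sketch. It is not LiveKit's API or implementation: the function name, the thresholds, and the idea of switching between a short and a long silence timeout are all assumptions made for this example. The end-of-turn probability is taken as an input and would come from whatever model you use.

```python
def should_end_turn(silence_ms: float, eot_probability: float, *,
                    threshold: float = 0.5,
                    short_wait_ms: float = 300.0,
                    long_wait_ms: float = 2000.0) -> bool:
    """Decide whether the agent may start speaking.

    silence_ms:       how long the VAD has reported silence (hypothetical input).
    eot_probability:  model's estimate that the user's sentence is complete.
    All default values are illustrative assumptions, not tuned settings.
    """
    if eot_probability >= threshold:
        # The sentence looks complete: a short pause is enough.
        return silence_ms >= short_wait_ms
    # The sentence looks unfinished: wait longer before taking the turn,
    # which avoids cutting in during a mid-sentence pause.
    return silence_ms >= long_wait_ms


# Same 400 ms pause, different outcomes depending on the semantic score.
print(should_end_turn(400, 0.9))   # complete sentence, agent may respond
print(should_end_turn(400, 0.1))   # likely mid-sentence, keep listening
```

A pure VAD would treat both 400 ms pauses the same way; the semantic score is what lets the second one be held back.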
