"Trợ lý giọng nói thất bại không phải vì AI kém mà vì thiết kế như câu hỏi thường gặp. Con người muốn tương tác, không phải thẩm vấn. #TrợLýGiọngNói #VoiceBots #AI #ThiếtKế #Interaction #TươngTác"

https://www.reddit.com/r/SideProject/comments/1ovt0ct/voice_bots_fail_not_because_of_bad_ai_but_because/

💸 #OpenAI is going after one of the major pain points of its audio-native models: price. The newest audio model, `gpt-4o-realtime-preview-2024-12-17`, will cost 60% less than its predecessor. #gpt-4o-mini also becomes available through Realtime API, at 10x cheaper cost per token than the old gpt-4o-realtime (10$/M token input, 20$/M tokens output) [3].

[3] https://openai.com/api/pricing/

#GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI

Pricing

OpenAI is an AI research and deployment company. Our mission is to ensure that artificial general intelligence benefits all of humanity.

OpenAI

🚦 #LiveKit released a transformers-based, semantic End-of-Turn detector, #opensource on #HuggingFace[1]! This model complements voice activity detectors (#VAD) by predicting whether the user's sentence is complete. This helps reduce false starts up to 85% according to their own testing, and is text-based, with a very low latency (~50ms). Find all the details in their post [2].

[1] https://huggingface.co/livekit/turn-detector

[2] https://blog.livekit.io/using-a-transformer-to-improve-end-of-turn-detection

#GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI

livekit/turn-detector · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

🏃‍♀️ The competition between text-based voice #bots and audio-native models is just getting tougher! Today, both #OpenAI and #LiveKit released new features, just in time for some holiday experiments 🎁

A thread 👇

#GenAI #VoiceBots #Chatbots #AI #LLMs #Agents #RealtimeAPI