The Audience Nobody Saw

디즈니의 새 CEO가 감성적 스토리텔링을 강조하는 가운데, 장애인 접근성에 대한 주주는 개선 요구를 거부했다. 아마존 프라임 비디오는 고전 영화에 대해 AI 음성 합성 기반 오디오 설명(AD)을 제공하지만, 비영어권 영화나 일부 명작에는 AD가 없어 접근성 표준과 커뮤니티 참여가 부재함을 드러낸다. AI 음성 데이터 훈련에 사용된 인간 음성에 대한 동의 및 보상 체계가 마련되지 않은 상황에서, AI 음성 복제 관련 법적 분쟁은 유명인 중심으로만 진행되고 있다. 이는 AI 접근성 도구 개발과 문화 콘텐츠 접근성에 있어 구조적 배제와 불평등 문제를 시사한다.

https://fromthelittoral.substack.com/p/the-audience-nobody-saw

#audiodescription #accessibility #amazonpolly #aivoice #mediainclusion

The Audience Nobody Saw

On whose experience the industry decided was worth preserving.

From The Littoral | Macy Lao | Substack

WP A DAY es el podcast de Blogpocket hecho con #IA. A partir de una selección manual de los artículos más interesantes publicados durante la semana, interactuamos con #ChatGPT y creamos automáticamente los guiones para llevarlos a #AmazonPolly y generar el audio.

Esta semana:

- #WordPress 6.5 será lanzado el 26 de marzo de 2024
- #Gutenberg 17.2
- WordPress 6.4.2 ya está disponible

https://www.blogpocket.com/podcast/wp-a-day-18-wordpress-6-5-sera-lanzado-el-26-de-marzo-de-2024-gutenberg-17-2-wordpress-6-4-2-ya-esta-disponible/

WP A DAY #18: WordPress 6.5 será lanzado el 26 de marzo de 2024; Gutenberg 17.2; WordPress 6.4.2 ya está disponible

WP A DAY es un podcast con las últimas noticias, trucos e ideas sobre WordPress, hecho con IA.

Blogpocket

WP A DAY is the experimental podcast I’m creating using AI. The audio construction process (in Spanish and English) has two parts. In the first, through a development in PHP and from a file that contains the RSS files of different blogs that usually publish news about WordPress, a text is obtained as a script for each episode. And, in the second, that text is transformed into audio.

The second part involves the use of Amazon Polly, which gives more than acceptable naturalness in speech.

The first part is where the magic is. And from the first versions of the code, very basic, to the one I obtained this weekend – a little more elaborate – I have approached the problem in several different ways.

In the image, you can see the first step: the creation of a web form where you can choose the summaries that will be included in the podcast script. I had to try many options until I found the one that seemed most practical to me: the form already contains the summary obtained by ChatGPT and they are already recorded in a file. This allows me to run the RSS information extraction again and interact with ChatGPT again, if necessary, before selecting the posts that will go to the script.

Of the entire process, the critical part is in the generation of the summaries. And I think the result is already convincing because I am able to iterate – as many times as necessary – with ChatGPT, running a script to obtain the summaries and, consequently, the form for selecting the articles that will go into the script.

Naturally, this version 1.0 of the code can be improved but the important thing is that I have achieved a consistent and effective process.

Look, for example, at an example of a generated script:

Now I must optimize the text of the scripts, generating introductions and goodbyes; more connecting phrases, in a more natural way. In this section, ChatGPT does not intervene and I do it using arrays with phrases and the random access instruction. But, I have to think about it to see if ChatGPT could improve it.

The 3,000 character limitation that Amazon Polly imposes to avoid having to use S3 storage does not worry me at the moment.

https://acambronero.wordpress.com/2023/11/05/primeros-resultados-convincentes-en-mi-aplicacion-para-crear-un-podcast-con-ia/

#AmazonPolly #ChatGPT

Just published another episode of WP A DAY.

WP A DAY is a podcast with the latest news, tricks and ideas about WordPress, made with AI, using #ChatGPT and #AmazonPolly

This week, the summary of three notable news, such as the discussion over the plugin lists on WordPress.com and WordPress.org.

#WordPress #WPADAY

https://social.blogpocket.com/wp-a-day-6-the-discussion-over-plugin-lists-on-wordpress-com-and-wordpress-org-and-more-news-and-more-news/

@wordpress @[email protected]

I made an Infinite Podcast Machine with #GPT3 #dalle2 and #AmazonPolly streaming on https://twitch.tv/abr8 all content and images are fully generative and it can go in any direction! will be live for the next few hours. #AIart #ChatGPT
abr8 - Twitch

Infinite Podcast Machine - an OpenAI experiment. All content and images are 100% generated by OpenAI with no guidance.

Twitch

Here's the code for my #Swift #SwiftUI based text to voice app that uses #AmazonAWS #AmazonPolly to create very good speech using AI. It's a bit rough, but hopefully you can see what's going on. Let me know if you have a play with it or if the instructions aren't clear.

https://github.com/brindy/Parakeet

GitHub - brindy/Parakeet: A SwiftUI macOS app that uses the AWS Swift SDK to execute Polly (text to voice).

A SwiftUI macOS app that uses the AWS Swift SDK to execute Polly (text to voice). - GitHub - brindy/Parakeet: A SwiftUI macOS app that uses the AWS Swift SDK to execute Polly (text to voice).

GitHub

El ciclo se ha completado 😂

#ChatGPT #amazonpolly #stablediffusion