Mastodawn

Whisper или GigaAM для русского ASR в продакшене: три ловушки бенчмарка, которые перевернут ваши выводы

Полгода назад мы публиковали статью про то, как получили 3.3% WER для русского ASR с GigaAM. Замеры шли на пяти TTS-фрагментах из аудиокниг, что подтверждало тезис «специализация бьёт универсальность». С тех пор мы перемерили обе модели на реальных продакшен-записях и попали в три ловушки бенчмарка. Первый замер показал «GigaAM впереди Whisper на 7 pp». На тех же данных, после небольшой чистки, обе модели идут вровень. А на самом шумном клипе с реверберацией Whisper уходит вперёд на 19 pp. Это всё на одном подкасте, с одними и теми же скриптами, одними и теми же моделями. Детали разбираем под катом. Протестировали 10 методов «улучшения» аудио (большинство сделали хуже), измерили RTF на RTX 4090 и сформулировали финальный выбор: GPU - до обученный Whisper-turbo, CPU - GigaAM v3-e2e-rnnt. И почему именно так.

https://habr.com/ru/articles/1042574/

#распознавание_речи #ASR #Whisper #GigaAM #WER #fasterwhisper #бенчмарк #finetuning #русский_ASR #оффлайнраспознавание

Whisper или GigaAM для русского ASR в продакшене: три ловушки бенчмарка, которые перевернут ваши выводы

Пару месяцев назад мы публиковали статью про то, как получили 3.3% WER для русского ASR на CPU с GigaAM - главный тезис тогда был «специализация бьёт универсальность». Замеры в той статье шли на пяти...

Хабр

Tino Eberl 3d ago

#Steady #Klimacrew

Wenn #Sprachmodelle plötzlich menschenverachtende Aussagen treffen oder gefährliche Tipps geben, läuft etwas gewaltig schief.

Eine aktuelle Untersuchung zeigt, wie schnell #Feintuning unerwartete Folgen haben kann – mit brisanten Konsequenzen für reale Anwendungen.

Das Phänomen wurde nur durch Zufall entdeckt.

https://tino-eberl.de/missbrauch-kuenstlicher-intelligenz/gefaehrliches-finetuning-ki-modelle-koennen-aus-dem-ruder-laufen/

#KI #Sprachmodelle #Finetuning #AIRisiko #LLM #AISecurity #KIMissbrauch #Retröt

Gefährliches Finetuning: KI-Modelle können aus dem Ruder laufen

Du denkst, KI ist berechenbar? Diese neue Studie zeigt, wie Finetuning Sprachmodelle plötzlich gefährlich machen kann.

Tino Eberl

Habr 6d ago

Ожидание: сделать ИИ-примерочную обоев за 2 дня. Реальность: пришлось добучать свою модель на SD

В условиях жесткой конкуренции на рынке отделочных материалов любому магазину жизненно необходимо хоть какое-то осязаемое преимущество. Стандартными каталогами и скидками уже никого не удивить. Так у нас родилась идея: сделать онлайн-примерочную обоев. Кажется, звучит как киллер-фича — дать клиенту возможность до покупки увидеть, как конкретный паттерн будет смотреться в его реальном интерьере. На тот момент на рынке вовсю хайповали генеративные модели (такие как «Nano Banana»). На первый взгляд казалось, что проблема решается в два клика. План был надежен, как швейцарские часы: получить API-ключ, отправить по эндпоинту фотографию интерьера и текстуру обоев, сопроводить это правильным промптом (с указанием учитывать перспективу, освещение и масштаб) и забирать готовый результат. Но на практике оказалось, что задача не просто нетривиальная. Она вскрыла целый пласт проблем, о которых создатели популярных генеративок предпочитают умалчивать.

https://habr.com/ru/articles/1039804/

#computer_vision #stable_diffusion #нейросети #finetuning #ecommerce #визуализация_интерьеров #chatgpt

Ожидание: сделать ИИ-примерочную обоев за 2 дня. Реальность: пришлось добучать свою модель на SD

Хабр

Chema Alonso

May 23

El lado del mal - Cómo optimizar el gasto en IA con arquitecturas clasificadas, orquestadas y/o destilación. El problema de la Predictibilidad de los Costes de la IA https://www.elladodelmal.com/2026/05/como-optimizar-el-gasto-en-ia-con.html #IA #AI #Costes #Presupuesto #Optimización #Prompting #ArquitecturaSW #Destilación #FineTuning #MachineLearning

Cómo optimizar el gasto en IA con arquitecturas clasificadas, orquestadas y/o destilación. El problema de la Predictibilidad de los Costes de la IA

Blog personal de Chema Alonso ( https://MyPublicInbox.com/ChemaAlonso ): Ciberseguridad, IA, Innovación, Tecnología, Cómics & Cosas Personasles.

Philo Sophies May 19

🌞#Sun – The billion-year energy scandal!⚡

The #Zoomposium with Prof. Dr. #ThomasNaumann and Dr. #IljaBohnet focuses precisely on such fundamental #unansweredquestions in #science:

📎https://philosophies.de/index.php/2022/10/26/zoomposium-naumann-bohnet-das-raetselhafte-universum/

📺https://youtu.be/k22eSYJgPD0

#NuclearFusion #Physics #Cosmology #Energy #Einstein #TheoryOfRelativity #OpenQuestions #DarkMatter #StringTheory #Metaphysics #DESY #PhilosophyOfScience #Universe #Cosmos #FineTuning #Research #Science #Philosophy #Astrophysics #NuclearPhysics

Fritzlabs Health Tech Ethics May 19

"Out of Tune: Fine-Tuning Foundation Models Leads to Unpredictable Safety Drift" Benign fine-tuning unpredictably shifts #AI safety. Small updates compromise safeguards regardless of model size. #AISafety #FineTuning https://cdt.org/insights/out-of-tune-fine-tuning-foundation-models-leads-to-unpredictable-safety-drift/

Out of Tune: Fine-Tuning Foundation Models Leads to Unpredictable Safety Drift

Conclusionpted-models" href="#revisiting-ai-governance-and-policy-for-adapted-models" class="toc-anchor">Revisiting AI Governance and Policy for Adapted Modelssafety-behavior-in-high-stakes-domains" href="#the-unpredictable-effects-of-model-modifications-on-safety-behavior-in-high-stakes-domains" class="toc-anchor">The unpredictable effects of model modifications on safety behavior in high-stakes domainsof-model-modification-for-ai-supply-chain-governance" href="#the-challenge-of-model-modification-for-ai-supply-chain-governance" class="toc-anchor">The Challenge of Model Modification for AI Supply Chain Governancef the Algorithmic Alignment Group at the Massachusetts Institute of Technology (MIT). [ Read full report ] General-purpose AI models are increasingly […]

Center for Democracy and Technology

Arint - SEO+KI May 19

RT @jun_song: One of my best friends from my US college days works as an AI engineer at Big Tech and is about to finish his PhD. I only got my bachelor's, came back to Korea, and worked in a completely different field: strategic planning. My job was planning new businesses and making factories and affiliates run efficiently. My only involvement with AI was building and implementing workflow automation when they asked for it. I was talking to my friend recently. He knows everything about his specific field, but he knew absolutely nothing about how local LLMs work or post-training. That made me realize something: AI has so many different subfields, and having a degree doesn’t mean you know everything. Curiosity for new things and the drive to learn them will be way more important than a degree going forward. And I’ve said this before, but I’m not posting this motivation to sell you a course. I will never do that. Set up a research multi-agent for the latest information and study new things. It will help you massively. If you can leverage your current domain knowledge to figure out which fields will be promising in the future, that’s the best scenario. Thanks for reading this long post. I genuinely want all my followers to succeed, and I hope this information was helpful. 송준 Jun Song (@jun_song) A year ago, I didn't care about fine-tuning or post-training at all. But when I thought about corporate security, it hit me: the demand for fine-tuning is going to be massive. I locked in for a few months. Using nothing but my MacBook, I fine-tuned the SuperGemma4 series entirely on my own, and it r…

mehr auf Arint.info

#agent #finetuning #Huggingface #nitter #opensource #things #US #arint_info

https://x.com/jun_song/status/2056591055064318143#m

Arint - SEO+KI (@[email protected])

RT @jun_song: One of my best friends from my US college days works as an AI engineer at Big Tech and is about to finish his PhD. I only got my bachelor's, came back to Korea, and worked in a completely different field: strategic planning. My job was planning new businesses and making factories and affiliates run efficiently. My only involvement with AI was building and implementing workflow automation when they asked for it. I was talking to my friend recently. He knows everything about his specific field, but he knew absolutely nothing about how local LLMs work or post-training. That made me realize something: AI has so many different subfields, and having a degree doesn’t mean you know everything. Curiosity for new things and the drive to learn them will be way more important than a degree going forward. And I’ve said this before, but I’m not posting this motivation to sell you a course. I will never do that. Set up a research multi-agent for the latest information and study new things. It will help you massively. If you can leverage your current domain knowledge to figure out which fields will be promising in the future, that’s the best scenario. Thanks for reading this long post. I genuinely want all my followers to succeed, and I hope this information was helpful. 송준 Jun Song (@jun_song) A year ago, I didn't care about fine-tuning or post-training at all. But when I thought about corporate security, it hit me: the demand for fine-tuning is going to be massive. I locked in for a few months. Using nothing but my MacBook, I fine-tuned the SuperGemma4 series entirely on my own, and it r… <a href="https://arint.info/@Arint/116600661987139175">mehr</a> auf <a href="https://arint.info/">Arint.info</a> #agent #finetuning #Huggingface #nitter #opensource #things #US #arint_info <a href="https://x.com/jun_song/status/2056591055064318143#m">https://x.com/jun_song/status/2056591055064318143#m</a>

Mastodon Glitch Edition

Show thread

Arte es Ética May 3

La leyenda urbana de «entrenar tu propio modelo de IAG» sigue siendo un anzuelo para monetizar tutoriales, cursos, masterclass y demás productos que los gurúes y promotores de la IA generativa usan para seguir lucrando a costa de todos los autores vulnerados. ¡No se dejen engañar!

#AI #MachineLearning #data #training #finetuning #AImodel #genAI #generativeAI #pretraining #Copyright #opensource

Andreas Becker May 3

Das Oxford Internet Institute zeigt: Empathisches Fine-Tuning von LLMs erhöht Fehlerquoten.

Modelle wie GPT-4o, Llama-70b und Qwen-32b liefern nach Warm-Persona-Tuning bis zu 30 Prozentpunkte häufiger falsche Fakten. Sie bestätigen fehlerhafte Nutzerannahmen, statt zu korrigieren. Kontrollgruppen mit kaltem Profil blieben stabil.

#LLM #FineTuning #OxfordInternetInstitute #Sycophancy #AIGeneratedImage

https://www.all-ai.de/news/news26top/sprachmodelle-freundlich-studie