Cộng đồng đang tìm kiếm các mô hình ngôn ngữ lớn (LLM) chuyên biệt cho tác vụ dịch thuật đa ngôn ngữ với tiêu chí: nhỏ, nhẹ và tốc độ cao để xử lý quy mô lớn (100-500 văn bản/phút).

Một số ứng cử viên sáng giá cho tác vụ này thường là dòng NLLB (No Language Left Behind) của Meta hoặc các mô hình chuyên biệt như Madlad-400, giúp tối ưu chi phí vận hành và hiệu suất so với các model tổng quát.

#AI #MachineLearning #Translation #LLM #DichThuat #CongNghe #LocalLLaMA #NLLB

https://www.reddit.com/

reddit

Reddit is a network of communities where people can dive into their interests, hobbies and passions. There's a community for whatever you're interested in on Reddit.

Reddit
for some reason the things that i hate about #libretranslate
is the models and it`s workflow, the base models are very bad especially for #persian.
last year i had tried to fix it and gave 15 hours of my time to train the model and I didn't see any progress.

now im thinking about try to write a libretranslate compatible translator based on #facebook #nllb.
#TIL that @wikipedia already uses the #NLLB model to automatically translate articles: https://www.mediawiki.org/wiki/Content_translation/Machine_Translation/MinT.
Content translation/Machine Translation/MinT - MediaWiki

MediaWiki

`OpenNMT/CTranslate2` の #Rust bindingsを開発中です。Rustから #meta#nllb を使って翻訳できます。

https://github.com/jkawamoto/ctranslate2-rs

GitHub - jkawamoto/ctranslate2-rs: Rust bindings for OpenNMT/CTranslate2

Rust bindings for OpenNMT/CTranslate2. Contribute to jkawamoto/ctranslate2-rs development by creating an account on GitHub.

GitHub
ما كنت لأرى نفسي أذكر عملاً لزكربيرق بخير، إلا أن القدر شاء أن يعمل فريق المطورين لشركة #ميتا (#meta) على نماذج #ذكاء_اصطناعي لغوية ومرئية مفتوحة المصدر، كانت سبباً مباشراً لظهور العديد من التقنيات والنماذج والخدمات المبنية عليها وهي نماذج:
- #ان_ال_ال_بي #NLLB
- #لاما #LLaMA
- #سيقمنت_انيثنق #SegmentAnything
وجميعها نماذج متقدمة تقنياً بالرغم من أن الذكاء الاصطناعي ليس توكيد ميتا الأساسي.

"#Tigrinya and the rest of African languages, and by extension the hundreds of millions of people that speak these languages, are an afterthought for them."

"When you compare #NLLB systems for African languages against those supported by Lesan (https://lesan.ai) or Ghana NLP (https://ghananlp.org), their systems have lower quality and are generally sub-optimal."

https://ghananlp.org

Lesan AI

Die Facebook-Mutter will Menschen mit automatischen Übersetzungen vernetzen und ins Metaversum locken. Ein Algorithmus soll dafür 200 Sprachen unterstützen.
Meta AI: KI liefert jetzt "hervorragende Übersetzungen" für 200 Sprachen
Meta AI: KI liefert jetzt "hervorragende Übersetzungen" für 200 Sprachen

Die Facebook-Mutter will Menschen mit automatischen Übersetzungen vernetzen und ins Metaversum locken. Ein Algorithmus soll dafür 200 Sprachen unterstützen.

heise online
» No Language Left Behind - #Meta AI https://t.co/ZPdv9jluGs // 維基百科的編輯現在可以透過維基媒體基金會(Wikimedia Foundation)的內容翻譯工具運用 NLLB-200 模型背後的技術,將資訊翻譯成他們的母語或慣用語言。
#NLLB
No Language Left Behind