Mozilla lays off 60 people, wants to build AI into Firefox

https://lemmy.today/post/6536791

Mozilla lays off 60 people, wants to build AI into Firefox - Lemmy Today

lol rip
What if you don’t have a decent graphics card? Wait 5 minutes for your URL completion to finish?
Using an LLM is quite fast, especially if it’s optimised to run on normal hardware
Decent models are huge; an average one requires 8GB to be kept in memory (better models requires something like 40 to 70 GB), and most currently available engines are extremely slow on a CPU and requires dedicated hardware (and even relatively powerful GPU requires a few seconds of “thinking” time). It is unlikely that these requirements will be easily squeezable in current computers, and more likely that dedicated hardware will be required.

I don’t think any inference engines have actually been optimised to run on CPUs. You’re stuck with 32-bit floats but OTOH that just means that you can do gigantic winograd transformations with the excess precision, needing far fewer fmuladds in total and CPUs are better at dealing with the memory access patterns that come with transforming the convolution. Most people have at least around 1TFLOP of compute in their CPU (e.g. a Ryzen 3600 has that much) that’s not ever seeing the light of day. About a fifth of what an RX 570 has, it’s a difference but not a magnitude and you can run SDXL with that kind of class of card (maybe not the 570 dunno about software support but a 5500 works, despite AMD’s best efforts to cripple rocm).

Also from what I gather they’re more or less doing summarybot for your browsing history, that’s not a ChatGPT or Llama-style giant model you can talk with.

Also to all those people complaining: There’s already AI in firefox, the translation models are about 17MB per language pair, gzipped.

firefox-translations-models/models/prod/deen at main · mozilla/firefox-translations-models

CPU-optimized Neural Machine Translation models for Firefox Translations - mozilla/firefox-translations-models

GitHub