And this is the crap I'm talking about, I find a model on HF, it says I should run it on #vllm but vLLM doesn't support the model type.... because I need to go setup a *custom* build of vLLM to run it.

"Production grade LLM"? Buuuuuushit.
I suspect this "production grade" determination was made by a BA turned Python developer.