Is there any website like Geekbench that shows tokens / second an LLM can generate, by machine, model, context, quantization etc?
@schlu @martinhoeller
Good question. I have seen https://kamilstanuch.github.io/LLM-token-generation-simulator, but not sure about accuracy.