Can anyone recommend a privacy friendly SaaS #llm #inference provider? It needs to support *function calling* on at least one of the more recent #openweights models:
- gpt-oss
- Olma3
- Apertus? (I did not yet succeed using it)
There should be some level of cost control. Ideally a hourly rate limit. European solutions are preferred.
Use case is to have a fallback for demos or experiments where local inference is not practical. Monthly costs should go towards 0 when not used.