Mastodawn

Can anyone recommend a privacy friendly SaaS #llm #inference provider? It needs to support *function calling* on at least one of the more recent #openweights models:

- gpt-oss
- Olma3
- Apertus? (I did not yet succeed using it)

There should be some level of cost control. Ideally a hourly rate limit. European solutions are preferred.

Use case is to have a fallback for demos or experiments where local inference is not practical. Monthly costs should go towards 0 when not used.

#selfhosting

Show thread

o lаvrоvsky Mar 9

@guesser Stay tuned: the next release of Apertus promises to improve performance here

Show thread

Roman Mar 9

@loleg interesting. Looking forward to it. So far olmo 3.1 is my favourite. Works quite well for example as q4_0 quantization.

Show thread

o lаvrоvsky

@guesser your question was about inference providers, there are a few listed here that support Olmo & Apertus https://apertvs.ai/pages/get-started/

Other popular European ones are listed (no affiliation to me) at https://llm-tracker.eu

APERTVS.ai

Fully Open Foundation Model for Sovereign AI