https://www.kimi.com/blog/kimi-vendor-verifier
This column is a good survey of where we're at right now, written by someone working on actual open-source models (i.e., training data provided):
"The most successful open models will be complementary tools to closed agents. This is a path for open models to complement and accelerate the frontier of progress."
... "These models need to be almost brain-numbingly boring and specific. In a world dominated by coding agents, I want to build open models that Claude Code is desperate to use as a tool, letting its sub agents unlock entirely new areas of work. This is possible, but remarkably under-explored. Small models from the likes of Qwen and co. are still marketed on general-task benchmarks. The hype of “open models catching the frontier” distracts the world from this very large area of demand."
https://www.interconnects.ai/p/the-next-phase-of-open-models
Alibaba's flagship Qwen team sees a key departure right after the open-source launch of Qwen 3.5. The move raises questions about the future of agentic AI, AI workers, and enterprise-grade LLMs. What does this mean for open-source AI? Dive into the details. #QwenAI #AlibabaAI #OpenSourceLLM #EnterpriseAI
🔗 https://aidailypost.com/news/alibaba-sees-key-qwen-ai-staff-exit-after-qwen35-open-source-release
Alibaba just released the Qwen‑3.5‑Medium model as open‑source, delivering Sonnet 4.5‑level performance on a single GPU. It uses a Mixture‑of‑Experts architecture and a new “Thinking Mode” to boost AI inference efficiency while staying lightweight. Dive into the details and see how this could reshape open‑source LLM development. #Qwen3_5 #OpenSourceLLM #MixtureOfExperts #ModelEfficiency
🔗 https://aidailypost.com/news/alibaba-open-sources-qwen35-medium-models-sonnet-45-performance
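The post above credits the model's efficiency to a Mixture-of-Experts architecture. As a rough illustration of the general MoE idea only (toy sizes, random weights, not Qwen's actual design), a top-k routed layer scores all experts per token but runs just the few that win, which is what keeps inference lightweight relative to total parameter count:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes only; real models are vastly larger.
D_MODEL, N_EXPERTS, TOP_K = 8, 4, 2

# Router: one linear layer producing a score per expert for each token.
W_router = rng.standard_normal((D_MODEL, N_EXPERTS))
# Each "expert" stands in for a feed-forward sub-network.
experts = [rng.standard_normal((D_MODEL, D_MODEL)) for _ in range(N_EXPERTS)]

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def moe_layer(token):
    """Route a token to its top-k experts; only those experts run."""
    scores = token @ W_router
    top = np.argsort(scores)[-TOP_K:]   # indices of the k highest-scoring experts
    weights = softmax(scores[top])      # renormalize over the chosen experts
    # Weighted sum of the selected experts' outputs; the remaining
    # N_EXPERTS - TOP_K experts are skipped entirely (sparse compute).
    return sum(w * (token @ experts[i]) for w, i in zip(weights, top))

token = rng.standard_normal(D_MODEL)
out = moe_layer(token)
print(out.shape)  # (8,)
```

Only `TOP_K / N_EXPERTS` of the expert compute is spent per token, so parameter count can grow much faster than per-token FLOPs, which is the usual argument for MoE on constrained hardware.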
RE: https://hachyderm.io/@loleg/116041701680591995
Collected impressions of the #OpenSourceLLM summit on my personal blog https://log.alets.ch/121/
Running with Swiss perseverance if not quite precision today 😅 Lots of questions and an avalanche of content shared in the past 4 hours.
The #Qwen team can’t be very specific on the compute capabilities, but they seem ready for the road ahead:
• Hybrid architecture works
• Native multimodal pretraining + post-training
• Coding with a vision-capable LLM
• Tackling long-horizon agentic tasks