An Open Training Set For AI Goes Global
https://fed.brid.gy/r/https://www.techdirt.com/2026/03/24/an-open-training-set-for-ai-goes-global/
Some additional information about the 21 March event at Jussieu, including the composition of the round table, can be found in this press release: https://www.sorbonne-universite.fr/actualites/seulement-02-de-donnees-francophones-dans-lia-wikimedia-france-et-sorbonne-universite
As many of the AI stories on Walled Culture attest, one of the most contentious areas in the latest stage of AI development concerns the sourcing of training data. To create high-quality large language models (LLMs), massive quantities of training data are required. In the current genAI stampede, many companies are simply scraping everything they can off the Internet. Quite how that will work […]
#aiAlliance #commonCorpus #curation #euAiAct #financeCommons #france #gdpr #github #legalCommons #llms #multilingual #openCulture #openGovernment #openScience #openSource #openWeb #pdf #permissiveLicensing #pleias #publicDomain #scraping #tokens #toxicity #wikimedia #youtube https://walledculture.org/common-corpus-an-open-training-set-for-ai-goes-global-and-so-should-support-for-it/
> Today, we are announcing #Amazon, #Meta, #Microsoft, #mistralai, and #Perplexity for the first time as they join our roster of partners, which includes #Google, #Ecosia, #Nomic, #Pleias, #ProRata, and #ReefMedia. All these organizations utilize #WikimediaEnterprise to integrate human-governed knowledge into their platforms at scale. By doing so, they help ensure that the work of our global volunteer community reaches billions of people with the accuracy and transparency that Wikipedia represents.
And that is good news for me.
https://enterprise.wikimedia.com/blog/wikipedia-25-enterprise-partners/

Amazon, Meta, Microsoft, Mistral AI, and Perplexity have officially joined the Wikimedia Enterprise ecosystem as we celebrate 25 years of Wikipedia. Discover how we provide the dedicated infrastructure to deliver human-governed knowledge to the world’s most influential platforms.