Microsoft removes guide on how to train LLMs on pirated Harry Potter books
The now-deleted Harry Potter data set was "mistakenly" marked public domain.
https://arstechnica.com/tech-policy/2026/02/microsoft-removes-guide-on-how-to-train-llms-on-pirated-harry-potter-books/?utm_brand=arstechnica&utm_social-type=owned&utm_source=mastodon&utm_medium=social
