Mastodawn

David Gerard Dec 26, 2023

today's (unreleased, likely ~spring 2024) legal opinion article: AI models are likely uncopyrightable (no human hand) and too easy to reverse-engineer for trade secrets to work. So the VC billions are going into stuff they can't own. Which is the funniest possible result.

Jon Dec 27, 2023

@davidgerard Sorry, I think you are confusing downloadable models, like those on HuggingFace, where your argument has some value, with proprietary, non-downloadable models like GPT-4. Good luck trying to reverse-engineer GPT-4 by typing in prompts and recording responses. The models that are valuable intellectual property don't depend on copyright law to retain their value; they depend on secrecy and security, like the recipe for Coke.

g-clef Dec 27, 2023

@JonC @davidgerard Look for papers on “model leeching”. Copying a model by prompting it millions of times is a thing. It’s a fun question: if two models return the same answer for every prompt (or 90-plus percent of them), are they copies of each other? Even if their internal weights and vocabs are totally different?

Jon

@gclef @davidgerard There is a big difference between using a set of prompts and responses (à la https://arxiv.org/pdf/2309.10544.pdf) to train a student model and having a reverse-engineered copy of a model like GPT.
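
As a rough illustration of the distinction being argued in this thread, here is a toy sketch in Python. Everything in it is an assumption made for the example: `teacher`, `HIDDEN_THRESHOLD`, `build_student`, and `agreement` are hypothetical names, and a one-parameter classifier stands in for a proprietary model that can only be queried. It does not model GPT-4, any real API, or the method in the linked paper. It only shows that a student built purely from recorded query/response pairs can agree with the teacher on almost every fresh input while containing none of the teacher's internals.

```python
# Toy illustration only: the "teacher" stands in for a proprietary model
# reachable solely through queries; nothing here models a real LLM or API.

HIDDEN_THRESHOLD = 0.3702  # the teacher's secret parameter (hypothetical)


def teacher(x: float) -> int:
    """Black-box model: callers see the label, never the threshold."""
    return 1 if x > HIDDEN_THRESHOLD else 0


def build_student(queries):
    """Record the teacher's answers; the student is just a lookup table."""
    memory = [(q, teacher(q)) for q in queries]

    def student(x: float) -> int:
        # Answer with the label of the closest recorded query
        # (1-nearest-neighbour); no threshold is ever estimated or stored.
        _, label = min(memory, key=lambda pair: abs(pair[0] - x))
        return label

    return student


def agreement(model_a, model_b, inputs) -> float:
    """Fraction of inputs on which the two models give the same answer."""
    same = sum(model_a(x) == model_b(x) for x in inputs)
    return same / len(inputs)


queries = [i / 500 for i in range(501)]             # "prompts" sent to the teacher
held_out = [(i + 0.5) / 1000 for i in range(1000)]  # fresh inputs, never queried

student = build_student(queries)
rate = agreement(teacher, student, held_out)
print(f"agreement on held-out inputs: {rate:.3f}")
```

In this sketch the student matches the teacher almost everywhere, yet it holds only a table of recorded answers and never learns the hidden threshold. Whether that counts as a "copy" is exactly the question raised above; the toy does not settle it, and real models are vastly harder to imitate than a single threshold.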